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24-pin Registered TTL/CMOS PAL Devices 











PRODUCT PINS TECHNOLOGY toy (ns) log (mA) DESCRIPTION PAGE 
= + an | 
AmPAL22XP 10 24 TTL 20, 30, 40 90, 180, 210 | XOR gate 5-271 
PAL20S10 24 TTL 35 240 Product term steering | 5-103 
AmPAL22P 10 24 TTL 15,25 105, 210 24-pin superset 5-291 
Am/PAL20L10 24 TTL 15, 20, 25, 30 105, 165, 210 10 outputs 5-113, 5-306 
PAL20L8 24 TTL, E CMOS 15, 25, 35, 45 0.1, 105, 210 Standard 5-122 
PAL6L16 24 TTL 25 90 Wide output 5-141 
PAL8L14 24 TTL 25 90 Wide output 5-141 
PAL12L10 24 TTL 40 100 Simple combinatorial | 5-147 
PAL14L8 24 TTL 40 100 Simple combinatorial | 5-147 
PAL16L6 24 TTL 40 100 Simple combinatorial ; 5-147 
PAL18L4 24 TTL 40 100 Simple combinatorial | 5-147 
PAL20L2 24 TTL 40 100 Simple combinatorial | 5-147 
PAL20C1 24 TTL 40 100 Simple combinatorial | 5-147 


























PRODUCT ee Lee aNeLos : fax (MHz) log (mA) DESCRIPTION PAGE 
AmPALC29MA16 24 EE CMOS 20, 15 120 Advanced Async. Macro| 5-209 
AmPALC29M16 24 EE CMOS 20, 15 120 Advanced Macrocell 5-231 
PAL32VX10 24 TTL 25, 22 180 J-K, varied terms 5-70 
Am/PAL22V10 24 TTL, E CMOS 40, 33, 28.5, 20, 18 90, 180 Versatile 5-79, 5-249 
PAL22RX8 24 TTL 28.5 210 J-K flip-flops 5-87 
PAL20RA10 24 TTL 20 200 Asynchronous 5-95 
AmPAL20XRP 10 24 TTL 30, 22, 14 105, 180,210 | XORgate & polarity | | 5-271 
AmPAL20XRP8 24 TTL 30, 22, 14 105, 180, 210 XOR gate & polarity 5-271 
AmPAL20XRP6 24 TTL 30, 22, 14 105, 180,210 | XOR gate & polarity 5-271 
AmPAL20XRP4 24 TTL 30, 22, 14 105, 180, 210 XOR gate & polarity 5-271 
PAL20RS10 24 TTL 20 240 Product term steering 5-103 
PAL20RS8 24 TTL 20 240 Product term steering | 5-103 
PAL20RS4 24 TTL 20 240 Product term steering | 5-103 
PAL20X10 24 TTL 22 180 XOR gate 5-113 
PAL20X8 24 TTL 22 180 XOR gate 5-113 
PAL20X4 24 TTL 22 180 XOR gate 5-113 
AmPAL20RP10 24 TTL 37, 25 105, 210 Programmable polarity | 5-291 
AmPAL20RP8 24 TTL 37,25 105, 210 Programmable polarity | 5-291 
AmPAL20RP6 24 TTL 37,25 105, 210 Programmable polarity | 5-291 
AmPAL20RP4 24 TTL 37,25 105, 210 Programmable polarity | 5-291 
PAL20R8 24 TTL, E CMOS 37, 25, 20, 15 0.1, 105, 210 Standard 5-122 
PAL20R6 24 TTL, E CMOS 37, 25, 20, 15 0.1, 105, 210 Standard 5-122 
PAL20R4 24 TTL, ECMOS 37, 25, 20, 15 0.1, 105, 210 Standard 5-122 
PAL32R16 40 TTL 16 280 MegaPAL™ device 5-158 
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In late 1987, the two programmable logic market leaders, Monolithic Memories and Advanced Micro 
Devices, merged into one great company. We combined our strong traditions of customer service 
and innovation to offer you the best line of programmable logic devices. The 1988 Data Book 
presents the technical specifications on the entire product line. The separate 1988 Handbook 
presents all of the necessary support material, whether you are accustomed to using MMI or 
AMD products. 


AMD/MMI has the products, manufacturing capacity and technology to perpetuate the leadership 
in the programmable logic field which we pioneered. In the 1988 Data Book you will notice that we 
not only have the broadest programmable logic line, but we also have the industry’s most 
comprehensive CMOS programmable logic line. We think that you will find the Data Book and 
Handbook informative and useful. If you have any comments or questions on the books or the 
product line, please contact us. 
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Cyrus Tsui 
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Description 


This 1988 PAL Device Handbook/Data Book is your complete guide to all programmable logic 
devices (PLDs) from Monolithic Memories and Advanced Micro Devices. The merger of the two 
companies provides a greater wealth of products and services for you. Note that all PLDs which 
were in production before the merger are still being produced. 


The PAL Device Handbook/Data Book is organized into two volumes and six easy-to-use sections: 


PAL Device Handbook 


Section 1: Introduction 
Includes an overview of the PLD product family. 


Section 2: Applications 
Includes detailed application examples. The first few chapters provide tutorials in PLD design. 
The application notes are grouped by application area. 


PAL Device Data Book 


Section 3: Programming and Quality 
Includes information on PLD software programs, programming information, PLD technology and 
quality discussions, and package information. 


Section 4: PALASM 2 Software User Documentation 
Includes complete documentation for PALASM 2 software. 


Section 5: Data Sheets 
Includes specifications for all PLDs from the combined company. PAL devices formerly from MMI 
are under “PAL Devices,” while PAL devices formerly from AMD are under “AmPAL Devices.” 


Section 6: Appendices 
Includes quick reference information. 


If you have any questions or comments on PLDs or any other products, please contact your most 
convenient AMD/MM! sales office, listed at the end of each book. 
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The Programmable Array Logic device, commonly known as the 
PAL® device, was invented at Monolithic Memories over 12 years 
ago. The concept for this revolutionary type of device sprang forth 
as a simple solution to the shortcomings of discrete TTL logic. 


The successfully proven PROM technology which allowed the 
end user to “write on silicon” provided the technological basis 
which made this kind of device not only possible, but very popular 
as well. 


The availability of design software made it much easier to design 
with programmable logic. As designers were freed from the 
drudgery of low-level implementation issues, new complex de- 
signs were easier to implement, and could be completed more 
quickly. 


This chapter outlines some basic information essential to those 
who are unfamiliar with Programmable Logic devices (PLDs). 
The information may also be useful to those who are current users 
of programmable logic. The specific issues which need to be 
addressed are: 


¢ What is a PLD? 

* What other implementations are possible? 

* What advantages do PLDs have over other 
implementations? 


What is a PLD? 


In general, a programmable logic device is a circuit which can be 
configured by the user to perform a logic function. Most “stan- 
dard” PLDs consist of an AND array followed by an OR array, 
either (or both) of which is programmable. Inputs are fed into the 
AND array, which performs the desired AND functions and 
generates product terms. The product terms are then fed into the 
OR array. In the OR array, the outputs of the various product 
terms are combined to produce the desired outputs. There are 
three fundamental types of standard PLD: the PROM, the PAL 
device, and the PLS device. 


PROMs 
PROMs are usually thought of as memory elements. However, 


the PROM has a fixed AND array (which decodes the memory 
address) followed by a programmable OR array (Figure 1). For 


each of a given set of input combinations (addresses), it gener- 
ates a value which has been programmed into the device. 






FIXED AND 
ARRAY 


& INDICATES PROGRAMMABLE CONNECTION 
+ INDICATES FIXED CONNECTION 
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Figure 1. PROM Array Structure 
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PAL Devices 


The PAL device has a programmable AND array followed by a 


fixed OR array (Figure 2). The fact that the AND array is - 


programmable makes it possible for the devices to have many 
inputs. The fact that the OR array is fixed makes the devices small 
(which means less expensive) and fast. 


PLS Devices 


The PLS (Programmable Logic-based sequencer) devices are 
based on the standard PLA architecture (Figure 3), where both 
the AND and the OR arrays are programmable. This arrange- 
ment allows for greater overall flexibility. The architecture is a bit 
more costly in terms of die size and speed, so for simple logic 
functions, a PAL device is usually more cost effective. However, 
this architecture is very effective for sequencers, where the 
flexibility allows larger state machines than might fit in a PAL 
device. 


Other PLDs 


In addition to the basic sum-of-products PLDs, some more 
complex PLDs dedicated to sequencing are available, most 
notably the PROSE™ device and the Am29PL141. Their archi- 
tectures are described elsewhere in the handbook, but their fun- 
damental benefits are the same as those of the more traditional 
PLDs. 
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In practice, the distinctions between architectures are not as 
significant as the differences between the types of functions to be 
performed. For this reason, this handbook is organized primarily 
into discussions about PAL devices and discussions about se- 
quencers, whether those sequencers be implemented in PAL 
devices, PLS devices, or one of the dedicated sequencers. 


What Other Implementations Are 
Possible? 


There are essentially four alternatives to programmable logic: 


¢ Discrete Logic 

° Gate Arrays 

¢ Standard Cell Circuits 
¢ Full Custom Circuits 


Discrete Logic 


Discrete logic, or conventional TTL logic, has the advantage of 
familiarity; hence its popularity. It is also quite inexpensive when 
only unit cost is considered. The drawback is that the implemen- 
tation of even a simple portion of a system may require many units 
of discrete logic. There are “hidden” costs associated with each 
unit that goes into a system, which can render the overall system 
more expensive. 











PROGRAMMABLE AND ARRAY 
- INDICATES PROGRAMMABLE CONNECTION 
-$- INDICATES FIXED CONNECTION 

Figure 2. PAL Device Array Structure 
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Designing with discrete chips can also be very tedious. Each 
design decision directly affects the layout of the board. Changes 
are difficultto make. The designis also more difficult to document, 
making it harder to debug and maintain later. These items all 
contribute to a long design cycle when discrete chips are used 
extensively. 


Gate Arrays 


Gate arrays have been increasing in popularity. The attractive- 
ness of this solution lies in the device's flexibility. By packing the 
functions into the device, a great majority of the available silicon 
is actually used. Since such a device is customized for an 
application, it would seem to be the optimum device for that 
application. 


However, one also pays substantial development costs, espe- 
cially in the case of a design which needs changes after silicon 
has already been processed. Even though the unit costs are 
generally quite low for gate arrays, the volumes required to make 
their use worthwhile excludes them as a solution for many 
designers. This fact, added to the long design cycle and high risk 
involved, make this solution practical for only a limited number of 
designers. 


Standard Cell Circuits 
Standard cell circuits are quite similar to gate arrays, their main 


advantage being that they consist of a collection of different parts 
of circuits which have already been debugged. These circuits are 
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then assembled and collected to perform the desired functions. 
This can ideally lead to reduced turnaround from conception to 
implementation, and a much more efficient circuit. 


The drawback is that even though the individual components of 
the circuit have been laid out, a complete layout must still be per- 
formed to arrange the cells. Instead of just customizing the metal 
interconnections, as is done in a gate array, the circuit must be 
developed from the bottom up. Development costs can be even 
higher than for gate arrays, and despite the standard cell concept, 
turnaround time often tends to be longerthan planned. Again, the 
volume must be sufficiently high to warrant the development 
costs. 


Full Custom Circuits 


Full custom designs require that a specific chip be designed from 
scratch to perform the needed functions. The intent is to provide 
a solution which gives the designer exactly what is needed for the 
application in question; no more and no less. Ideally, not a square 
micron of silicon is wasted. This normally results in the smallest 
piece of silicon possible to fit the needs of the design, which in turn 
reduces the system cost. Understandably, though, development 
costs and risks for such a design are extremely high, and volumes 
must be commensurately high in order for such a solution to be 
of value. 
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Figure 3. PLA Array Architecture 
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What Advantages do PLDs Have Over 
Other Implementations? 


As user-programmable semicustom circuits, PLDs provide a 
valuable compromise which combines many of the benefits of 
discrete logic with many of the benefits of other semicustom 
circuits. The overall advantages can be found in several areas: 


¢ Ease of design 
* Performance 
* Reliability 

¢ Cost savings 


Ease of Design 


The support tools available for use in designing with PLDs greatly 
simplify the design process by making the lower-level implemen- 
tation details transparent. In a matter of one cr two hours, a first 
time PLD user can learn to design with a PAL device, program it, 
and implement the design in a system. 


The design support tools consist of design software and a 
programmer. The design software is used in generating the 
design; the programmer is used to configure the device. The 
software provides the link between the higher-level design and 
the low-level programming details. 


Allof the available design software packages (of which Monolithic 
Memories’ PALASM® software is the most widely used) perform 
essentially the same tasks. The design is specified with relatively 
high-level constructs; the software takes the design and converts 
it into a form which the programmer uses to configure the PLD. 
Most software packages provide logic simulation, which allows 
one to debug the design before actually programming a device. 
The high-level design file also serves as documentation of the 
design. This documentation can be even easier to understand 
than traditional schematics. 


Depending on the capabilities desired, a device programmer can 
cost anywhere from under $1,000.00 to around $15,000.00 fora 
high-volume production programmer. Many PLD users do not 
find it necessary to purchase a programmer; it is often quite cost 
effective and convenient to have either the manufacturer or an 
outside distributor do the programming for them. For design and 
prototyping, though, itis very helpful to have a programmer; this 
allows one to implement designs immediately. 


The convenience of programmable logic lies in the ability to 
customize a standard, off-the-shelf product. PLDs can be found 
in stock to suit a wide range of speed and power requirements. 
The variety of architectures available also allows a choice of the 
proper functionality for the application at hand. Thus adesigncan 
be implemented using a standard device, with the end result 
essentially being a custom device. If a design change is needed, 
it is a simple matter to edit the original design and then program 
a new device, or, in the case of reprogrammable CMOS devices, 
erase and reprogram the old device. 


Board layout is vastly simplified with the use of programmable 
logic. PLDs offer great flexibility in the location of inputs and 
outputs on the device. Since larger functions are implemented 
inside the PLD, board layout can begin once the inputs and 


outputs are known. The details of what will actually be inside the 
PLD can be worked out independently of the layout. In many 
cases, any needed design changes can be taken care of entirely 
within the PLD, and will not affect the PC board. 


Performance 


Speedis one of the main reasons that designers use PAL devices. 
The TTL PAL devices presently on the market can provide equal 
or better performance than the fastest discrete logic available. 
ECL PAL devices extend the benefits of programmable logic to 
the even higher-speed realm of ECL logic. Today’s fastest PAL 
devices are being developed on the newest technologies to gain 
every extra nanosecond of performance. 


Performance cannot come strictly at the expense of power 
consumption. Since PLDs can be used to replace several 
discrete circuits, the power consumption of a PLD may well be 
less than that of the combined discrete devices. As more PLDs 
are developed in CMOS technology, the option for even lower 
power becomes available, including zero standby power devices 
for systems which can tolerate only minute standby power con- 
sumption. 


Reliability 


Reliability is an area of increasing concern. As systems get larger 
and more complex, the increase in the amount of circuitry tends 
to reduce the reliability of the system; there are “more things to go 
wrong”. Thus a solution which inherently reduces the number of 
chips in the system will contribute to higher reliability. Aprogram- 
mable logic approach can provide device quality levels up to 50 
parts per million (ppm), while also providing a more reliable 
solution due to the smaller number of devices required. 


With the reduction in units and board space, PC boards can be 
laid out less densely, which greatly improves the reliability of the 
board itself. This also reduces crosstalk and other potential 
sources of noise, making the operation of the system cleaner and 
more reliable. 


Cost 


For any design approach to be practical, it must be cost effective. 
Cost is almost always a factor in considering a new design or a 
design change. But the calculation of total system cost can be 
misleading if not all aspects are considered. Many of the costs 
can be elusive or difficult to measure. For example, it is difficult 
to quantify the cost of market share lost due to late product 
introduction. 


The greatest savings over a discrete design are derived from the 
fact that a single PLD can replace several discrete chips. Board 
space requirements can drop by 25% or more when PLDs are 
used. Figure 4 illustrates some of the costs of the various 
solutions discussed so far, with many of the factors that may not 
always be considered included for comparison. These involve 
such items as inventory costs, inspection costs, test costs, board 
materials costs, and of course the very costly time spent design- 
ing and debugging such systems, and isolating and replacing 
units which fail. With each design change, the cost of a custom 
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Unit NREs/ Purchase Design Gate Count | Engineering Design Cost 
Cost Development Costs Volume Flexibility per Device Design Turnaround of Each 
Range For Best Time Time Design Change 
Economics 
PLDs: BLANK Low-Med None 5K-200K 1/2 Week Short Very Low 
(1-10 Days) 
PLDs: FACTORY Low-—Med Low 10K-200K 3-10 Moderate Med 
PROGRAMMED Weeks (8-10 Wks.) 
DISCRETE LOGIC Low-Med None 1K-10K 1 Week Short 
(1-10 Days) 
GATE ARRAYS Low Med-High 10K-200K Med-High 1K-10K 12-40 Long High 
Weeks (3-9 Mos.) 
STANDARD CELLS Low High 100K-300K Med-High 1K-10K 26-52 Long High 
Weeks (6-12 Mos.) 
FULL CUSTOM Very Low High 200K and Up High 1K-60K 1-2 Yrs. Long High 
(6-12 Mos.) 
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Figure 4. Cost and Design-time Comparisons 
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Introduction 


solution rises dramatically, while that of a user-customizable 
approach is minimal. The relationship between the various 
alternatives is summarized in Figure 5. 


Another economic benefit of the use of PLDs is that when one PAL 
device is used in several different designs, as is often the case, 
the user has not committed that device to any one of the particular 
designs until the device has been programmed. This means that 
inventory can be stocked for several different designs in the form 
of one device. As requirements change, the parts can be 
programmed to fit the need. And in the case of reprogrammable 
CMOS devices, one is not committed even after programming. 


There is also a cost-effective PLD solution for high-volume 
production. Just as a ROMis aPROM which has been hard-wired 
for mass production, HAL® (Hard Array Logic) devices can be 
produced in high volumes for extra cost savings. This mask- 
programmed version can be produced in high volume with 
unparalleled quality. In addition, in the event that production 
quantities for your system show an unexpected increase, the 
equivalent PAL or ProPAL™ devices (in-factory programmed 
PAL. devices) can be quickly obtained and programmed in your 
factory, by Monolithic Memories, or through distribution to cover 
the temporary shortage. More details on HAL and ProPAL 
devices are provided on page 3-104 of the Data Book. 


One final subtle cost issue is derived from the ease with which a 
competitorcan copy adesign. PLDs have a unique feature called 
a security fuse, whose purpose is to protect a design from being 
copied, By using secured PLDs extensively in a system, one can 
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PROGRAMMABLE 
LoGic 







STANDARD 
PRODUCTS 


WEEKS 
DEVELOPMENT TIME 


safely avoid having one’s system easily deciphered. The added 
design security provided by this feature can buy extra market 
time, forcing competitors to do their own original design work 
rather than copying the designs of others. 


Summary 


Programmable logic provides the means of creating semi-custom 
designs with readily available standard components. There is a 
wide variety of PLDs; PAL devices are most widely used, and 
perform well for basic logic and some sequencing functions. 
Other dedicated sequencers provide the circuitry required to 
implement more complex designs. 


By assuming some of the attributes of gate arrays, programmable 
logic provides the cost savings of any other semicustom device, 
without the extra engineering costs, risks, and design delays. 
Reliability is also enhanced as quality increases and board 
complexity decreases. 


The design tasks are greatly simplified due to the design tools 
which are now available. Design software and device program- 
mers allow top-down high-level designs, with a minimum of time 
spent on actual implementation issues. Simulation allows some 
design debug before a device is programmed. 


For all of these reasons, programmable logic has become, and 
will continue to be, the design methodology of choice among 
digital systems designers. 


STANDARD 
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Figure 5. Development Cost vs Time for Alternative Logic 
Implementations 
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Introduction 


Monolithic Memories and Advanced Micro Devices offer a wide 
variety of PLDs, implemented in a variety of technologies. In this 
section, we will briefly discuss the device families, and look at the 
various architecture, speed, and power options. More specific 
device information can be found in the individual data sheets in 
Section 5. Discussions on some of the special architectural fea- 
tures of many of the devices can also be found in their respective 
datasheets. 


There are three basic PLD areas addressed by Monolithic 
Memories’ devices: 


¢ PAL devices—general purpose 
* Programmable sequencers—state machines 
¢ LCA™ devices—high density 


The largest application area is that covered by the PAL devices. 
There is a wide variety of PAL devices, ranging from simple 
devices that address general logic design problems to more 
sophisticated devices that deal with more complex problems. 


There is also a series of sequencers which are not PAL devices, 
featuring architectures particularly well suited to sequencing 
operations. While there are PAL devices that work well as state 
machines in addition to their other applications, these dedicated 
sequencers have given up some of their generality to provide 
optimal state machine solutions. 


The final area covered is that of high-density design, addressed 
by the LCA devices. The LCA device takes the approach of a 
programmable gate array to provide a PLD with many usable 
gates. 


Design software radically simplifies the design of any circuit ina 
PLD. PALASM software is used for allof the PLDs except the LCA 
devices. The LCAdevices make use of XACT™ software for logic 
configuration. The PALASM software documentation is in 
Section 4; XACT software is discussed more fully on 
page 3-17 of the Data Book. 


PAL Devices 


PAL devices are available in three different technologies: 


¢ TTL 
* CMOS 
* ECL 


The CMOS devices often provide the same functions as the TTL 
devices, and can be used in the same sockets, with the added 
benefit of lower power. Thus the TTL and CMOS devices will be 
discussed together, followed by a summary of specific CMOS 
issues. ECL devices require completely different design consid- 
erations, andcannotbe interchanged with TTLor CMOS devices. 
They will therefore be discussed separately. 


PAL devices generally have a mnemonic naming scheme, which 
provides some basic information about the devices’ capabilities. 


Because Monolithic Memories and Advanced Micro Devices 
recently merged, there are presently two slightly different naming 
conventions in use. Some devices were originally Monolithic 
Memories’ devices; some were Advanced Micro Devices’; some 
devices were produced by both companies before the merger. In 
order to minimize the confusion after the merger, the old device 
names are being maintained. The two naming conventions willbe 
discussed separately. Throughout this handbook, “AmPAL” 
designates products originally from Advanced Micro Devices. 


Note that all products that Monolithic Memories and Advanced 
Micro Devices were producing before the merger are still in 
production. 


Monolithic Memories Nomenclature 


Monolithic Memories originally set the PAL device naming stan- 
dard. An example of an older device name is shown in Figure 1. 


As devices have become faster, and as CMOS has become 
available, it has become necessary to change the way we 
describe the performance of newer devices. While the conven- 
tion is slightly different, it does make it easier to see exactly what 
the speed grade and power level of a device are. An example of 
such a new device is shown in Figure 2. 


Advanced Micro Devices 
Nomenclature 


The Advanced Micro Devices nomenclature follows the Mono- 
lithic Memories nomenclature with a few modifications. Atypical 
older device name is shown in Figure 3. 


Advanced Micro Devices also found the need to modify the 
nomenciature with the advent of faster devices and CMOS 
devices. An example of a newer device is shown in Figure 4. 


The relationship between a device and its name will become 
apparent in the Tables following. 


The package designators also differ. Across reference for com- 
mercial packages is shown in Table 1. 














MONOLITHIC; ADVANCED 
MEMORIES | MICRO DEVICES 





PACKAGE 


Plastic DIP 
Plastic SKINNYDIP® 
Ceramic DIP 

Ceramic SKINNYDIP 
PLCC-20-pin 

PLCC from 24-pin DIP, 

JEDEC 

PLCC from 24-pin DIP 
PLCC from 28-pin DIP 
LCC 





























Table 1. Package Designators 
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PAL 16 


PAL | 


NUMBER OF INPUTS 


ARCHITECTURE 
Notes: 


ECL devices also have either ‘10H’ or '100’ following ‘PAL’ to 
indicate the ECL family compatibility. 


The number of inputs includes feedback, if feedback exists. 
The architecture code varies. It is usually mnemonic, and the 


meanings of each code will be listed in each of the following 
sections. 


8 A -2 


a POWER LEVEL 


SPEED GRADE 


NUMBER OF OUTPUTS 
(OR FLIP-FLOPS) aaah 


The speed grade indicates the propagation delay. In general, the 
speeds are as follows: 


blank: 35 ns (Note that ~2 and —4 devices are slower) 
A: 25 ns 
B: 15 ns 
D: 10 ns 


The power level can be one of three designators: 
blank: ‘full power’, 180-240 mA 


-2: ‘half power’, 90-105 mA 
—4: ‘quarter power’, 45-55 mA 


Figure 1. Monolithic Memories Nomenclature for Older Products 


PAL C 4 
PAL | 
TECHNOLOGY 


NUMBER OF INPUTS 


ARCHITECTURE 


Notes: 
The technology designator can be one of four codes: 
blank = TTL 
C =CMOS 
10H = 10KH ECL 
100 = 100K ECL 


The number of inputs includes feedback, if feedback exists. 


6 tL 8 Q -25 
L SPEED GRADE 


POWER LEVEL 


NUMBER OF OUTPUTS 
(OR FLIP-FLOPS) mee 


The architecture code varies. It is usually mnemonic, and the 
meanings of each code will be listed in each of the following 
sections. 


The power level can be one of four designators: 


blank: ‘full power’, 180-240 mA 
H: ‘half power’, 90-105 mA 
Q: ‘quarter power’, 45-55 mA 
Z: ‘zero power’, <0.1 mA standby current 


Figure 2. New Monolithic Memories Nomenclature 
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AmPAL 16 L 8 A 


L 
AmPAL tf L POWER LEVEL 


NUMBER OF INPUTS SPEED GRADE 
ARCHITECTURE NUMBER OF OUTPUTS 
402 03 (OR FLIP-FLOPS) 


The speed grade indicates the propagation delay. In general, the 





Notes: speeds are as follows: 
The number of inputs includes feedback, if feedback exists. blank: 35 ns 
; . , . A: 25 ns 
The architecture code varies. It is usually mnemonic, and the B: 15ns 
meanings of each code will be listed in each of the following D: 10 ns 
sections. 
The power level can be one of three designators: 
blank: ‘full power’, 180-240 mA 
L: ‘half power’, 90-105 mA 
Q: ‘quarter power’, 45-55 mA 
Figure 3. Advanced Micro Devices Nomenclature for Older Products 
AmPAL .C| 22 XP 10 <=30 L 
AmPAL ik POWER LEVEL 
TECHNOLOGY 
SPEED GRADE 
NUMBER OF INPUTS 
NUMBER OF OUTPUTS 
402 04 ARCHITECTURE (OR FLIP-FLOPS) 
Notes: The speed grade indicates the propagation delay in nano- 


seconds. 
The technology designator can be one of two codes: 
The power level can be one of three designators: 
blank = TTL 
C = CMOS blank: ‘full power’, 180-240 mA 
L: ‘half power’, 90-105 mA 
The number of inputs includes feedback, if feedback exists. 


The architecture code varies. It is usually mnemonic, and the 
meanings of each code will be listed in each of the following 
sections. 


Figure 4. New Advanced Micro Devices Nomenclature 
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AMD/MMI Programmable Logic Product 
Cross Reference Guide 


Asa merged company, the combined PLD offerings of AMD/MMI 
provide the PLD user the broadest line of PLDs available from one 
supplier. The combined product offering also contains a number 
of devices for which each of the original separate companies had 
equivalent devices. 


This is a guide to the devices that are equivalent and can be 
directly substituted (with exceptions as noted in the comments 
section). 


These are the only products which are direct substitutes for each 
other, or which we will be encouraging customers to use as 
alternate parts. 











TTL and CMOS PAL Devices 


TTL PAL devices are used more widely by far than devices made 
with any other technology. They provide very high speed without 
the design requirements of ECL. CMOS devices provide the 
benefits of reduced power consumption and erasability. Each 
CMOS device is either TTL compatible, or has a TTL-compatible 
version. 


The available devices can be generally placed into four functional 
groups: 


¢ Simple combinatorial 
¢ Simple registered 

* Sequencer 
¢ Asynchronous 






































































AMD 
COMMENTS 
Part Number Speed/Power Part Number | Speed/Power 
AmPAL16R8D 10/180 PAL16R8D 10/180 
AmPAL16R6D 10/180 PAL16R6D 10/180 
AmPAL16R4D 10/180 PAL16R4D 10/180 
AmPAL16L8D 10/180 PAL16L8D 10/180 
AmPAL16R8B 15/180 PAL16R8B 15/180 
AmPAL16R6B 15/180 PAL16R6B 15/180 
AmPAL16R4B 15/180 PAL16R4B 15/180 
AmPAL16L8B 15/180 PAL16L8B 15/180 
AmPAL16R8AL PAL16R8B-2 
AmPAL16R6AL PAL16R6B-2 
AmPAL16R4AL PAL16R4B-2 
AmPAL16L8AL PAL16L8B-2 
















































































































AmPAL16R8A 25/180 PAL16R8A 25/180 AMD—20-ns t,, 
AmPAL16R6A 25/180 PALI6R6A 25/180 AMD—20-ns t,,, 
AmPAL16R4A 25/180 PAL16R4A 25/180 AMD—20-ns t,,, 
AmPAL16L8A 25/155 PAL16L8A 25/180 
AmPAL16R8Q PAL16R8B-4 35/55 
AmPAL16R6Q PAL16R6B-4 35/55 
AmPAL16R4Q PAL16R4B-4 35/55 
AmPAL16L8Q PAL16L8B-4 35/55 
AmPAL16R8L 35/90 PAL16R8A-2 35/90 
AmPAL16R6L 35/90 PAL16R6A-2 35/90 
AmPAL16R4L 35/90 PALI6R4A-2 35/90 
AmPAL16L8L 35/80 PAL16L8A-2 35/90 
AmPAL16R8 35/180 PAL16R8A-2 35/90 Note that the MMI part requires half the power 
AmPAL16R6 35/180 PAL16R6A-2 35/90 (90 mA I,,) of the AMD part (180 mA) 
AmPAL16R4 35/180 PAL16R4A-2 35/90 
AmPAL16L8 35/180 PAL16L8A-2 35/90 
AmPAL18P8L PAL10H8 The 18P8L can replace any of the MMI parts, 
AmPAL18P8L PAL12H6 but the reverse is not necessarily true. 
AmPAL18P8L PAL14H4 
AmPAL18P8L PAL16H2 
AmPAL18P8L PAL10L8 
AmPAL18P8L PAL12L6 
AmPAL18P8L PAL14L4 
AmPAL18P8L PAL16L2 
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Simple Combinatorial Devices 


These PAL devices provide simple combinatorial functions 
quickly and efficiently.. There is a wide selection of devices, 
allowing the user to choose the right device for the given 
application. 


There are five basic characteristics that differentiate these 
devices: 


* The number of inputs 

* The number of outputs 

* The number of product terms per output 
* The speed: propagation delay 

* Power consumption 


Several of the devices have internal feedback, which routes 
output pins back as inputs. This is generally coupled with 
programmable three-state capability, so that the output pins can 
be used as outputs, inputs, or bidirectional pins. 


Most of the devices have fixed output polarity. The AMPAL18P8, 
AmPAL22P10, PAL16P8, PAL20S10, and AmPAL22XP10 all 
have programmable output polarity. 


The PAL20S10 has a feature called “product term steering”, 
which allows product terms tobe allocated between outputs. This 
is described more fully on page 5-103 of the Data Book. 

















PAL10H8 
PAL12H6 
PAL14H4 
PAL16H2 
PAL10L8 
PAL12L6 
PAL14L4 
PALi6L2 
PAL16C1 









PAL16L8D 
AmPAL16L8D 
PAL16L8B 
AmPAL16L8B 
PALC16L8Z-25 
PALC16L8Q-25 
PAL16L8B-2 
AmPAL16L8AL 
PAL16L8A 
AmPAL16L8A 
PAL16L8B-4 
AmPAL16L8Q 
PAL16L8A-2 
AmPAL16L8L 
AmPAL16L8 
PAL16L8A-4 


PRODUCT 
DEVICE NAME INPUTS OUTPUTS TERMS/OUTPUT 


Se ee ee 


The AmPAL22XP10 has built-in exclusive-OR gates on each 
output. 


In addition to the devices that are purely combinatorial, there ara 
also registered devices with flip-flops that can be bypassed to 
give a combinatorial device. Although these devices are not 
listed in Table 2, they can be used as combinatorial devices. The 
best example is the PAL22V10, which is widely used as a com- 
binatorial device. 


Asummary of the architecture codes for these types of devices 
is shown in Table 2. 


The combinatorial devices are listed in Table 3, with their logic, 
speed, and power characteristics. 


[cone] venue |= ExaLE 


Active-HIGH outputs PAL10H8 
Active-LOW outputs PAL16L8 
Programmable output polarity PAL16P8 
PAL16C1 
AmPAL22XP 10 


Complementary outputs 
Exclusive-OR gate, prog. 
polarity 

Product term steering 





PAL20S10 


Table 2. Combinatorial Architectures 


ss heed 
DATA SHEET 
PAGE NO. 







SPEED 
(t,, in ns) 

























Table 3. Simple Combinatorial PAL Devices 
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PRODUCT SPEED lee DATA SHEET 
TERMS/OUTPUT | (t,, Inns) PAGE NO. 


AmPAL18P8B 
AmPAL18P8AL 
AmPAL18P8A 
AmPAL18P8Q 
AmPAL18P8L 


PAL12L10 
PAL14L8 
PAL16L6 
PAL18L4 
PAL20L2 
PAL20C1 











PAL20L8B 
PAL20L8B-2 
PAL20L8A 
PALC20L8Z-35 
PAL20L8A-2 
PALC20L8Z-45 


AmPAL20L10B 

AmPAL20L10-20 

AmPAL20L10AL 
PAL20L10A 


10 


AmPAL22P10B 22* 8 
AmPAL22P10AL 
AmPAL22P10A 


AmPAL22XP 10-20 2/6t 
AmPAL22XP10-30L 
AmPAL22XP 10-30 

AmPAL22XP10-40L 











* Includes feedback t+ Has an exclusive-OR gate 
** Depending on polarity tt Product term steering 


Table 3. Simple Combinatorial PAL Devices (Continued) 
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Simple Registered Devices 


Abasic registered device is generally equivalent to a combinato- 
rialdevice with registers on the outputs for signalsynchronization. 
All such devices have feedback from the flip-flops back into the 
array as inputs, so that they can also be used for very fast small 
state machines. The differentiating characteristics of these de- 
vices are: 


* The number of inputs 

* The number of outputs 

* The number of flip-flops 

* The number of product terms per output 
* The speed: maximum clock frequency 
* Power consumption 


Most of these devices have dedicated clock and output enable 
pins. On devices such as the PAL16R4, which have some 
dedicated combinatorial outputs, only the registered outputs are 
controlled by the enable pin; the combinatorial outputs have 
programmable three-state functions. 


The PAL22V10, AmPAL22RP10 family, and AmPAL22XRP10 
family have programmable three-state outputs. With these fami- 
lies, the clock pin may also be used as a logic input if desired. 


The PAL16X4 has extra bit-pair decoding circuitry which makes 
it especially well suited to arithmetic operations. This is discussed 
more fully on page 2-53 in the combinatorial logic section. 






| altered 6R8D 
AmPAL16R8D 
PAL16R8B 
AmPAL16R8B 
PALC16R8Z-25 
PALC16R8Q-25 
PAL16R8B-2 
AmPALI6R8AL 
PAL16R8A 
AmPAL16R8A 
PAL16R8B-4 
AmPAL16R8Q 
PAL16R8A-2 
AmPAL16R8L 
AmPAL16R8 
PAL16R8A-4 





PAL16R6D 
AmPAL16R6D 
PAL16R6B 
AmPAL16R6B 
PALC16R6Z-25 
PALC16R6Q-25 
PAL16R6B-2 
AmPAL16R6AL 
PAL16R6A 


The simpler devices all have active-LOW outputs. The 
PALI6RP8 family, PAL22Vi0, AmPAL22RP10 family, 
AmPAL22XRP10 family, PAL20RS10 family, and the PAL32R16 
have programmable polarity. 


The PAL20RS10 family and PAL32R16 have product term steer- 
ing, as was discussed above for the PAL20S10. 


The outputs of the PAL32R16 can also be programmed to ba 
combinatorial in banks of eight. In addition, the PAL82R16 has 
multiple clock, enable, and TTL-level preload pins. 


The PAL22V10 has two programmable initialization product 
terms: a synchronous preset term and an asynchronous reset 
term. These terms control all flip-flops. 


A summary of the architecture codes for these types of devices 
is shown in Table 4. The devices are summarized in Table 5. 

















TR ~—sds~RRegistered outputs tis” outputs T  PALIERS 6R8 
Exclusive-OR gates PAL16X4 
Registered with prog. polarity PAL16RP8 
Registered with term steering PAL20RS10 
Versatile AmPAL22V10 





















Table 4. Registered Architectures 















Table 5. Simple Registered PAL Devices 
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SPEED | STANDBY 
PRODUCT (Fy. lee DATA SHEET 
Feeeene, | NAME INPUTS _| OUTPUTS |FLIP-FLOPS|TERMS/OUTPUT | In MHz) | (mA) PAGE NO. 
AmPAL16R6A : 
PAL16R6B-4 . 
AmPAL16R6Q . 
PAL16R6A-2 . 
AmPAL16R6L . 
AmPAL16R6 . 
PAL16R6A-4 . 
PAL16R4D . 
AmPAL16R4D 
PAL16R4B 
AmPAL16R4B 
PALC16R4Z-25 
PALC16R4Q-25 
PAL16R4B-2 
AmPAL16R4AL 
PALI6R4A 
AmPAL16R4A 
PAL16R4B-4 
AmPAL16R4Q 
PALI6R4A-2 
AmPAL16R4L 
AmPAL16R4 
PAL16R4A-4 























PALI6RP8A 16° 
PAL16RP6A 16° 
PALI6RP4A 16° 


PAL20R8B 
PAL20R8B-2 
PAL20R8A 
PALC20R8Z-35 
PAL20R8A-2 
PALC20R8Z-45 


PAL20R6B 
PAL20R6B-2 
PAL20R6A 
PALC20R6Z-35 
PAL20R6A-2 
PALC20R6Z-45 





PAL20R4B 
PAL20R4B-2 
PAL20R4A 
PALC20R4Z-35 
PAL20R4A-2 
PALC20R4Z-45 





Table 5. Simple Registered PAL Devices (Continued) 
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PRODUCT 
DEVICE NAME | INPUTS | | OUTPUTS | | FLIP-FLOPS] FLOPS| TERMS/OUTPUT 


PAL20RS10 
PAL20RS8 
PAL20RS4 


AmPAL22V10-15 
PALC22V10H-25 

AmPAL22V10A 
PALC22V10H-35 

AmPAL22V10 


AmPAL20RP10B 
AmPAL20RP10AL 
AmPAL20RP10A 


AmPAL20RP8B 
AmPAL20RP8AL 
AmPAL20RP8A 


AmPAL20RP6B 
AmPAL20RP6AL 
AmPAL20RP6A 


AmPAL20RP4B 
AmPAL20RP4AL 
AmPAL20RP4A 


PAL32R16 


* Includes feedback 
** With polarity fuse intact 


Product Overview 


20° 
20° 
20° 


22* 


= 


— — par ener 
oO oO Oo (>) 
_ 


=e 
Oo 


= 
oO 





tt Product term steering 
§ Flip-flops can be bypassed 





DATA SHEET 
PAGE NO. 








5-103 
5-103 
5-103 











+ Has an exclusive-OR gate 


§§ Has varied product term distribution 


Table 5. Simple Registered PAL Devices (Continued) 


PAL Devices as Sequencers 


Several registered PAL devices have added features that make 
them particularly well suited to state machine applications. 


The basic differentiating aspects of these devices are: 


The number of inputs 

The number of outputs 

The number of flip-flops 

The type of flip-flop possible (D, T, J-K, S-R, latch, 
and/or buried) 

The number of product terms per output 

The speed: maximum clock frequency 

Power consumption 


The PAL20X10 series, AmPAL20XRP10 series, PAL22RX8, and 
PAL32VX10 all have an exclusive-OR gate that makes it possible 
to design with D, T, J-K, or S-R flip-flops. Since the PAL20X10 has 
fewer product terms, it is best for counter-like applications. The 
AmPAL20XRP10 series, PAL22RX8 and PAL32VX10 have more 
product terms, for much more sophisticated state machines. 


The AmPAL23S8 has six extra flip-flops that are permanently 
buried in addition to eight output flip-flops. The PALS2VX10 and 
AmPALC29M16 also have flip-flops that canbe buried if they are 
not needed at the outputs. An output pin canbe used as an input 
if its flip-flop is buried. All of these devices are ideal for applica- 
tions incorporating internal state machines as part of the overall 
design. 


The PAL22RX8, PAL32VX10, and AmPAL23S8 have global 
preset and reset control, making system initialization much 
easier. 


The AmPALC29M16is an electrically erasable device that has an 
unusually flexible architecture. Almost all pins can be used as 
inputs or outputs; both inputs and outputs can be registered, 
latched, or combinatorial. There are two pins that can be used as 
clocks. Global preset and reset functions are also provided. 
Several of the flip-flops can be buried to provide internal state 
machines. This device is HC and HCT compatible. 


A summary of the architecture codes for these types of devices 
is shown in Table 6. 


The PAL devices best suited to state machine applications are 
listed in Table 7. 
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DEVICE NAME 


PAL20X10A 
PAL20X8A 
PAL20X4A 


AmPAL20XRP10-20 22* 
AmPAL20XRP10-30L 
AmPAL20XRP 10-30 

AmPAL20XRP10-40L 


AmPAL20XRP8-20 22* 
AmPAL20XRP8-30L 
AmPAL20XRP8-30 

AmPAL20XRP8-40L 





AmPAL20XRP6-20 22* 
AmPAL20XRP6-30L 
AmPAL20XRP6-30 

AmPAL20XRP6-40L 





AmPAL20XRP4-20 22" 
AmPAL20XRP4-30L 


AmPAL20XRP4-30 
AmPAL20XRP4-40L 





PAL22RX8A aa 
AmPAL23S8-20 23* 
AmPAL23S8-25 


AmPALC29M16-35 29° 
AmPALC29M16-45 


PAL32VX10A 
PAL32VX10 


* Includes feedback 
t Has an exclusive-OR gate 








FLIP-FLOP PRODUCT 
OUTPUTS | FLIP-FLOPS; TYPES | TERMS/OUTPUT 
D,T,JK,SR 2/2t 
D,TJK,SR 2/2t 
D,TJK,SR 2/2t 
10 - D.WKSR| —-26t 
Ee _ - 
D,T,JK,SR 






Exclusive-OR gates PAL20X10 
Exclusive-OR gates, prog. polarity |; AmPAL20XRP10 
Registered, exclusive-OR gates PAL22RX8 
Sequencer AmPAL23S8 
Advanced macrocell AmPALC29M16 
Varied term distribution, XOR gate | PAL32VX10 


















Table 6. State Machine Architectures 








apa le | DATASHEET 
in MHz) (mA) | PAGE NO. 





eS | 

















Mi 
D,TJK,SR 


D,B,LO B-16§§ 
D,TuK,SR, 1/8-16t 
BO 


§§ Has varied product term distribution 


D,BO 6-128§ 
ey 5 


22.2 180 
22.2 180 
22.2 180 










120 
120 


180 
| | 2 180 


0 Be=flip-flops are or can be buried; L=latched outputs possible 


§ Some flip-flops can be bypassed 


Table 7. State Machine PAL Devices 





5-113 
5-113 
5-113 


5-286 
5-286 
5-286 
5-286 





5-286 
5-286 
5-286 
5-286 


5-286 
5-286 
5-286 


5-286 





200 5-169 
200 5-169 


5-231 
5-231, 


5-70 
5-70 
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Asynchronous PAL Devices 


Several PAL devices have been designed for applications that 
require asynchronous control of flip-flops. The simplest devices 
are the PAL2ORA10 and the PAL16RA8. These devices essen- 
tially consist of several flip-flops driven by a programmable AND 
array and a fixed OR array. Thus the D-type flip-flops have 
individually programmable clock, set, and reset lines. They also 
have programmable polarity, and the flip-flops can be bypassed 
to give combinatorial outputs. 


These devices are also useful for replacing random discrete logic. 
Each flip-flop is independent; the clocks can be gated if desired, 
as can the set and reset lines. This provides for flexibility both 
when replacing random gates and when designing logic that is 
asynchronous in nature. 


The AmPALC29MA16 combines some of the advantages of the 
AmPALC29M16 with the advantages of the PAL20RA10. Mostof 
the AmPALC29M16 macrocell is preserved, except that each 
flip-flop is given a programmable clock, set, and reset line. Each 
output also has individual three-state control. This device is 
electrically erasable, and is HC and HCT compatible. 


A summary of the architecture codes for these types of devices 
is shown in Table 8. The asynchronous devices available are 
summarized in Table 9. 


| paLzorato | 0 
AmPALC29MA16 













Es eataeameynchroneus. | asynchronous 
Asynchronous macrocell 








Table 8. Asynchronous Architectures 


CMOS PAL Devices 


PAL devices have recently become available in CMOS technol- 
ogy, providing performance competitive with that of many bipolar 


DEVICE NAME 





devices, but with reduced power consumption. The functions of 
these davices have been discussed above, along with the TTL 
devices. The particular characteristics of the CMOS versions are 
discussed below. 


Zero-Power vs Low-Power 


Two basic types of CMOS PAL devices are available: those that 
dissipate essentially no power when in a quiescent state, and 
faster devices which draw a nominal amount of current even 
when quiescent. Devices are thus classified as "zero-power” or 
“low-power”. The low-power devices still require far less current 
than their bipolar equivalents. 


The basic difference between zero-power and low-power devices 
is the wake-up circuitry used in the zero-power devices. Any time 
a signal changes, the device wakes up in order to respond to the 
signal. After the transition is complete, the device goes “back to 
sleep”, as long as no other signals are changing. This wake-up 
circuitry exacts a slight speed penalty, usually about 5 ns, but in 
applications where minimal power dissipation is crucial, the 
benefits of zero-power operation greatly outweigh the extradelay. 


The PALC16R8Z family actually has a “turbo product term”. This 
product term allows the device to operate either in zero-power or 
low-power mode. Since this choice has been provided by a 
product term, it is actually possible to have a device idle in zero- 
power mode, and then switch into low-power mode for faster 
operation. This is particularly appropriate for devices that tend to 
operate in bursts, with idle periods between bursts. 


HC vs HCT 


CMOS devices can be characterized as being HC and/or HCT 
compatible. HC-compatible devices have outputs that swing 
“rail-to-rail”, or from ground to VCC. They also have an input 
threshold which is about 2.5 V. These devices are normally used 
when driving other CMOS devices. 


SPEED 





PAL16RA8 


PRODUCT se erat DATA SHEET 
INPUTS OUTPUTS | TERMS/OUTPUT | (t,, in ns) ce (MA) PAGE NO. 





PAL20RA10-20 
PAL20RA10 


AmPALC29MA16-35 
| AmPALC29MA16-45 


* Includes feedback 
** With polarity fuse intact 
tt Has product term steering 





5-95 





Table 9. Asynchronous PAL Devices 
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HCT-compatible devices have TTL-level output voltages; that is, 
a VOH of about 3.5 V and a VOL of about 0.5 V. The input 
threshold is usually adjusted to 1.5 V, whichis the threshold of TTL 
devices. These devices are more suitable for a mixed TTL and 
CMOS system. 


Roughly speaking, HC and HCT devices can be interchanged. 
However, since HCT outputs do not swing rail-to-rail, if they drive 
another CMOS device, that device will draw more power. HCT 
outputs are faster, have more drive, and are more latch-up 
immune. The input threshold is less critical, and is important only 
in very touchy situations. 


Note that some devices are both HC and HCT compatible. This 
means that they have aquick, high drive output risetoa TTLVOH, 
and then a slower rise to VCC, ultimately providing rail-to-rail 
levels. This is illustrated in Table 10. 


INPUT Vor Von 
FAMILY THRESHOLD (V) | (V) (V) 
HC 


HCT 


2.5 Vec 
1.5 3.5 
HC/HCT 1.5 3.5 -> Vec 


Table 10. CMOS/TTL Compatibllity 









PALC16L8Z-25 
PALC16R8Z-25 
PALC16R6Z-25 
PALC16R4Z-25 


PALC16L8Q-25 
PALC16R8Q-25 
PALC16R6Q-25 
PALC16R4Q-25 


PALC20L82Z-35 
PALC20L8Z-45 


PALC20R8Z-35 
PALC20R8Z-45 





PALC20R6Z-35 
PALC20R6Z-45 


PALC20R4Z-35 
PALC20R4Z-45 


AmPALC29M16-35 
AmPALC29M16-45 


AmPALC29MA16-35 
AmPALC29MA16-45 


* Includes feedback 
** With turbo term 


8 35 0 HC/HCT 
8 45 0 HC/HCT 


UV Erasable vs Electrically Erasable 


CMOS PAL devices come in one of two basic technologies: 


« UV erasable 
* Electrically erasable 


UV-erasable devices use EPROM technology in implementing 
the array. They canbe erased by exposure to ultraviolet light if the 
device is packaged in a windowed package. These packages are 
more expensive, so the devices are also available in lower-cost 
plastic packages. UV-erasable devices in plastic packages are 
not erasable, and are thus referred to as “One-Time Program- 
mable”, or OTP devices. 


Electrically erasable devices can be erased by providing the 
appropriate signals to the device. No window is required in the 
package, and the erase time is much faster than the erase time 
of UV-erasable devices. Erasure is implemented by the device 
programmer as a portion of the overall programming algorithm. 


Summary of CMOS PAL Devices 


The PAL devices available in CMOS have been described along 
with their bipolar counterparts in the preceding pages. They are 
summarized in Table 11, along with their power, compatibility, and 
erasability characteristics. 


0.1/45** 
0.1/45** 
0.1/45** 
0.1/45** 


HC/HCT 
HC/HCT 
HC/HCT 
HC/HCT 


HCT 
HCT 


HCT 
HCT 


HC/HCT 

HC/HCT U 
HC/HCT | 
HC/HCT 


U 5-133 

5-133 
U : 
U 
HC/HCT U 
HC/HCT U 
U 
U 


5-133 
5-133 








a 
JA 
90 HCT 
90 HCT 
HC/HCT 
HC/HCT 
HC/HCT 
HC/HCT 


V 
V 
V 
Vv 
EE 
EE 








Table 11. Summary of CMOS PAL Devices 
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ECL PAL Devices 


For a long period of time, PLDs were available only for TTL- 
compatible systems. More recently, however, PAL devices have 
been introduced which address the users of ECL logic. 


Two basic families of ECL logic are addressed by the ECL PAL 
devices. These families differ in their voltage ranges and the 
compensation built into the circuitry (Table 12). These differences 
also affect noise margins and thresholds. Refer to page 3-150 for 
more information on ECL design. 


The ECL PAL devices presently have a 6-ns propagation delay; 
the differences between the devices can be found in the architec- 
tures. The PAL1O0H20P8 and PAL10H20G8 are simpler devices 
which are compatible with other 10KH logic; the former is a 
combinatorial device with programmable polarity, the latter has 
output latches. Both devices have product term steering for 
added flexibility in allocating logic resources. These features are 
described in more detail on page 5-382. 


The PALIOH20EV/EG8 and PAL10020EV/EG8 are 10KH and 
100K compatible options of the same device, respectively. These 
devices have more product terms than the first two devices, and 
have outputs that can be programmed as combinatorial or regis- 
tered/latched. These devices also have global reset and preset 
controls. 


A summary of the architecture codes for these types of devices 
is shown in Table 13. The ECL PAL devices and their basic 
characteristics are summarized in Tables 14 and 15. 


FAMILY COMPENSATION 


10KH Voltage 
100K Voltage, temperature 

















-5.2V + 5% 
-4.5V + 0.3V 





Table 12. 10KH ECL vs 100K ECL 









DEVICE NAME 


PAL10H20P8 
PAL10H20G8 
PAL10H20EV/EG8 








* Includes feedback 
tt Has product term steering 


PRODUCT 
INPUTS oer eae nite chaldesiekce 


§ Flip-flops can be bypassed 
§§ Has varied product term distribution 


Programmable Sequencers 


Many PAL devices can be used to design very fast, efficient 
sequencers. Optima! among these devices are the PAL32VX10 
andthe AmPAL23S8. However, there are additional devices with 
entire architectures dedicated to the task of sequencing. 


Since these devices are primarily intended for use as state 
machines, the important characteristics to be evaluated are: 


¢ The number of inputs 

¢ The number of outputs 

¢ The number of states that can be implemented 

* The number of branches available from any state 
¢ The speed: maximum clock frequency 

* Power consumption 


The programmable sequencers available are shown in Table 16. 


The PMS14R21 (also known as the PROSE™ device) actually 
consists of a PAL device for branching, followed by a PROM to 
store the states. This device also has the Diagnostics-On-Chip™ 
(DOC™) testability circuitry, allowing the device to be inserted 
into a serial scan path. This is described more thoroughly on page 
3-123 of the testability section. 


The PLS105, PLS167, and PLS168 are based on the standard 
PLS architecture, where both the AND array and the OR array are 
programmable. S-R flip-flops are used to minimize the need for 
specifying ‘HOLD’ conditions. There is also a “complement 
array”, which makes it easier to define default state transitions. 


fp | _ Combinatorial, prog. | Combinatorial, prog. polarity | TPALIOH20P8 | OH20P8 
Latched outputs PAL10H20G8 
Latched or combinatorial PAL10H/10020EG8 
Registered or combinatorial | PAL10H/10020EV8 















Table 13. ECL PAL Device Architectures 








ae DATA SHEET 
ro OF asd _| i | PAGE NO. | 
| 6ns—i| soto |S saes 
6ns 225 5-382 
125 MHz 220 5-381 





Table 14. 10KH-Compatibie PAL Devices 





* Includes feedback 
§§ Has varied product term distribution 


PRODUCT DATA SHEET 
DEVICE NAME Baad eUTEuTS: eens TERMS/OUTPUT } EAGE NO: NO. 


§ Flip-flops can be bypassed 












Table 15. 100K-Compatible PAL Devices 
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The PMS14R21, PLS105, PLS167, and PLS168 are all sup- 
ported by PALASM software, using aconvenient entry format that 
has been optimized for state machines. This essentially makes 
it possible to transfer a state diagram directly to a design file 
without the need for manual logictransformations or calculations. 


The Am29PL141 Fuse Programmable Controller (FPC) is an in- 
struction-based sequencer. This is especially convenient for de- 
signers who are more comfortable using instruction-based de- 
vices. The design is implemented more as microprocessor 
program might be written. Indeed, the architecture itself, which is 
described in more detail on page 5-339, resembles that of a 
simple processor. This device has its own assembler which is 
used to reduce the sequencer “program” into a file that can be 
used to program the device. 


The Am2971 is a Progammable Event Generator. It is a versatile 
timing device that can be used as a digital substitute for analog 
delay lines or as a general-purpose user-programmable timing/ 
waveform generator. 


LCA Devices 


Large-scale designs are now possible with the introduction of the 
Logic Cell Array (LCA) devices. These devices can be thought of 









PLS105-37 
PLS167-33 
PLS168-33 









as programmable gate arrays, and provide an efficient, cost- 
effective means of implementing dense designs within a single 
PLD. 


Unlike other PLDs, which are non-volatile, LCA devices make use 
of static RAMtechnology. This makes them easy to develop, easy 
to test and debug, and reprogrammable. 


There is an entire family of support software and hardware 
packages available to aid in the design of LCA applications. 
These are described in more detail on page 3-17. 


The architecture of an LCA device resembles that of a gate array 
more closely than a traditional PLD. It has an array of Configur- 
able Logic Blocks (CLBs) that can be connected via a variety of 
interconnect resources. There are also Input/Output Blocks 
(IOBs) for transferring the signals on and off the chip. 


The main features that distinguish one LCA device from another 
are: 


¢ The number of I/O pins 

¢ The number of Configurable Logic Blocks 

¢ The speed: internal maximum toggle frequency 
¢ Power consumption 


The available devices are summarized in Table 17. 


eae 


DATA SHEET 
In MHz) | PAGE NO. 














200 5-331 
200 5-331 
200 5-331 





* Depends highly on state diagram topology. May be limited by number of product terms or number of flip-flops. 
Table 16. Programmable Sequencers 










DEVICE NAME 


CLBs ore a TOGGLE 






STANDBY 
ee (MA) 


DATA SHEET 
PAGE NO. 










SPEED 


in MHz) 















M2064-70 
M2064-50 
M2064-33 








M2018-50 
M2018-33 





5-483 
5-483 






5-483 
5-483 





Table 17. LCA Devices 
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Introduction 


This section is intended as a beginner's introduction to PLD 
design, although experienced users may find it agood review. We 
will take a step-by-step approach through two very simple de- 
signs to demonstrate the basic PLD design implementation 
process. 


By “beginner”, we mean a logic designer who is just beginning to 
use programmable logic. You may have a lot of experience with 
discrete digital logic, or you may have just graduated from 
college. We assume a basic understanding of digital logic, 
although there is a logic reference on page 6-1 should you 
need a refresher. Some computer experience is helpful, but not 
essential. 


Through this effort, you will be introduced to PALASM software 
and to the concept of device programming. Because of the 
simplicity of the designs, we will be using PAL devices, but the 
implementation concepts will be applicable to all of Monolithic 
Memories’ and Advanced Micro Devices’ PLDs. 


Only enough detail of the software will be presented to allow you 
to implement the examples. This will also give you enough 
information to understand most of the other design examples in 
this handbook. More software details can be found in the actual 
software documentation in section 4. 


To keep the discussion simple, we will make some assumptions 
about the system being used to process your designs. We will 
assume anIBMPC/XT™ orcompatible system with one hard disk 
and one floppy drive, running PC-DOS™ or MS-DOS™ version 
2.1 or higher. The installation procedure for the software will be 
described for this setup. The procedures for other setups and 
machines are not radically different; details can be found in the 
PALASM software documentation, section 4. 


We will also take no significant shortcuts for these examples, 
even though there may be times when we could. In this way, you 
can gain a better understanding of exactly what is happening as 
you implement your design. 


Wewill talk about device programming, describing all ofthe steps 
that are necessary to program a PLD. However, due to the wide 
variety of programmers available, we will not get down to the level 
of detail that tells you exactly which buttons to push. Although we 
will get as close as we can, we must defer the details to your 
programmer manual. 


As we work the design examples, we will, of course, have to 
choose devices into which to put the designs. However, we will 
not dwell here on the parameters that go into choosing a device. 
This is covered more thoroughly in the rest of this section. 


Installing PALASM Software 


Before we actually start implementing a design, we must install 
PALASM software. If you already have the software installed, you 
may disregard this discussion. The procedure described here will 
allow you to install the software package onto an IBM PC/XT or 
compatible with a hard disk and at least one floppy drive. 


After you have turned on the computer and allowed it to boot up, 
you can install the software by placing disk #1 into drive A, and 


typing: 
A:PAL2INST<CR> 


If you are new to the PC, <CR> is the ‘RETURN’ key on the right 
side of the keyboard. 


The program will ask you which disk you wish to install the 
programs on. Enter the disk name, or just hit <CR> if installing 
onto disk C. The installation program creates directories for all of 
the programs. If you presently have an older version of PALASM 
software installed in a similar directory structure, the installation 
program will warn you that the contents of the directories will be 
changed. Otherwise, it will start installing. 


Note that the installation procedure may modify two files which 
are used when the computer boots up: AUTOEXEC.BAT and 
CONFIG.SYS. These changes should not affect any other 
programs you run. 


The installation program will then give you a choice of programs 
to install; hit 


1<CR> 

to install the software. The installation program will ask you to 
insert various disks as it installs the programs. The disks are 
labelled, so this should be a simple procedure. After the software 
has been installed, reboot the system by holding down 
<CTRL><ALT><DEL> 

at the same time. This only needs to be done after installation. In 
the future when you turn the computer on, the computer will boot 
automatically. 

You can now call up the menu by typing 

PALASM<CR> 

Now you need to install the editor or word processing program 


you wish to use. You may use any editor you wish, as long as it 
can generate a clean ASCIl text file. In addition, you need to 
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install a communication program; any one will do. Asimple one 
called PC2 is provided with the PALASM software. You caninstall 
these programs by hitting 


<F6> 


When the setup screen comes up, hit the down-arrow key until it 
gets to the line that says “Editor”. Enter the directory path and 
program name for the editor you wish to use. For example, if you 
use WordStar™, and you have the program located in the 
directory called “WS” on disk C, then enter 


C:\ws\ws 
Add enough spaces to delete whatever was there before. 


This takes you into the next field. Here you need to enter the 
filetype of your editor, which will be either .COM or .EXE. 


Once this is complete, for the WordStar example, the line should 
now look like: 


C:\ws\ws com 


Hit <CR> again to get to the next field. Here you enter the name 
of your communication software. If you wish to use PC2, then 


type 


\PALASM\SUPL\PC2 

(adding spaces to delete what was there before.) 
<CR> 

.EXE 


If you are not using PC2, then replace “PC2” with the name of your 
program. This procedure works regardless of the editor or 
communication software you wish to use. When you are 
finished, hit 


<ESC> 
to get back to the menu. 


You are now ready to start. 


Constructing a Combinatorial Design 
—Basic Gates 


The first example we will try is a very simple combinatorial circuit 
consisting of all of the basic logic gates, as shownin Figure 1. This 
willbe helpful for those designs where you are integrating random 
logic into a PAL device to save space and money. 


Ascanbe seen from the figure, there will be six separate functions 
involving a total of twelve inputs. It is important to bear in mind that 
programmable logic provides a convenient means of implement- 
ing designs. With a real design, some work would be required 
before this point to conceptualize the design, but due to the 
simplicity of these circuits, we are already in a position to start the 
implementation. 
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Cc 
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Figure 1. The Basle Logic Gates 


Building The Equations 


We will start by generating Boolean equations. The first function 
to be generated is an inverter. This is specified according to 
Figure 1 as: 


B=/A 

Here the “equals” sign (‘=’) is used to assign a function to output 
B. The slash (‘/’) is used to indicate negation, since it is impossible 
to put a bar over a letter in an ASCII file. Thus this equation may 
be read: 

B is TRUE if NOT A is TRUE 


The next function is a simple AND gate. As shown in Figure 1, we 
can write 


E = CD 

Here we use the “equals” sign again, but this time we have 
introduced the asterisk (‘*’) to indicate the AND operation. This 
equation may be read: 

E is TRUE if C AND D are TRUE 

The third function is an OR gate, which may be written: 
H=F+G 

The “plus” sign (‘+’) is used to specify the OR operation here. 
Because of the sum-of-products nature of logic as implemented 
in PLDs, it is often easy to place product terms on separate lines, 


which improves the readability. We may rewrite this equation as: 


H=F 
G 


+ tt 


This equation may be read: 


H is TRUE if F OR G is TRUE 
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For the moment, we will assume that we have active-HIGH 
outputs on our device. The functions we have generated so far 
have essentially been active-HIGH functions. At times we wish to 
generate active-LOW functions; the next two functions are 
active-LOW functions that we wish to implement in an 
active-HIGH device. 


When we talk in terms of an active-HIGH or an active-LOW 
device, the real question is whether there is an extrainverter atthe 
output. An active-HIGH device has an AND-OR structure; an 
active-LOW device has an AND-OR-INVERT structure which 
inverts the function at the output (see Figure 2). 


a. AND-OR Structure 


b. AND-OR-INVERT Structure 


403 02 
Figure 2. Active HIGH vs. Active LOW 


NAND and NOR gates could be generated very simply in an 
active-LOW device, because we would just have togenerateAND 
and OR functions, and let the output inverter generate their 
complements. However, given that we wish to implement these 
functions in an active-HIGH device, we must invoke DeMorgan's 
theorem, as shown in Figure 3. 


/(X*Y) = /X + /Y 
/(X + Y) = /X*/Y¥ 
Figure 3. DeMorgan’s Theorem 
PALASM software has the capability of applying DeMorgan’s 
theorem to functions, so we may generate our NAND function by 
writing: 
L = /(I*J*K) 
or, if preferred, 
L = /I 
+ /J 


+ /K 


Likewise the NOR function may be specified as 


O= /(M 
+ N) 

or 

O = /M*/N 


Finally, an exclusive-OR (XOR) gate may be specified either as 
R=P :+: Q 
where :+: represents the XOR operation, or more explicitly as 


R = P*/Q 
+ /P*Q 


We have now specified all of the functions in terms of their 
Boolean equations. The equations are summarized in Figure 4. 


Bea. 7h 
E = C*D 
H=F 
+G 
L=/I 

+ /d 

+ /K 
O = /Mt/N 
R= PHO 

+ /P*Q 


Figure 4. Basic Gates Equations 


Understanding the Logic Diagram 


We will use a PAL12H6 for this function, since it has all of the 
resources needed to implement all of these functions. Aportion 
of the logic diagram for this device is shown in Figure 5. The full 
logic diagram for this device can be found on page 5-62. 


The logic diagram shows all of the logic resources available in a 
particular device. In each device, inputs are provided in true and 
complementversions, as shown in Figure 5. These drive what are 
often called “input lines”, which are the vertical lines in the logic 
diagram. These input lines can then be connected to product 
terms. The name “product term” is really just a fancy name for an 
AND gate. However, PLDs provide very wide gates, which can be 
cumbersome to draw. To save space, the product terms are 
drawn as horizontal lines with asmallAND gate symbolat one end 
to indicate the function being performed. 


So we see that in the PAL12H6, there are twelve inputs and six 
outputs. Four of the outputs, on pins 14 through 17, have two 
product terms connected to the OR gates. The outputs on pins 
13 and 18 have four product terms connected to the.OR gates. 
Thus pins 13 and 18 can be used to implement more complex 
functions than the other four outputs. 


Although you really do not need to be concerned with the actual 
implementation of these functions inside the PAL device, you may 
be curious. Figure 6 shows how the inverter andthe AND gate are 
implemented in the PAL12H6. An ‘X’ indicates a connection. A 
product term that is not used is indicated by an 'X’ in the small 
AND gate. 
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Figure 5. A Portlon of a Logic Diagram 
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Building the Design File 


Once the design has been conceptualized, the design file must be 
generated. This can be done with any editor program. The only 
requirement is that the editor produce a clean ASCII file. Most 
word processing programs have this capability, so you can use 
your favorite program. Any hidden control characters for format- 
ting may cause problems when assembling; thus only “non- 
document” or “clean” modes should be used. The filename is 
usually given the extension '.PDS' (for PAL device Design Speci- 
fication), although this is not required. 


We now know exactly what our functions are going to be. We 
have twelve inputs, six outputs, and the NAND function requires 
three product terms. Note that if we had specified 


L = /(I*JT*K) 
instead of 
L = /I 

+ /J 

+ /K 


for the NAND gate, it would not be as obvious how many product 
terms would be needed. 


We are now in a position to create the design file. First you need 
to enter the PALASM menu. This menu will let you perform all of 
the tasks you need, including editing your design file. Callup the 
menu by typing 


PALASM <CR> 

When prompted, hit any key to bring the menu onto the screen. 
Enter the file name of the design file you wish to create in the field 
called “Input PDS file’. The cursor should be there when the 
menu appears on the screen. Then hit 

<CR> 

This will take you to the filetype field, which should already say 
“,PDS”. We recommend that you use“.PDS" for allof your design 
files. Hit 

<CR> 

agaln. At this time, the menu will say that the file cannot be 
opened; this is just because you have not actually created the file 
yet, so there is noneed to be concerned. Youcan now create your 
design file by hitting 

<F3> 

This will call up your editor, and you can begin building the file. 
We start with what is called the DECLARATION SECTION. This 


section allows you to document your design, and also provides 
some definitions for the assembler. 


The first step is to provide some documentation information. This 
is done with six keywords: 


TITLE (The title of your design goes here) 

PATTERN (The pattern name or number goes here) 
REVISION (The revision number or level goes here) 
AUTHOR (Your name goes here) 

COMPANY (Your company name goes here) 

DATE (The date of creation or modification goes here) 


This is followed by the CHIP declaration, which tells the assem- 
bler which device is being used and what the pin names are going 
to be. We do this by typing: 


CHIP (any name goes here) (the device type goes here) 
(the pin list goes here) 


Thus, we can create the declaration section for our design as 
shown in Figure 7. 


TITLE Basic gates 

PATTERN P0000 

REVISION A 

AUTHOR Stateyour Namehere 
COMPANY Name of your Co., Inc. 
DATE 7/22/87 


CHIP GATES PAL12H6 
ACDEGIJKM GND 
NPBEHROLQ VCC 


Figure 7. The Declaration Section 


Note that the pin list has all pins In order, from pin 1 to pin 20, 
including VCC and GND. You can see that the inputs were 
assigned to pins 1-12 and 19, whereas the outputs have been 
assigned to pins 13-18. Of course these assignments must be 
made knowing which pins are inputs and outputs, as per the logic 
diagram. Knowing that the NAND gate requires three product 
terms, we assigned this function to pin 18, since it can provide up 
to four product terms. 


We can make the pin list easier to read by adding the pin numbers 
as comments. Anything following a semicolon (;) is treated as a 
comment, and Ignored by the processor. This helps make the 
design file more readable. The pin list with comments is shown 
in Figure 8. Notice how much easiar it is to read. 


CHIP GATES PAL12H6 


1 


;PINS 7 8 9 10 
J K M GND 


123 4 5 6 
A Cc DF Git 
;PINS 11 12 13 14 15 16 17 18 19 20 

N P B EH ROL Q vCC 


Figure 8. Pin List with Comments 
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Next we enter the equations. This section begins with the 
keyword 


EQUATIONS 


after which all of the equations are entered, exactly as shown in 
Figure 4. Note that the equations may be defined in any order, not 
necessarily in the order presented in the pin list. 


The fact that this is an active-HIGH device does have some 
bearing on the way the file is built. This is discussed at length on 
page 6-19 and in the software documentation; we will need to 
address the issue again below, when we do an active-LOW 
design. For now, suffice it to say that the design as specified here 
will work correctly for an active-HIGH device. 


Finally, add comments to the equations to document the design. 
Liberal commenting is encouraged to make it easier for you and 
others to understand your design in the future. 


The equation section, with comments, is shown in Figure 9. 


EQUATIONS 
B=/A yinverter 
E = C’*D 7AND gate 
H=F ;OR gate 
+G 
L=/I ;NAND gate after applying 
+ /I ; DeMorgan’s theorem 
+ /K 
O = /M*/N ;NOR gate after applying 
; DeMorgan’s theorem 
R = P/Q 7;XOR gate, expanded 
+ /P*Q 


Figure 9. The Equation Section 


When you have completed the file, save the file and get out of the 
editor. 


Generating a JEDEC File 


Once the design file has been entered, you can assemble the 
design to get aJEDEC file. We have two purposes here: to make 
sure there are no basic mistakes in the file, and to generate a 
JEDEC file for programming. 


To start processing, hit 


<F5> 


This gives you a choice of processing options on the right of the 
screen. To run all of the processing steps automatically, hit 


6 


The first program will now run, as evidenced by the file scrolling 
up the bottom of the screen. If there are any errors, they will 
appear on the screen and in an error file. These errors will be 
syntactical in nature; typos, misspelled words, missing declara- 
tions, malformed expressions. 


Once the first program has been run with no errors, the equations 
are minimized. It is a good practice to run the expander and 
minimizer for all designs. Minimization may allow you to fit a 
design into a smaller device. It also improves the testability of 
your design. More complicated designs may have to be ex- 
panded and minimized. 


After the minimization has been performed, the JEDEC file will be 
generated. If there are any reported errors, it will likely be 
because you requested some function that the part you chose 
could not provide. Some examples are: 

* Active-HIGH equations in an active-LOW device 

¢ Too many product terms 

* Registered equations in a device that has no registers 
Since you have used the “autorun” feature, the simulation pro- 
gram will automatically be run. Since we have not specified the 
simulation yet, this will generate an error. Ignore the error 
message for now. 
When processing is complete, hit 
<ESC> 
to get back to the main menu. 


If there were any errors during processing, you can see what they 
were by viewing the run-time log. This is done by hitting 


<F7> 

to get a choice of things to view. To view the run-time log, hit 
1 

You can scroll down until you find any errors and make a note of 
what needs correcting. Once you have found the error(s), youcan 
get back to the menu by hitting 


<ESC><ESC> 


Any corrections should be made by editing the design file. After 
making any changes, the design needs to reprocessed. 
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The JEDEC file will have the same file name as your design file, 
except that it will have the extension ‘JED’. The program also 
creates an ‘xplot’ file (with file extension '.XPT’) which relates the 
final design implementation to the logic diagram of the PLD being 
used. The xplot file is usually not needed, but it confirms the 
physical implementation of the gates. Figure 10 shows a portion 
of the xplot for this design; the inverter and the AND gate are 
shown. You can see how this relates to the logic diagram by 
referring back to Figure 6. 


You may view the complete xplot file by hitting 
<F7><PgDn><1> 

To return to the main menu after inspecting the xplot file, hit 
<ESC> 


40 X--- X--- -- 00 --O00 ~-00 --00 ---- ---- 
41 XXXX XXXX XXOO XXOO XXOO XXOO XXXX XXXX 
42 0000 0000 0000 0000 0000 0000 0000 0000 
43 0000 0000 0000 0000 0000 0000 0000 0000 
44 0000 0000 0000 0000 0000 0000 0000 0000 
45 0000 0000 0000 0000 0000 0000 0000 0000 
46 0000 0000 0000 0000 0000 0000 0000 0000 
47 0000 0000 0000 0000 0000 0000 0000 0000 ° 


48 i-=-X A= —== 00 --00 --00 --00 ---- ---- 
49 XXXX XXXX XXOO XXOO XXOO XXOO XXXX XXXX 
SOQ XXXX XXXX XXOO XXOO XXOO XXOO XXXX XXXX 
51 XXXX XXXX XXOO XXOO XXOO XXOO XXXX XXXX 
52 0000 0000 0000 0000 0000 0000 0000 0000 
53 0000 0000 0000 0000 0000 0000 0000 0000 
54 0000 0000 0000 0000 0000 0000 0000 0000 
55 0000 0000 0000 0000 0000 0000 0000 0000 


Figure 10. A Portion of an Xplot 


Simulating the Gates 


After you have verified that your design file is correct, it is time to 
verify that the design itself is correct. This is done by simulating 
the design. Simulation provides a way for you to see whether your 
design is working as you expect it to. You provide a series of 
commands, or events, which are then simulated by the software. 
if requested, the software can tell you if the simulation matches 
what you expect, and, if not, where the problems are. 


The simulation section is the last part of the design file. It is not 
required, but is invariably helpful both in debugging the design, 
and in generating what can eventually be used as a portion of a 
test vector sequence. It is introduced by the keyword: 


SIMULATION 


The events to be simulated are specified by placing logic levels 
on pins of interest. Any pins not specifically mentioned will either 


maintain their present level, or, if no level was aver defined, 
remain undefined. The SETF command is used to set levels. 


We can start by simulating the inverter. To prove that the inverter 
works as expected, first we wish to set input A HIGH to verify that 
the output goes LOW. This is done with the statement: 


SETF A 


Since all we did here was set A HIGH, all other inputs are still 
undefined. However, this is enough to determine the output for 
signal B. 


The simulator will calculate the value for B and put it in the output 
file (to be discussed later), but it is helpful to write what you expect 
B to be. If the simulator calculates a result that is different from 
what you expect, it will alert you, and place a marker in the output 
file to tell you where the problem is. You can do this with the 
CHECK statement. Since A is HIGH here, we expect B to be 
LOW. To verify that B is indeed LOW, use the statement: 


CHECK /B 


Note that CHECK statements are not required for simulation. 
However, without them, you must examine the simulation output 
to see if the design functioned correctly. 


We also need to verify that when A goes LOW, B goes HIGH, as 
expected. This can be done with the pair of statements: 


SETF /A 
CHECK B 


We have now fully exercised the inverter. However, the standard 
output file forthe simulator, called the history file, tracks all signals 
cn the device, in increasing pin order. From Figure 8, this would 
be in the order 


ACDFGIJKMNPBEHROLQ 


Instead, we may wish to place related signals near each other. 
We can do this with the TRACE_ON and TRACE_OFF com- 
mands. These commands cause a second output file to be 
generated, which will only show signals and events as deter- 
mined by the TRACE_ON command. 


In our case, we would like to have the output of the gate appear 
right after its inputs, so that the workings of the gate can be easily 
examined. This can be done with the command: 


TRACE ONABCDEFGHIdJKLMNOPQR 


At the end of the simulation, the TRACE_OFF command is used 
to end the special trace. 


Note that both a history file and a trace file will now be generated. 
The history file will have the simulation results in increasing pin 
order; the trace file will have the signals in the order requested 
with the TRACE_ON command. 
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Now we can exercise all of the other gates in the circuit. The full 
simulation section is shown in Figure 11. 

SIMULATION 

TRACE ONABCDEFGHIJKLMNOPQR 


slook at the inverter 


SETF A sset A HI 
CHECK /B ;verify that B is LO 
SETF /A gset A LO 
CHECK B sverify that B is HI 


send of inverter trace 


;look at AND gate 


SETF /c /D sset C, D LO 

CHECK /E jverify E LO 

SETF D 7C stays LO, D goes HI 
CHECK /E 7E should stay LO 

SETF Cc /D sset C HI, D LO 

CHECK /E 7E should stay LO 

SETF D 3C stays HI, D goes HI 
CHECK E 3E should now be HI 


send of AND gate trace 


slook at OR gate 


SETF /F /G sset F, G LO 

CHECK /H zverify H LO 

SETF G 7F stays LO, G goes HI 
CHECK H 7H should go HI 

SETF F 3G stays HI, F goes HI 
CHECK H 7H should stay HI 

SETF /G ;F stays HI, G goes LO 
CHECK H 7H should stay HI 


zend of OR gate trace 
zlook at NAND gate 


SETF /I /J /K ;IJK = 000 

CHECK L ;verify L HI 

SETF K ;IJK = 001 

CHECK L ;L should still be HI 
SETF J /K ;IJK = 010 

CHECK L 7L should still be HI 
SETF K ;IJK = 011 

CHECK L 7;L should still be HI 
SETF I /J /K ;IJK = 100 

CHECK L ;L should still be HI 
SETF K ;IJK = 101 

CHECK L 7L should still be HI 
SETF J /K ;IJK = 110 

CHECK L 7L should still be HI 
SETF K ;IJK = 111 

CHECK /L 7L should go LO 


send of NAND gate trace 
jlook at NOR gate 


SETF /M /N zset M, N LO 

CHECK O sverify O HI 

SETF N 7M stays LO, N goes HI 
CHECK /O 370 should go LO 


SETF M 3M, N now both HI 


CHECK /O 70 should stay LO 
SETF /N 7M stays HI, N goes LO 
CHECK /O 70 should stay LO 


zend of NOR gate trace 


jlook at XOR gate 


SETF /P /Q set P, Q LO 

CHECK /R iverify R LO 

SETF Q 7P stays LO, Q goes HI 
CHECK R 7R should go HI 

SETF P /Q snow P HI, Q LO 

CHECK R 7R should stay HI 

SETF Q sboth inputs HI 

CHECK /R 7R should go LO 
TRACE_OFF zend of trace 


Figure 11. The Simulation Section 


This completes the simulation. The entire design file is shown in 
Figure 21 at the end of this section. You can now run the 
simulation to make sure that your design is correct. 


Running the Simulator 


After adding the simulation, the file needs to be reprocessed. Do 
this again by hitting 


<F5> 

and then hitting 

6 

This completely reprocesses the file and runs the simulation. 


Each of the events specified in the simulation section of the 
design file is executed. The resulting outputs are calculated, and, 
if CHECK statements are used, the calculated outputs are com- 
pared to the expected outputs. If the two values do not match, 
then you are expecting an output that the circuit is not generating. 
This means either that the expected values are wrong, or that 
there is a problem with the circuit itself. In either case, the design 
file must be modified to fix the problem, and then the file should 
be completely reprocessed. 


The main output of the simulator is the history file. In addition, a 
trace file willbe generated if the TRACE_ON statement was used, 
as inthe example above. These files will have the same name as 
your original design, except that the extension for the history file 
is ‘HST’ and that for the trace file is ‘TRF’. 

You may look at the history and trace files with your editor, or print 
them outforcloser analysis. Note that the history file contains the 
results of every signal and every event in increasing pin order, 
whereas the trace file contains signals in the order specifically re- 
quested by the TRACE_ON commands. 


It is usually easier to look at the simulation output as waveforms. 
To generate the waveforms, hit 


<F7> 

To see the waveforms for the trace file, hit 

6 

Once the waveforms are on the screen, you may scroll around 
using the arrow keys. The vertical bar cursor will help you line up 
events. 

After you are through examining the file, hit 

<ESC> 

to get out of the waveform generator, and then, 


<ESC> 


to get back to the main menu. 
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This makes it much easier to debug your design. You can look at 
the entire simulation in the history file, or break it up into manage- 
able bits in the trace file. 


The simulator also converts the simulation results into test 
vectors, and appends the vectors to the JEDEC file. This gives 
you two JEDEC files; one with vectors and one without. The new 
file has the same name as your original design file, but with the 
extension ‘JDC’. This file can be used with programmers that 
provide functional tests. 


Constructing a Registered Design— 
Basic Flip-Flops 


Next we will do a very simple registered design: we will be 
designing all of the basic flip-flop types (Figure 12). We will 
conceptualize the design by reviewing briefly the behavior of the 
D-type flip-flop. We will then present the results for T, J-K, and S- 
R flip-flops. More detail on these flip-flop types can be found in 
the logic reference, on page 6-8. 


Remember that the devices we willbe using only have D-type flip- 
flops. Thus we will be emulating the other flip-flops with D-type 
flip-flops. 


CLK 

D DT 
DC 

T TT 
TC 

J JKT 

K JKC 

S SRT 

R SRC 
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Figure 12. Basic Flip-Flops 


Building the D-Type Flip-Flop 
Equations 


AD-type flip-flop merely presents the input data at the output after 
being clocked. Its basic transfer function can be expressed as 


DT := D 


where we have used pins DT ("D True”) and D as shown in 
Figure 12. 


Note the use of ‘:=’ here instead of '=’. This indicates that the 
output is registered for this equation. The difference is illustrated 


in Figure 13. 
i—+ : 
b. DT=D 
D DT 
CLK | 
403 07 a. DT:=D 


Figure 13. Registered vs. Combinatorial Equations 


We can also generate the complement signal (named DC) with 
the statement 


DC := /D 


As per Figure 12, we want to add synchronous preset and clear 
functions to the flip-flops. This can be done with two input pins, 
called PR and CLR. To add these functions to the true flip-flop 
signal, we add /CLR to every product term and add one product 
term consisting only of PR. Likewise, for the complement 
functions, we add /PR to every product term, and add one product 
term consisting only of CLR. This is shown in Figure 14. 


DT := D*/CLR 
+ PR 


DC := /D*/PR 
+ CLR 


Figure 14. D-Type Flip-Flops with Clear and Preset 
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Inthis way, when clearing the flip-flops, the active-HIGH flip-flops 
have no product terms true, and go LOW; the active-LOW flip- 
flops have the last product term true, and will therefore go HIGH. 
The reverse will occur for the preset function. 


There is still one hole in this design: what happens if we preset 
and clear atthe same time? As it Is right now, both outputs will go 
HIGH. This makes no sense since one signal is supposed to be 
the inverse of the other. To rectify this, we can give the clear 
function priority over the preset function. We can do this by 
placing /CLR on every product term for the true flip-flop signal. 
The results are shown in Figure 15. 


EQUATIONS 


DT := D*/CLR 
+ PR*/CLR 


DC := /D*/PR 
+ CLR 


Figure 15. D-Type Flip-Flops with Clear Priority 
Building the Remaining Equations 


The same basic procedure can be applied to all of the other flip- 
flops. The equations are shown in Figure 16. 


semulating all flip-flops with D-type flip-flops 


DT := D*/CLR 
+ PR*/CLR 


DC := /D*/PR 


soutput is D if not clear 
yor 1 if preset and not clear at the same time 


youtput is /D if not preset 


+ CLR yor 1 if clear 


TT se T*/TT*/CLR 
+ /T*TT*/CLR 
+ PR*/CLR 
TC := T*/TC*/PR 
+ /T*TC*/PR 
+ 


790 HI if toggle and not clear 
;stay HI if not toggle and not clear 
;go HI if preset and not clear at the same time 


790 HI if toggle and not preset 
;stay HI if not toggle and not preset 


CLR 7go HI if clearing 


JKT 3:= J*/JKT*/CLR 
+ /K*IKT*/CLR 
+ PR*/CLR 

JKC := /J3*/JKC*/PR 


+ K*/JKC*/PR 
+ 


7g0 HI if J and not clear 
;stay HI if not K and not clear 
7go HI if preset and not clear at the same time 


7g0 HI if not J and not preset 
;stay HI if K and not preset 


CLR 7go HI if clear 


SRT := S*/CLR 
+ /R*SRT*/CLR 
+ PR*/CLR 
SRC := R*/PR 


+ /S*SRC*/PR 
+ 


7go HI if set and not clear 
7stay HI if not reset and not clear 
;go HI if preset and not clear at the same time 


;go HI if reset and not preset 
;stay HI if not set and not preset 


CLR 7go HI if clear 


Figure 16. Flip-Flop Equations Section 
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CLK 


CLR 


PR 


‘= T° [ArT] */CLR 
+/T° [TT] */CLR 
+PR/CLR 





Figure 17. Feedback In the Equation for TT 


Notice that in some of the equations above, the output signal itself 
shows up in the equations. This is the way in which feedback from 
the flip-flop can be used to determine the next state of the flip-flop. 
An equivalent logic drawing of the TT equation is shown in 
Figure 17. 


Completing the Design File 


We are now in a position to select a device and complete the 
device file. The device used will be a PAL16R8, which is actually 
an active-LOW device. The logic diagram for this device can be 
found on page 5-47. The implications of an active-LOW device 
are discussed more fully on page 6-19. For now, suffice it to say 
that one way of implementing active-LOW logic is to declare 
outputs in the pin list with a slash (‘7) in front of them. This 
essentially defines them as ‘negative’ or ‘inverted’ pins. 


The declaration section is built up just like the last one, and is 
shown in Figure 18. 


TITLE Basic flip-flops 

PATTERN P0001 

REVISION A 

AUTHOR Stateyour Namehere 

COMPANY Nameofyour Co., Inc. 

DATE 7/22/87 

CHIP Flip-flops PAL16R8 

sPINS 1 23 4 5 #6 7 8 9 10 
CLK J K T PR CLR D S R_ GND 

7PINS 11 12 13 14 15 
OE /SRC /SRT /DC /DT 


7;PINS 16 17 18 19 20 
/TC /TT /JIKC /JKT VCC 


Figure 18. Basic Flip-Flops Declaration Section 


Notice that this device has aclock and an output enable pin, called 
CLK and OE, respectively. The input and output pins are named 
as shown in Figure 12. The output pins are all defined with 
slashes, to indicate that they are inverted pins. 


The design can now be processed by hitting 
<F5> 

and then hitting 

6 


This will tell you whether there are any basic problems with your 
design. You can correct any mistakes by editing the file and then 
reprocessing. 


Simulating the Flip-Flops 


After processing the design and correcting any mistakes, we can 
write the simulation. We have all of the simulation instructions we 
need, except for one new instruction which simplifies manipula- 
tion of the clock signal. It is possible to use SETF instructions with 
the clock pin, but then each clock transition would require two 
instructions: one to set the clock HIGH, and one to bring it 
back LOW. 


Instead, we can use the CLOCKF instruction. This pulses a clock 
pin in one instruction. Of course, registered outputs will not 
change state until after the rising edge of the clock signal. Since 
we have named the clock pin CLK, we can clock this device with 
the instruction 


CLOCKF CLK 


We can simulate the true D-type flip-flop as follows: 


SETF D ;set the D input HI 

CLOCKF CLK ;clock the device 

CHECK DT ;verify that the output went HI 
SETF /D ;set the D input LO 

CLOCKF CLK ;clock the device 

CHECK /DT ;verify that the output went LO 


Before we actually simulate any registered design like this, two 
items must be initialized: the clock and the output enable pin (OE). 
Since the CLOCKF statement executes a HIGH-LOW pulse on 
the clock pin, we must first make sure that the clock is set LOW 
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to begin with. We must also enable the outputs, which is done by SETF CLR /PR jset the clear pin 
setting the OE pin LOW. These tasks can be accomplished with CLOCKF CLK jelear the circuit 


the statement: 


SETF /CLK /OE 


jmake sure the clear function worked 
CHECK /DT DC /TT TC /JKT JKC /SRT SRC 
SETF /CLR jremove the clear signal 


We also need to initialize the flip-flops. At the same time we can 
verify that the CLR function is working correctly: We are now in a position to simulate the entire circuit. Athorough 


simulation is shown in Figure 19. 


SIMULATION 

TRACE_ON CLR PR D DT DC T TT TC JK JKT JKC SR SRT SRC 
jinitialize the circuit 

SETF CLR /PR j;set the clear pin 
CLOCKF CLK jclear the circuit 


;make sure the clear function worked 
CHECK /DT DC /TT TC /JKT JKC /SRT SRC 


SETF /CLR ;xemove the clear signal 
;check out the preset function 

SETF PR ;set the preset pin 
CLOCKF CLK ;preset the circuit 

CHECK DT /DC TT /TC JKT /JKC SRT /SRT 

SETF /PR ‘ remove the preset signal 
verify that clear has priority 

SETF PR CLR ;set both clear and preset 
CLOCKF CLK 

CHECK /DT DC /TT TC /JKT JKC /SRT SRC 

SETF /PR /CLR j;remove both preset, clear 


;disable all flip-flop inputs for now 
SETF /D /T /J /K /S /R 


;check out D-type flip-flops 


SETF D ;set the D input HI from LO 

CLOCKF CLK ;clock the device 

CHECK DT /DC ;verify that the true output went HI 
jand the complement went LO 

SETF D shold a HI 

CLOCKE CLK 

CHECK DT /DC verify that state maintained 

SETF /D ;set the D input LO from HI 

CLOCKF CLK ;clock the device 

CHECK /DT DC jverify that the true output went LO 
jand complement went HI 

SETF /D shold a LO 

CLOCKF CLK 

CHECK /DT DC jverify that state maintained 


;check out T-type flip-flops 


SETF /T shold a LO 

CLOCKF CLK 

CHECK /TT TC true output should still be LO 
;complement output HI 

SETF T jtoggle froma LO 

CLOCKF CLK 

CHECK TT /TC jboth outputs should have changed state 


Figure 19. Filp-Flop Simulation Section 
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SETF /T 
CLOCKF CLK 
CHECK TT /TC 


SETF T 
CLOCKF CLK 
CHECK /TT TC 


seheck out J-K 
SETF /J /K 
CLOCKF CLK 
CHECK /JKT JKC 


SETF J /K 
CLOCKF CLK 
CHECK JKT /JKC 


SETF /J /K 
CLOCKF CLK 
CHECK JKT /JKC 


SETF /J K 
CLOCKF CLK 
CHECK /JKT JKC 


CLOCKF CLK 
CHECK /JKT JKC 


SETF J K 
CLOCKF CLK 
CHECK JKT /JKC 


SETF J /K 
CLOCKF CLK 
CHECK JKT /JKC 


SETF J K 
CLOCKF CLK 
CHECK /JKT JKC 


;check out S-R 
SETF /S /R 
CLOCKF CLK 
CHECK /SRT SRC 


SETF S /R 
CLOCKF CLK 
CHECK SRT /SRC 


SETF /S /R 
CLOCKF CLK 
CHECK SRT /SRC 


SETF S /R 
CLOCKF CLK 
CHECK SRT /SRC 


SETF /S R 
CLOCKF CLK 
CHECK /SRT SRC 


CLOCKF CLK 
CHECK /SRT SRC 


TRACE_OFF 


shold a HI 

;both outputs should have held their state 

;toggle froma HI 

;both outputs should have changed state 
flip-flops 

shold a LO 

;both outputs should have held their state 

;set a HI 

youtputs should have changed 

shold a HI 

sboth outputs should have held their state 

;reset a LO 

youtputs should have changed 


;reset an output that is already LO 
;make sure that the outputs didn’t change 


;toggle from a LO 
;verify that outputs changed 
7set an output that is already HI 
;make sure the outputs didn’t change 
;toggle from a HI 
;verify that outputs changed 
flip-flops 
shold a LO 
sboth outputs should have held their state 
7set a HI 
;outputs should have changed 
zhold a HI 
;both outputs should have held their state 
;set an output that is already HI 
;make sure the outputs didn’t change 
7reset a LO 
;outputs should have changed 


j;reset an output that is already LO 
;make sure that the outputs didn’t change 


Figure 19. Flip-Flop Simulation Section (Cont'd.) 
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The file can now be simulated in the same manner as the basic 
gates design. The simulation results can be viewed either by 
examining the history or trace file, or by generating waveforms. 
The complete design file is shown in Figure 22 at the end of this 
section. 


Programming a Device 


After simulating the design, and verifying that it works, itis time to 
program adevice. There are several steps to programming, but 
the exact operation of the programmer naturally depends on the 
type of programmer being used. We willbe as explicit as we can 
here, but you will need to refer to your programmer manual for the 
specifics. 


The first thing that must be done after turning the programmer on 
is to select the device type. This tells the programmer what kind 
of programming data to expect. The device type is usually 
selected either from a menu or by entering a device code. Your 
programmer manual will have the details. 


Next a JEDEC file must be downloaded. To transfer the JEDEC 
file from the computer to your programmer, you will need to 
provide a connection, as shown in Figure 20, This is normally 
done with an RS-232 cable connected between the 
programmer's port and a serial port on the computer (usually 
COM1). 
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Figure 20. A Connector Must Be Provided Between the 
Computer and the Programmer 


If your programmer can perform functional tests, and you wish for 
those tests to be performed, you should download the '.JDC’ file; 
otherwise you should download the '.JED’ file. 


To download data, the programmer must first be set up to receive 
data. The programmer manual will tell you how to do this. The 
data can then be sent from the computer by hitting 


<F4> 


This sets up communication between the computer and the 
programmer. Whichever communication program is installed will 
be invoked. This is used to transmit the JEDEC file to the 
programmer. The details below assume you are using PC2; if not, 
follow the instructions for your program to accomplish the same 
steps. 


Before actually sending the data, you must verify the correct com- 
munication protocol. Check to make sure you know what protocol 
the programmer is expecting; then hit the <F2> key on the 
computer. This allows you to set up the baud rate, data bits, stop 
bits, and parity. 


Once the protocol has been set up, hit the <F1> key. You will be 
asked for the file name; enter the name of the JEDEC file you wish 
to use. The computer will then announce that it is sending the 
data, andtell you whenitis finished. Note that just because it says 
it has finished sending data does not mean that the data was 
received. Your programmer will indicate whether or not data was 
received correctly. 


Once the data has been received, the programmer is ready to 
program adevice. Place a device in the appropriate socket, and 
follow the instructions for your programmer to program the 
device. This procedure programs and verifies the connections in 
the device, and, if a '.JDC’ file was used, will perform a functional 
test. 


The programmer will announce when the programming proce- 
dure has beencompleted. You may then take the device and plug 
it into your application. 


If you have actually programmed one of the examples that we 
created above, you naturally don’t have a board into which you 
can plug the device. If you do have a lab setup, you may wish to 
play with the devices to verify for yourself that the devices perform 
just as you expected them to. 


You will find much more detail on many issues that were not 
discussed in this section in the remaining sections of this hand- 
book. This section should have provided you with the basic 
knowledge you need to understand the remaining design ex- 
amples in this book, and to start your own designs. 
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TITLE Basic gates 

PATTERN P0000 

REVISION A 

AUTHOR Stateyour Namehere 
COMPANY Nameofyour Co., Inc. 
DATE 7/22/87 


CHIP GATES PAL12H6 


*PINS 1 23 4 5 6 7 8 9 10 
A C DF GI JS K M GND 


*PINS 11 12 13 14 15 16 17 18 19 20 
N P B EH ROL Q Vee 





EQUATIONS 
B= /A ;inverter 
E = C*D 7AND gate 
H=F 7OR gate 
+G 
L=/I ;NAND gate after applying DeMorgan’s theorem 
+ /3 
+ /K 
O = /M*/N 7NOR gate after applying DeMorgan’s theorem 
R = P*/Q 7XOR gate, expanded 
+ /B*Q 
SIMULATION 


TRACE ONABCDEFGHIJKLMNOPQR 


jlook at the inverter 


SETF A ;set A HI 
CHECK /B verify that B is LO 
SETF /A sset A LO 
CHECK B ;verify that B is HI 


send of inverter trace 


;look at AND gate 


SETF /C /D jset C, D LO 

CHECK /E j;verify E LO 

SETF D 7C stays LO, D goes HI 
CHECK /E 7E should stay LO 
SETF c /D ;set C HI, DIO 

CHECK /E 7E should stay LO 
SETF D 7C stays HI, D goes HI 
CHECK E ;E should now be HI 


zend of AND gate trace 


jlook at OR gate 


SETF /F /G jset F, GLO 

CHECK /H ;verify H LO 

SETF G 7F stays LO, G goes HI 
CHECK H 7H should go HI 

SETF F 7G stays HI, F goes HI 
CHECK H 7H should stay HI 

SETF /G , ;F stays HI, G goes LO 
CHECK H 7H should stay HI 


;end of OR gate trace 


Figure 21. Complete Basle Gates Design File 
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zlook at NAND gate 
SETF /I /J /K 
CHECK L 

SETF K 

CHECK L 

SETF J /K 
CHECK L 

SETF K 

CHECK L 

SETF I /J /K 
CHECK L 

SETF K 

CHECK L 

SETF J /K 
CHECK L 

SETF K 

CHECK /L 


;look at NOR gate 
SETF /M /N 

CHECK O 

SETF N 

CHECK /O 

SETF M 

CHECK /O 

SETF /N 

CHECK /O 


;look at XOR gate 
SETF /P /Q 

CHECK /R 

SETF Q 

CHECK R 

SETF P /Q 

CHECK R 

SETF Q 

CHECK /R 


TRACE_OFF 


;IJK = 000 

;verify L HI 

;IJK = 001 

;L should still be HI 
;IJK = 010 

;L should still be HI 
;IJK = 011 

;L should still be HI 
;IJK = 100 

;L should still be HI 
;IJK = 101 

;L should still be HI 
;IJK = 110 

7L should still be HI 
;IJK = 111 

7L should go LO 

zend of NAND gate trace 


7set M, N LO 

;verify O HI 

7M stays LO, N goes HI 
70 should go LO 

7M, N now both HI 

30 should stay LO 

7M stays HI, N goes LO 
70 should stay LO 

;end of NOR gate trace 


;set P, Q LO 

;verify R LO 

7P stays LO, Q goes HI 
7R should go HI 

jynow P HI, Q LO 

7R should stay HI 
zboth inputs HI 

7R should go LO 


;end of trace 


Figure 21. Complete Basic Gates Design File (Cont'd.) 
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TITLE Basic flip-flops 
PATTERN P0001 

REVISION A 

AUTHOR Stateyour Namehere 
COMPANY Nameofyour Co., Inc. 
DATE 7/22/87 


CHIP Flip-flo PAL16R8 
*PINS 1 23 4 °5 6 7 8 9 10 
CLIK J K T PR CIR D S R_ GND 
7;PINS 11 12 13 14 15 

OE /SRC /SRT /DC /DT 


7PINS 16 17 18 19 20 
/TC /TT /JKC /JKT Vcc 


EQUATIONS 


yemulating all flip-flops with D-type flip-flops 





DT := D*/CLR youtput is D if not clear 
+ PR*/CLR ;or 1 if preset and not clear at the same time 
DC := /D*/PR soutput is /D if not preset 
+ CLR yor 1 if clear 
TT := T*/TT*/CLR 7g0 HI if toggle and not clear 
+ /T*TIT*/CLR ;stay HI if not toggle and not clear 
+ PR*/CLR 7g0 HI if preset and not clear at the same time 
TC := T*/TC*/PR 3go HI if toggle and not preset 
+ /TXTC*/PR ;stay HI if not toggle and not preset 
+ CLR 7g0 HI if clearing 
JKT := J*/JKT*/CLR 7;go HI if J and not clear 
+ /K*IKT*/CLR ;stay HI if not K and not clear 
+ PR*/CLR 7;g0 HI if preset and not clear at the same time 
JKC := /J3*/JKC*/PR ;go HI if not J and not preset 
+ K*IKC*/PR ;stay HI if K and not preset 
+ CLR 7;g0 HI if clear 
SRT := S*/CLR ;go HI if set and not clear 
+ /R*SRT*/CLR ;stay HI if not reset and not clear 
+ PR*/CLR ;go HI if preset and not clear at the same time 
SRC := R*/PR 7go HI if reset and not preset 
+ /S*SRC*/PR ;stay HI if not set and not preset 
+ CLR 7go HI if clear 


Figure 22. Complete Basic Flip-Flops Design File 
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TRACE_ON CLR PR D DT DC T TT TC JK JKT JKC SR SRT SRC 


pinitialize the circuit 


SETF CLR /PR 
CLOCKE CLK 


;set the clear pin 
;clear the circuit 


jmake sure the clear function worked 
CHECK /DT DC /TT TC /JKT JKC /SRT SRC 


SETF /CLR 


jxemove the clear signal 


;check out the preset function 


SETF PR 
CLOCKE CLK 


;set the preset pin 
;preset the circuit 


CHECK DT /DC TT /TC JKT /JKC SRT /SRT 


SETF /PR 


;xremove the preset signal 


;verify that clear has priority 


SETF PR CLR 
CLOCKEF CLK 


;set both clear and preset 


CHECK /DT DC /TT TC /JKT JKC /SRT SRC 


SETF /PR /CLR 


;xremove both preset, clear 


;disable all flip-flop inputs for now 
SETF /D /T /J /K /S /R 


;check out D-type flip-flops 


SETF D 
CLOCKF CLK 
CHECK DT /DC 


SETF D 
CLOCKE CLK 
CHECK DT /DC 


SETF /D 
CLOCKE CLK 
CHECK /DT DC 


SETF /D 
CLOCKF CLK 
CHECK /DT DC 


;set the D input HI from LO 

;clock the device 

;verify that the true output went HI and the complement went LO 
zhold a HI 

jverify that state maintained 

;set the D input LO from HI 

;clock the device 

jverify that the true output went LO and complement went HI 
zhold a LO 


jverify that state maintained 


jcheck out T-type flip-flops 


SETF /T 
CLOCKF CLK 
CHECK /TT TC 


SETF T 
CLOCKF CLK 
CHECK TT /TC 


SETF /T 
CLOCKE CLK 
CHECK TT /TC 


SETF T 
CLOCKE CLK 
CHECK /TT TC 


shold a LO 

;true output should still be LO, complement output HI 
ztoggle from a LO 

;both outputs should have changed state 

zhold a HI 

sboth outputs should have held their state 

;toggle froma HI 

sboth outputs should have changed state 


Figure 22. Complete Basic Flip-Flops Design File (Cont’d.) 
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;check out J-K 
SETF /J /K 
CLOCKF CLK 
CHECK /JKT JKC 


SETF J /K 
CLOCKF CLK 
CHECK JKT /JKC 


SETF /J /K 
CLOCKF CLK 
CHECK JKT /JKC 


SETF /J K 
CLOCKF CLK 
CHECK /JKT JKC 


CLOCKF CLK 
CHECK /JKT JKC 


SETE J K 
CLOCKE CLK 
CHECK JKT /JKC 


SETF J /K 
CLOCKF CLK 
CHECK JKT /JKC 


SETF J K 
CLOCKF CLK 
CHECK /JKT JKC 


scheck out S-R 
SETF /S /R 
CLOCKF CLK 
CHECK /SRT SRC 


SETF S /R 
CLOCKF CLK 
CHECK SRT /SRC 


SETF /S /R 
CLOCKF CLK 
CHECK SRT /SRC 


SETF S /R 
CLOCKF CLK 
CHECK SRT /SRC 


SETF /S R 
CLOCKF CLK 
CHECK /SRT SRC 


CLOCKEF CLK 
CHECK /SRT SRC 


TRACE_OFF 


flip-flops 


flip-flops 





shold a LO 

;both outputs should have held their state 
;set a HI 

youtputs should have changed 

shold a HI 

;both outputs should have held their state 
;reset a LO 

;outputs should have changed 


;reset an output that is already LO 
;make sure that the outputs didn’t change 


;toggle from a 1O 

jverify that outputs changed 

;set an output that is already HI 
;make sure the outputs didn’t change 
;toggle from a HI 


;verify that outputs changed 


zhold a LO 

;both outputs should have held their state 
7set a HI 

youtputs should have changed 

zhold a HI 

;both outputs should have held their state 
;set an output that is already HI 

;make sure the outputs didn’t change 
;reset a LO 

youtputs should have changed 


;reset an output that is already LO 
;make sure that the outputs didn’t change 


Figure 22. Complete Basic Flip-Flops Design File (Cont’d.) 
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Programmable logic devices (PLDs) are used in digital systems 
design for implementing a wide variety of logic functions. These 
logic functions range from simple random logic replacement (dis- 
cussed earlier in the Beginner's Guide to PLDs on page 2-2) to 
complex control sequencers (discussed in the subsequent sec- 
tions). Programmable logic devices offer the multiple advantages 
of low cost, high integration, ease of use, and easier design 
debugging capability not available in other systems design op- 
tions. These are described in the introduction (on page 1-1) tothis 
handbook. In the following discussion we will detail the PLD 
design process. 


Most PLDs have an AND-OR array structure with programmable 
connections in either or both of the arrays. Aprogrammable array 
implies that the connections can be programmed by the user. 
PLDs are classified depending on which of the two arrays is 
programmable or fixed. The popular PAL (Programmable Array 
Logic) devices have a programmable AND array and a fixed OR 
array. PAL devices are used for a wide variety of combinatorial 
and registered logic functions. PROM (Programmable Read- 
Only Memory) devices—used often as memory and seldom as 
logic—have a fixed AND array and a programmable OR array. 
Both arrays are programmable in PLS (Programmable Logic Se- 
quencer) devices. These devices are used for special state 
machine applications discussed on page 2-117. In addition, there 
are other programmable logic devices that combine program- 
mable arrays with dedicated logic for providing optimal function- 
ality for a specific system application. The PROSE (PROgram- 
mable SEquencer) device is one such device, with its architec- 
ture optimized for state machine designs. A discussion of the 
various PLD architectures is included on page 1-7. In this discus- 
sion we will also examine the various design constraints to be 
considered when selecting the correct architecture for a given 
application. 


All digital logic can be efficiently reduced to two fundamental 
gates, AND and OR, provided boih true and complement versions 
of all input signals are available. Such logic is generally built 
around what is known as the sum-of-products (AND-OR) form. 
Please see page 6-1 far details on sum-of-products form and 
Boolean logic. Programmable logic devices are ideal for imple- 
menting such two-stage logic in the AND and OR arrays. 


Various process technologies offer many design options for 
PLDs. The connections in the programmable arrays can be fuse- 
based, commonly used in both ECL and TTL bipolar technolo- 
gies, E/EEPROM cell-based in UV-EPROM and EEPROM 
CMOS technologies and RAM cell-based in CMOS RAM technol- 
ogy. ECL PLDs are used for very high-speed designs (greater 
than 125 MHz bandwidth), while CMOS is used for low-power 
designs. Bipolar TTL fuse-based PLDs cover the vast midrange 
of applications and are the most popular PLDs. The selection of 
‘echnology is mostly dependent upon the system speed and 
dower constraints. Most design engineers are familiar with these 


constraints, which not only dictate the technology of PLDs but of 
all other logic used in a system. In the following discussion, we 
will assume TTL fuse-based technology for simplicity. A detailed 
discussion of the technology is included on page 3-130. 


Designing with PLDs involves the use of design software and a 
device programmer (Figure 1). The design software eliminates 
the need to identify every connection to be programmed for 
implementing the desired sum-of-products logic. The design 
process begins with the creation of a design file which specifies 
the desired function. The function is typically represented by its 
sum-of-products form and can be derived directly from the timing 


diagram and/or truth tables. Occasionally Karnaugh maps and : ga 


state diagrams are also used. The design file is then assembled 
to produce the “JEDEC’ file. The JEDEC file gets its name from 
the fact that it is an approved JEDEC standard for specifying the 
state of every connection on the device. Simulation can then be 
performed. Ifthe design is correct, the JEDEC file is downloaded 
into a device programmer for programming the connections on 
the device. The device can then be plugged into the PC board 
where it will function. The entire procedure can often be per- 
formed with the designer never having to leave the desk. Most 
programmers interface to personal computers, so that the design 
file can be edited, assembled, simulated, and downloaded, and 
the device programmed, all in one place. 


DESIGN 
FILE 









PLD DESIGN 
SOFTWARE 
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Figure 1. PLDs are Designed using Software 
and a Device Programmer 
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The first stage in a PLD design process (Figure 2) is the concep- 
tualization of a design problem; the second is the selection of the 
correct device; the third is the implementation of the design, which 
also includes simulating the design with test vectors; and finally, 
the actual programming and testing on a system board. We will 
take a simple design example and go through the various stages 
of this design process. 


CONCEPTUALIZE A 
DESIGN PROBLEM 
SELECT 
DEVICE 
IMPLEMENT 
DESIGN 
PROGRAM 
PLD 












PLUG PLD 
INTO BOARD 


Figure 2. Programmable Logic Device Design Process 
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RESET ° 


BUTTON e 


INTERFACE 
LOGIC 










ROMCS1 


Conceptualizing a Design 


The first step in the PLD design process is also required for any 
SSI/MSI design. An advantage of PLDs is that at this stage the 
designer needs to be concerned only with the required logic 
function. With SSI or MSI, various device logic limitations must 
be accounted for before the design can be started. Clearly a 
designer needs to develop a brief and complete functional de- 
scription, based upon the system design requirements. 


We will take the example of a simple address decoder circuit 
required fora 68000 microprocessor. The microprocessor has 24 
address lines along with separate read and write signals. It 
requires some ROM to store the boot-up code as well as some 
RAM for storing and executing programs. The purpose of the 
address decoder circuitry is to select one of the memory ad- 
dresses at atime. The RAMs and ROMs are assigned addresses 
on the 68000 microprocessor address space. The address 
decoder circuit has to select one of the RAMs or ROMs for a 
specific range of addresses, called the address space. This 
selection is accomplished by asserting the specific chip-select 
signal for the RAM or ROM when the microprocessor accesses 
one of the addresses in the address space. There is additional 
circuitry in a typical microprocessor system for addressing I/O 


000000-OFFFFF 
100000—1FFFFF 
200000-2FFFFF 
300000-3F FFFF 
400000-4FFFFF 
500000-—5SFFFFF 


600000-7F FFFF 





404 03 


Figure 3. Memory Address Map 


INIT 
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Figure 4. Microprocessor to Memory Interface 
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devices (such as disk controllers). These devices also require 
that chip-select signals be asserted when the microprocessor 
addresses them. Figure 3 shows an example address mapfora 
68000 microprocessor. 


Figure 4 shows the circuitdiagram. The address signals fromthe 
68000 microprocessor are inputs to the interface logicblock. The 
outputs generated are ROMCS1, ROMCS2, and RAMCS. The 
generation of signals for selecting device I/Osis similar and is not 
shown here for the sake of simplicity. Other system inputs to the 
interface are the address strobe signal generated by the 68000 
microprocessor as well as the read/write signal. The truth table 
for generating the outputs is shown in Figure 5. This truth table 
is derived from the memory address map and the functional de- 
scription of the design. 


ADDRESSES 
HEX | A2t | SIGNAL | 


00000-OFFFF 





10000-1FFFF 


20000-2FFFF 


30000-3FFFF 


40000-4FFFF 


50000-5FFFF 





Figure 5. Truth Table for Chip-Select Signals 
Device Selection Considerations 


The first task forthe designer is to identify the design problem and 
classify it as a combinatorial function or a registered function, 

depending upon whether or not registers are required. In most 

cases, this decision depends upon the functional nature of the 

problem. Sometimes timing and logic considerations can also 

dictate the use of registers; this willbe discussed later. Registers 

are usually not required for such simple combinatorial functions 

such as encoders, decoders, multiplexers, demultiplexers, ad- 

ders, and comparators. Registers are required, however, for 

functions such as counters, timers, control signal generation, and 
state machines. No registers are required for this simple address 
decoding example. 


Abrief look at the places where various architecturalfeatures are 
used will help us select a device for ourdesign example. Based 
on historic record of usage and architectural functionality, PLDs 
can beclassified into three basic application segments. Figure 6 
shows the optimal placement of PAL, PLS, and PROSE/FPC 
(Fuse Programmable Controller) devices for these three seg- 
ments; combinatorial designs, simple registered designs and 
complex registered (sequencer) designs. This classification also 
shows the available functionality and device speeds. These are 
rough estimates that most designers implicitly assume and de- 
velop with experience. These estimates vary drastically among 
different designers, based on personal preferences and experi- 
ence. They have been included here to provide a complete road 
map for device selection and should be considered strictly as a 
tule of thumb. 


COMBINATORIAL 
DESIGNS SPEED 


FUNCTIONALITY ——————> 
SIMPLE 
REGISTERED SPEED 
DESIGNS 
FUNCTIONALITY ————>> 


STATE MACHINES/ 
SEQUENCERS, 






COMPLEX SPEED 
REGISTERED PROSE/FPC 
DESIGNS DEVICES 
404 06 





FUNCTIONALITY —-—-—_—_> 


Figure 6. Various PLD Design Options 


Based on these criteria, the best choice for our combinatorial 
design would be a PAL device. The task nowis to select a PAL 
device forimplementing the desired function. The main function- 
specific selection considerations are shown in Figure 7. These 
items are applicable to most designs. There are other function- 
specific issues which need addressing when selecting a device; 
these willbe discussed in the later sections of the book, when we 
address advanced designs. 


Number of Input pins 
Number of output pins 
Number of I/O pins 

Device speed 

Device power requirements 
Number of registers (if any) 
Number of product terms 
Output polarity control 


PNOGPON=> 


Figure 7. General Device Selection Considerations 


The first resource that must be provided in a PLD is the number 
of pins needed for the basic logic function. This consists of the 
number of input and output pins. Many PLDs have internal 
feedback, which allows the generated output signal to be reused 
as an input. The same feedback also allows the pin to be used 
as adedicated input, if required. This is especially useful for fitting 
various designs with different input/output requirements on the 
same device. The I/O pin capability of certainPLDs can also be 
very useful forcertain bus applications and willbe discussed later. 
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Figure 8 and 9 show the pin resources available on various 
combinatorial and registered PLDs. The task is as simple as 
counting the number of input, output and I/O pins required by the 
design and picking a PLD from the table which has the requisite 
number of pins. We will pick a PAL16L8 for our design, since it 
has sufficient pin resources. 


DEVICE TOTAL DEDICATED 
NUMBER INPUT OUTPUT 


OF PINS PINS PINS 


NNN ALARMOMHM OO 





DEDICATED 





The next selection issue is the device speed. The most important 
timing consideration for combinatorial PLDs is the propagation 
delay (t,,,) of signals from the input to the output of the device. For 
registered PLDs, the important timing consideration is the device 
clocking frequency. This clocking frequency (shown in Figure 9) 
is in turn determined by sum of the register setup time (t,,), and 
clock-to-output propagation delay (t,,,,). Most systems impose 


LOGICAL 
PRODUCT 
TERMS/ 
OUTPUT 


SPEED GRADES 
t,, INns 


10, 15, 25, 35 
25 
20, 30 
15, 25 





15, 25, 35 
15, 20, 25, 35 
20, 30 
35 
15, 25 
20, 30, 40 


@ 
4SHLoowowolipwn 
oOo o 


1 © © 


ECL 


Figure 8. Combinatorial PLDs with Pin Resources and Speed Requirements 
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DEVICE TOTAL | DEDICATED | DEDICATED vO LOGICAL SPEED GRADES 
NUMBER INPUT REGISTERED PINS PRODUCT FREQUENCY IN 
OF PINS PINS OUTPUT TERMS/ MHz 
PINS OUTPUT i 
TTL 
16R8 . 55.5, 37, 25, 16 
16R6 2 55.5, 37, 25, 16 
16R4 4 55.5, 37, 25, 16 
16RP8 . 22.2 
16RP6 2 22.2 
16RP4 o 22.2 
16RA8 20 
16X4 8 4 14 
2388 9 4 33.3, 28.5 
20R8 24 12 8 . 8 37, 25, 16 
20R6 24 12 6 2 8 37, 25, 16 
20R4 24 12 4 4 8 37, 25, 16 
20X10 24 10 10 . 4 22.2 
20X8 24 10 8 2 4 22.2 
20X4 24 10 4 6 4 22.2 
20RP10 24 10 10 - 8 37, 25 
20RP8 24 10 8 2 8 37, 25 
20RP6 24 10 6 4 8 37, 25 
20RP4 24 10 4 6 8 37, 25 
20XRP10 24 10 10 . 8 30, 22.2, 14 
20XRP8 24 10 8 2 8 30, 22.2, 14 
20XRP6 24 10 6 4 8 30, 22.2, 14 
20XRP4 24 10 4 6 8 30, 22.2, 14 
20RS10 24 10 10 19.2 
20RS8 24 10 8 19.2 
20RS4 24 10 4 19.2 
20RA10 24 10 0-10 33, 20 
22RX8 24 14 0-8 28.5 
22V10 24 12 0-10 40, 28.5, 18 
32VX10 24 12 0-10 25, 22.2 
32R16 40 16 16 16 
CMOS 
C16R8 20 8 8 
C16R6 20 8 6 
C16R4 20 8 4 
C20R8 24 12 8 
C20R6 24 12 6 
C20R4 24 12 4 
C22V10 24 
C29M16 
C29MA16 








10H/020EV/G8 
10H20G8 
(Latched) 













Figure 9. Registered PLDs with Pin Resources and Speed Requirements 
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ADDRESS 
DECODING 


MEMORY ACCESS TIME 
TIME TIME 


ADDRESS 
DECODING 


10 ns —|—______ 220 ns —————_——_>«____ 10 ns 
240 ns 


READ / WRITE CYCLE TIME 


404 10 


Figure 10. System Timing Requirements 


some timing restrictions on the internal logic functions. These 
restrictions will determine the necessary t,,, (for combinatorial 
devices) or f,,, (for registered devices). Details on clocking 
frequency are provided on page 2-64. 


In our design example, the PLD will primarily perform address 
decoding. The critical system timing constraint is determined by 
the read/write cycle time of the microprocessor and the memory 
access time available (Figure 10). Most microprocessors allow 
anywhere from 10 to 35 ns for address decoding. Thatis, 10-35 
ns after the address is available, the correct memory chip-select 
signal should be asserted. In our design example the available 
cycle time of 240 ns and memory access time of 220 ns leaves 
barely 10 ns for address decode time. From the table in 
Figure 8, we can check the propagation delay and select the 
appropriate speed of PAL16L8 for our design, which is t,,=10ns. 


We have already briefly discussed the types of applications 
where registers are needed. Sometimes the consideration of 
system timing can affect whether or not registers are needed. 
Devices with registers can hold a signal stable for the long 
durations required by the addressed peripheral or memory. 
However, this slows the initial response or access time of the 
device since the chip select must wait for the setup time before the 
rising edge of the clock cycle. Devices without registers provide 
fast access time but hold the signal valid only as long as the input 
conditions are valid. In most address decoders, the address 
signals are kept asserted by the microprocessor until the read/ 
write cycle is completed. In this case, the registers are not 
required for holding the signals asserted. 


The remaining two general design considerations are the number 
of product terms and output polarity. We will discuss these two as 
we implement the design in the next section. 


Implementing a design 


Implementing a design (Figure 11) requires the creation of a 
design file. This design file is called the PAL device Design 
Specification (PDS) in the PALASM 2 software package. Details 
of PALASM 2 software can be found in section 4 of this handbook. 
The design file contains three types of information. 


1. Basic bookkeeping information 
2. Design syntax 
3. Simulation syntax 


The first section provides documentation information, along with 
the name of the device selected, and the signal names. Allofthe 
signal names are assigned to pins and are defined as inputs or 
outputs. The Declaration Section contains the signal names in 
the ascending order of pin numbers. Figure 12 shows the pin 
assignments of the signals used in our example. Signals which 
are used as I/Os are assigned as outputs and are used as both 
inputs and outputs in the sum-of-products logic explained later. 


Once the design file is complete, it is then assembled and 
simulated. Once it passes assembly and simulation, the resultant 
JEDEC file is downloaded to a device programmer for configuring 
the device. 









SELECT DEVICE 


CREATE DESIGN FILE 
WRITE BOOLEAN 
LOGIC EQUATIONS 


ASSEMBLE 
DESIGN FILE 
SIMULATE 
THE DESIGN 


CREATE DESIGN FILE 
WRITE STATE 
MACHINE 






YES 
DOWNLOAD 
JEDEC FILE 


PROGRAM PLD 


Figure 11. Implementing a Design 
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Title M68000_ ADDRESS DECODER 
Pattern P8000 

Revision A 

Author ENGINEER 

Company MMI SUNNYVALE, CALIFORNIA 
DATE 07/21/87 


CHIP DECODER PAL16L8 


A22 
ROMCS2 


NC A23 A21 


OE ROMCS1 


TOP BOTTOM AS RW NC 
RAMCS INIT NC 


GND 


NC NC NC vcc 


Figure 12. Declaration Section of a Design File 


Design Syntax 


As shown in Figure 11, there are two options available to the 
designer for expressing the design. The first is through traditional 
Boolean logic equations; the second is through a state machine 
syntax. The Boolean logic equations are the only option for 
combinatorial designs and can also be efficient for some regis- 
tered designs. The Boolean equations can be derived from a 
combination of the functional description, the truth table and/or 
the timing diagrams (Figure 13). The state machine approach is 
ideal for large registered control designs, and canbe derived from 
the functional description, state table, state diagram and/or the 
timing diagram (Figure 14). The latter approach can also be used 
with PROSE and PLS devices, and will be discussed in a later 
section on state machine design. 


Boolean Logic Equations 


Boolean equations are used to represent the sum-of-products 
logic form (see page 6-1 for a logic design review). The Boolean 
equations are ideally suited for representing the two-level 
AND-OR logic available in most PLDs. 


A conventional approach to the design is to convert the design 
problem to its discrete logic implementation. Such random SSI 
and MSI logic can be easily implemented in PLDs (see the 
Beginner's Guide, page 2-2). This usually involves converting to 
sum-of-products Boolean logic form. This approach can be a 
chore, and much effort can be saved by implementing a design 
with PLDs in a sum-of-products form right from the start. This 
essentially means that the designer does not have to design 
around the limitations of fixed SSI and MSI functions. A direct 
implementation of a design in sum-of-products form ina PLD can 
also yield a faster circuit. 


Boolean equations can be directly derived from the truth table or 
timing diagram (Figure 13). The truth table is used more often in 
simple combinatorial designs. The timing diagram method is 
used more often in registered control designs. We will first 
discuss the truth table method and then discuss the details of the 
timing diagram method. The state-machine-based registered 
design approach will be discussed on page 2-101. 
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In addition to specifying the logic function, the Boolean equations 
in the design file help document the design. There is no need to 
draw out an equivalentschematic. This allows design modularity; 
the schematic can just show a block for a particular PLD. 
Separate supporting documentation (the design file) provides the 
details without cluttering the drawing. 


Truth-Table-Based Design 


The requirements for our particular design example can be easily 
converted to a truth table format (Figure 15). This truth table is 
based upon the functional description of the design, and is 
derived from the address map (Figure 3) and the truth table 
(Figure 5). 





FUNCTIONAL 
DESCRIPTION 








TRUTH TABLE TIMING DIAGRAM 






LOGIC EQUATIONS 


ASSEMBLE 
DESIGN FILE 


Figure 13. Writing Boolean Logic Equations 
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STATE DIAGRAM 






FUNCTIONAL 
DESCRIPTION 


STATE TABLE TIMING DIAGRAM 


STATE MACHINE FILE 


ASSEMBLE 
DESIGN FILE 
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Figure 14. State Machine Dsecription 














Figure 15. Truth Table for the Address Decoder 


There are three additional input signals in this design example. 
The first, RW, is generated by the microprocessor, and distin- 
guishes between read and write cycles. Since the ROM data is 
only for reading, the ROMCS1 and ROMCS2 signals are asserted 
only when RW is high (when the microprocessor attempts to read 
the ROM) and are not asserted for the write cycle. On the other 
hand, RAMCS is generated for both read and write cycles and the 
state of signal RW is “don’t care”. 


The second additional signal, AS, is the address strobe signal 
generated by the microprocessor, and is asserted only when the 
address lines carry a valid address. All of the chip select signals 
need to be gated with the AS signal to ensure that they are only 
generated for valid addresses, and no spurious chip selects are 
generated. 


The last signal is the INIT signal, which is a system initialization 
signal. This signal is used to initialize the microprocessor for a 
“warm boot,” and none of the chip selects is allowed when this 
INIT signal is asserted. 


Writing Boolean equations from the above logic is very straight- 
forward. The output signal names, along with their polarity, are 


assigned to sum-of-product equations, which are based upon 
inputs and their polarities. 


/ROMCS1 = /A23 * /A22 * /A21 * INIT * /AS * RW 
. ZROMCS2 = /A23 * /A22 * A21 * INIT * /AS * RW 
/RAMCS = /A23 * A22 * /A21 * INIT * /AS 

+ /A23 * A22 * A21 * INIT * /AS 

+ A23 * /A22 * /A21 * INIT * /AS 

+ A23 * /A22 * A2l1 * INIT * /AS 


Figure 16. The Implementation in Boolean Equations 


The equations are derived directly from the truthtables. Eachone 
of the AND equations uses up one product term of the device as 
shown in Figure 16. One device selection consideration is to 
ensure that all the outputs have sufficient product terms to 
accommodate the desired function. The device we have selected 
for this design is the popular PALi6L8D. This device has 
sufficient inputs, outputs, and product terms, and has the requi- 
site speed for the design. 
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This brings us to the issue of output polarity. Suppose we had to 
generate active-HIGH outputs. In that case the output equations 
for the ROMCS1 signal would be : 


ROMCS1 = /A23 * /A22 * /A21 * INIT * /AS * RW 


As the PAL16L8 has active-LOW outputs only, this equation’s 
output polarity needs to be inverted to be able to fit the device. 
Using DeMorgan’s theorem for Boolean logic we get: 


/ROMCS1 = A23 + A22 + A21 + /INIT + AS + /RW 


This equation requires a large number of product terms (six). 
Some signals are efficient and use fewer product terms in their 
true form, while others are more efficient in their inverted form. 
This poses no problems until the signals exceed the device limit 
of 8 product terms provided by the PAL16L8 and other standard 
PAL devices. In such cases, an output polarity control is required. 
The PAL16RP8 family provides just the control for fitting the 
required number of product terms on the device. The device 
selection issues of product terms and output polarity also apply to 
registered designs. A detailed discussion of output polarity is 
included on page 6-15. 


Timing-Diagram-Based Design 


Until now, we have discussed a PLD design using truth tables as 
the primary design vehicle. inthis section we will attempt a design 
using a timing diagram as a design vehicle. 


Earlier in the address decoder design we mentioned the INIT 
signal. This INIT signal is essentially an initialization signal forthe 
entire system, and can be generated by the PAL16L8. Note that 
the PAL16L8D used for decoding still has several unused outputs 
left, one of which can be used to generate the INIT signal. The 
INIT signal is used internally (via feedback) for disabling the chip 
selects during initialization. Externally it can be used to initialize 
othersystem signals. This INIT signal is generated from aRESET 
switch connected to the inputs of the PAL16L8 device as shown 
in Figure 17. 


To avoid unwanted initialization, the RESET switch must be 
debounced. Thatis, we want the INIT signal to remain HIGH until 
the switch actually contacts the bottom side. Once the bottom 
side is hit, INIT should be asserted active LOW. Once asserted, 
it should stay LOW and not change until the top side is hit again. 
The timing requirements of the debounce circuitry are shown in 
Figure 18. Signals TOP and BOTTOM are inputs to the program- 
mable logic device. These signals are activated when the RESET 
switch touches the top and the bottom contacts, respectively. 


vcc 








PAL DEVICE 
DEBOUNCE 
CIRCUIT 


hay 


PAL16L8 
404 17 


Figure 17. RESET Switch for System Initialization 


We can formulate the equations by looking at the timing require- 
ments of the debounce circuitry shown in Figure 18. The idea is 
to identify the key elements of this timing diagram. The arrows in 
Figure 18 show the critical events. The first arrow shows the 
normal state of all the pins when the RESET switch is not 
asserted. Subsequent arrows show each event in the timing of 
the INIT signal, depending upon the movement of the switch. 


RESET 
SWITCH 


TOP 


BOTTOM 


INIT 


404 18 





Figure 18. Timing Diagram for the Debounce Switch 


The logic level of the signals at each critical event carries useful 
logic information for deriving Boolean equations. This logic 
information for each event is converted into direct Boolean 
equations as shown in Figure 19 below. For example, at instant 
1 the INITsignalremains HIGH as long as the TOP signal remains 
LOW; this is converted to INIT = /TOP * BOTTOM. 


1. Normal state INIT = /TOP 
2. Switch travels INIT = TOP * BOTTOM * INIT 
from TOP to BOTTOM 


/INIT 


3. Switch contacts /BOTTOM 


BOTTOM 


4. Switch travels /INIT = /INIT * BOTTOM * TOP 


form BOTTOM to TOP 


5. Normal State Again 


Figure 19. Boolean Logic at Every Instant of Timing 


Since we are working with an active-LOW device, we can com- 
bine the two active-LOW events into one equation: 


/INIT = /BOTTOM 
+ /INIT * BOTTOM * TOP 


Minimizing, this becomes: 


/INIT = /BOTTOM 
+ /INIT * TOP 


This can also be done by way of a truth table and Karnaugh map. 
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Figure 20. Truth Table of INIT logic 


Here TOP or BOTTOM will be LOW if contacted. Note that both 
TOP and BOTTOM can not be contacted at the same time. The 
truth table of Figure 20 yields the Karnaugh map shown in Figure 
21, Grouping the zeros (because the PAL16L8 offers active-LOW 
outputs) yields the Boolean equation identical to the one derived 
from the timing diagram. 


/TOP 
/BOTTOM 


/RESET 
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Figure 21. Karnaugh Map of INIT Signal Logic 


There is essentially no difference between the truth table and 
timing diagram techniques for writing Boolean logic. A careful 
analysis will indicate that we implicitly assumed a truth table inthe 
timing diagram example also. Some designers prefer to make a 
separate truth table (at least in the first few PLD designs), while 
others prefer to design directly from timing diagrams, While the 
truth table method allows a more optimal utilization of product 
terms, the timing diagram method is easier to visualize as it 
retains the design perspective. In both cases the logic should be 
minimized by the design software to ensure that the design is 
testable. Refer to page 3-108 for a discussion on testability. 


Most experienced designers understand the tradeoffs for device 
selection. They implicitly go through the steps of design concep- 
tualization and device selection, explained earlier. They typically 
draw a block around the logic being designed, with the previous 
knowledge that it would fit a PLD which has sufficient inputs, 
outputs, I/Os and product terms. Now that we have selected a 
device and implemented the design, the next step is design 
simulation. 


Simulation 


Design simulation is an integral part of the design process, as 
shown in Figure 22. The purpose is to exercise all of the inputs 
and test the response of outputs to verify that they will work as 
desired in the system. These are essentially test vectors which 
designate the state of every input on the device; the outputs are 





then checked for an appropriate response. The simulation test 
vectors identify any flaws in the design equations which could 
affect the logical operation of the devices programmed. Thus the 
simulation vectors serve as a design debugging tool. 


Simulation test vectors will eventually make up part of a larger set 
of test vectors called “functional test vectors”. These functional 
test vectors are used to exercise a real device after programming 
to identify any individual devices which are defective. Other 
means of identifying defective devices, such as signature 
analysis, are also available. Allof these are discussed in detail on 
page 3-128. In this section we will strictly focus on simulation 
vectors. 


Simulation is included in the design file along with the logic 
equations. There is little standardization in these simulation 
expressions among various PLD design software packages, 
although most of them rely on test vectors to exercise the logic. 
PALASM 2 software offers an advanced event-driven capability 
that retains a closer design perspective than simulation vectors. 
Details of PALASM 2 simulation syntax are included on page 
4-137. 


The simulation vectors or events can be directly derived from the 
truth table and the timing diagram of the design. The logic level 
and functions of all signals canbe expanded and rewritten in atest 
vector form by the software. For example, the truth table for the 
address decoder example discussed earlier can be easily rewrit- 
ten as shown in Figure 23. 
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Figure 22. Device Simulation and Programming 





2-30 


zl Monolithic LAr Memories Pa | 


PLD Design Methodology 


A23. A22. A21 TOP BOTTOM AS RW ROMCS1 ROMCS2 RAMCS INIT 
0 0 oO 0 1 1 1 









2) 
ro) 
ro) 
Oo 
oO 
—_ 
-x 
x 
ao 
a 


co 
are p> Le) Se) ea) ee] Sea} ST) TS 








H H 
0 0 1 0 1 1 1 H H H 
0 0 1 0 1 0 1 H L H 
0 1 0 0 1 1 X H H H 
0 1 0 0 1 0 X H H L 
0 1 1 0 1 1 X H H H 
0 1 1 0 1 0 X H H L 
1 0 0 0 1 1 Xx 
1 0 0 0 1 0 X 






x=jxrct 
xr 


oo oo 

—_ eh 

—oO oo 

x< x< = 

<x>x« | «Kx 

pa nes =x 

oe ales be SO Oe De 
ae 





«x KK 
ps a a a 
ps gee ee Dm ee 0 


Figure 23. Truth Table Used to Derive Simulation Vectors 


These are essentially the simulation vectors which will allow us to Once the simulation is complete, the design file can be as- 
define the inputs to the device and check the outputs of the sembled to generate the JEDEC file. In the preceding discus- 
device. The PALASM 2 syntax for these simulation vectors is sions we have assumed prior knowledge of the design file 
shown in Figure 24. assembly. The procedure for assembly varies with different 

software packages and is covered for the PALASM 2 software 
There is a direct relationship between the truth table vectors of package in the Beginner’s Guide (page 2-6). 


Figure 23 and the simulation shown in Figure 24. The simulator 
then interprets the design file and generates the output logic 
levels and/or waveforms, which can be checked by the designer. 
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SETF /A23 /A22 /A21 
CHECK ROMCS1 ROMCS2 


SETF /AS 

CHECK ROMCS1 ROMCS2 
SETF AS 

SETF /RW /AS 

CHECK ROMCS1 ROMCS2 
SETF RW AS 


SETF /A23 /A22 A21 
SETF /AS 

CHECK ROMCS1 ROMCS2 
SETF AS 

SETF /RW /AS 

CHECK ROMCS1 ROMCS2 
SETF RW AS 


SETF /A23 A22 /A21 
SETF /AS 

CHECK ROMCS1 ROMCS2 
SETF AS 

SETF /RW /AS 

CHECK ROMCS1 ROMCS2 
SETF RW AS 


SETF /A23 A22 A21 
SETF /AS 

CHECK ROMCS1 ROMCS2 
SETF AS 

SETF /RW /AS 

CHECK ROMCS1 ROMCS2 
SETF RW AS 


SETF A23 /A22 /A21 
SETF /AS 

CHECK ROMCS1 ROMCS2 
SETF AS 

SETF /RW /AS 

CHECK ROMCS1 ROMCS2 
SETF RW AS 


SETF A23 /A22 A21 
SETF /AS 

CHECK ROMCS1 ROMCS2 
SETF AS 

SETF /RW /AS 

CHECK ROMCS1 ROMCS2 
SETF RW AS 


SETF TOP 

SETF /BOTTOM 

CHECK ROMCS1 ROMCS2 
SETF BOTTOM 

CHEICK ROMCS1 ROMCS2 
SETF TOP 

CHECK ROMCS1 ROMCS2 


/TOP BOTTOM AS RW 
RAMCS INIT 


; READ CYCLE VALID ROMCS1 ADDRESS 
RAMCS INIT 


; WRITE TO ROMCS1 NOT POSSIBLE 
RAMCS INIT 


; SET UP ROMCS2 ADDRESS 
7; READ CYCLE VALID ROMCS2 ADDRESS 
RAMCS INIT 


; WRITE TO ROMCS2 NOT POSSIBLE 
RAMCS INIT 


; SET UP RAMCS ADDRESS 
; READ CYCLE VALID RAMCS ADDRESS 
RAMCS INIT 


; WRITE CYCLE VALID RAMCS ADDRESS 
RAMCS INIT 


SET UP RAMCS ADDRESS 
READ CYCLE VALID RAMCS ADDRESS 
RAMCS INIT 


Ne Ne 


; WRITES CYCLE VALID RAMCS ADDRESS 
RAMCS INIT 


; SET UP RAMCS ADDRESS 
; READ CYCLE VALID RAMCS ADDRESS 
RAMCS INIT 


; WRITE CYCLE VALID RAMCS ADDRESS 
RAMCS INIT 


; SET UP RAMCS ADDRESS 
; READ CYCLE VALID RAMCS ADDRESS 
RAMCS INIT 


; WRITE CYCLE VALID RAMCS ADDRESS 
RAMCS INIT 


; RESET SWITCH TRAVELING DOWN 
; RESET HITTING BOTTOM 
RAMCS INIT 
; RESET TRAVELING TO TOP 
RAMCS INIT 
; RESET HITTING TOP 
RAMCS INIT 


Figure 24. PALASM 2 Software Simulation for Address Decoder 


an Monolithic ara Memortles an 


PLD Design Methodslogy 





DOWNLOAD JEDEC 
FILE TO DEVICE 















PROGRAMMER 
A 
' l 
' ' 
t ' 
' 1 
1 1 
i ' 
‘ | PROGRAM 
: ' PLD 
' H 
‘ 1 
' 1 
' 1 
t t 
besececesctisbicse sec saws a} 
epee nas eoeeevae were PERFORMED BY 
r DEVICE PROGRAMMERS 
TEST PLD WITH 
SIMULATION AND 
OTHER TEST VECTORS 
TEST 
PLD 


404 25 i ee Seen ter 


USE ON A BOARD 





Figure 25. Device Programming and Testing 


Device Programming and Testing 


Once the design simulation is completed, the final step is device 
programming and testing (Figure 25). Programmers are avail- 
able from a variety of vendors. It is important to note that 
Monolithic Memories qualifies programmers upon verifying that 
the algorithms used by the programmers. are correct and that 
other basic criteria are met. When purchasing a programmer, 
check that the programmer is qualified for the devices you intend 
to use. 


There are two types of programmer available; menu-driven or 
device code based. The menu-driven programmer directly indi- 
cates the part type being programmed, whereas the latter type 
requires the user to enter the device code before programming. 
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Once the JEDEC fuse file has been downloaded, the programmer 
can program the device; the PLD is then ready for use. The 
programmer also verifies the connections after the programming 
cycle. Programmers also provide the capability of reading a 
previously programmed device and creating duplicates of that 
device. 


Testing PLDs 


The testing of PLDs can be performed by the device programmer 
or by other test equipment. For a manufacturing environment 
where high yields are required, device testing is critical. This 
subject merits a separate discussion and has been included on 
page 3-128. After testing is completa, the device security cells 
may be programmed if desired to secure the design from copying. 
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In this section we will take a detailed look at several aspects of 
combinatorial logic design. Most combinatorial design applica- 
tions can be easily segmented into five major fields. 


* Encoders and Decoders 

* Multiplexers 

* Comparators 

« Adders and Arithmetic Logic 
« Latches 


We will not only focus on the design methodology for these func- 
tions, but will also explore further function-specific PLD selection 
requirements. Generalized designs will be developed, which can 
be customized later to suit specific system applications. Ways of 
optimizing the design will also be discussed. 


Encoders and Decoders 


Two of the most important functions required in digital design are 
encoding and decoding. The encoding and decoding of data are 
used extensively in digital communications as well as in periph- 
erals. Both these areas use various complex encoding and de- 
coding techniques, which are described in more detail on pages 
2-357 and 2-507. Most of these techniques are extensions of the 
simple encoding and decoding techniques often used in other 
digital designs. In this discussion we will focus on simple encod- 
ing and decoding techniques. 


Encoders 


A binary code of n bits can be used to represent 2" distinct pieces 
of coded data. Asimple combinatorial encoder is a circuit which 
generates n bits of output information based upon one of the 2" 
unique pieces of input data information. This encoding of infor- 
mation is controlled by other independent control signals in a 
typical digital circuit. 


An illustration of a typical encoder is shown in Figure 1. The 
design methodology typically followed is based on truth tables 
(Figure 2), from which the Boolean equations are directly derived 
forthe design. The same generic device selection considerations 
discussed in the section on PAL device design methodology 
apply for encoder and decoder designs. 


A 
a Co ENCODED 
INPUTS OUTPUTS 
1 
D 
405 01 


Figure 1. A Block Diagram of an Encoder 








OUTPUTS 


INPUTS 











Figure 2. Truth Table of a Typical Encoder 


The Boolean equations can then be optimized using Karnaugh 
maps or the PALASM 2 software minimizer. 


The resulting Boolean equations are: 
Cl = /A* B* /C* /D CO= /A* /B* C* /D 

+ /A* /B* /C* D + /A* /B* /C* D 
The complete PALASM 2 design file (including the simulation) for 
this simple encoder design is shown in Figure 3. 


A Priority Encoder 


Let us take another look at the encoder example of Figure 2. In 
this example it is assumed that only one of the inputs A, B, C or 
D is asserted HIGH at any one time. If two of the inputs are 
asserted HIGH simuitaneously, aconflict would be created. To re- 
solve this, a priority needs to be assigned to each of the inputs. 
Such a priority assignment is used to select a particular element 
when several inputs are asserted simultaneously. Each input is 
assigned a priority with respect to the other inputs. The output 
code generated is the code assigned to the highest priority 
input asserted. 


Thus, a priority encoder is a combinatorial circuit block similar to 
a general encoder, except that the inputs are assigned a priority. 
Such priority encoders are used often in state machine applica- 
tions, where they detect the occurrence of the highest priority 
event. They are also used for microprocessor interrupt control- 
lers, where they detect the highest priority interrupt. Another use 
for priority encoders is in bus control, where they are used in 
arbitration schemes for allowing selective access to the bus. 


The model of a priority encoder is shown in Figure 4. The four 
input signals are A, B, C and D. These are to be encoded as LL, 
LH, HL and HH outputs. Let us assign priority to Dover C, C over 
B, and BoverA. The next design step would be to modify the truth 
table (Figure 5) to reflect these priorities. 
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TITLE SIMPLE ENCODER 

PATTERN ENCODE. PDS 

REVISION 01 

AUTHOR JOE ENGINEER 

COMPANY MONOLITHIC MEMORIES, INC. 
DATE 8/20/87 


This is an example of a simple encoder. The output Cl and CO 
are generated based upon inputs A, B, C and D. The function 
truth table is following: 


A B c D co cl 
1 0 0 0 L L 
0 1 0 0 L H 
0 0 1 0 H L 
0 0 0 1 H H 


Re we te we Se te Se we se te we te 


CHIP ENCODER PAL16H2 


A B C D NC NC NC NC NC GND 
NC NC NC NC CO Cl NC NC NC VCC 


EQUATIONS 

Cl = /A* B* /C * /D 
+ /A * /B * /C * D 

cO = /A* /B * C* /D 
+ /A * /B * /C * D 

SIMULATION 

TRACE ON 


SETF A /B /C /D 
CHECK /CO /Cl 
SETF /A B/C /D 
CHECK /CO Cl 
SETF /A /B C /D 
CHECK CO /Cl 
SETF /A /B /C D 
CHECK CO Cl 


Figure 3. PALASM 2 File for a Simple Encoder 


OUTPUTS 
eae 





























raTeyey 

6 0 L 
1 L 
PRIORITY Co 0 H 
Cc ENCODER Ci 0 H 

cs X 1 0 0 L Priority 

405 04 X X 1 0 H Assignments 
X X X 1 H 
Figure 4. A Four-Input Priority Encoder Block Diagram Figure 5. Priority Encoder Truth Table 
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The Boolean equations, directly derived from the truth table, are: These equations can be further optimized by the design software 
to the following: 
cCl= /A* B* /C* /D CO = /A* /B* C* /D 


+/A* /B* /C* D + /A* /B* /C* D cCl=D+/C*B 
+ B* /C * /D + cx /D coeDdDHC 
+ D + D 


Although a priority encoder is_a purely combinatorial function, 
output registers are frequently used to hold the output signal 
stable for longer durations. An example of a registered 16-input 
priority encoder is shown in Figure 6. 


TITLE 16 INPUT REGISTERED PRIORITY ENCODER 
PATTERN P7090 

REVISION 01 

AUTHOR J. ENGINEER 

COMPANY MONOLITHIC MEMORIES, INC. 

DATE 8/20/87 


CHIP IN_PRTY PAL20R4 


; DESCRIPTION 





;THE 16 INPUT REGISTERED PRIORITY ENCODER ACCEPTS SIXTEEN ACTIVE-LOW INPUTS 
;(I10-115) TO LOAD THE BINARY WEIGHTED CODE OF THE PRIORITY ORDER INTO THE 
;OUTPUT REGISTER (Q3-Q0) ON THE RISING EDGE OF THE CLOCK (CLK). A PRIORITY 
;IS ASSIGNED TO EACH INPUT SO THAT WHEN TWO INPUTS ARE SIMULTANEOUSLY 
:ACTIVE, THE INPUT WITH THE HIGHEST PRIORITY IS LOADED INTO THE OUTPUT 
;REGISTER. THEREFORE THE HIGHEST PRIORITY INPUT (I0=H) PRODUCES HHHH IN 
;THE OUTPUT REGISTER AND THE LOWEST PRIORITY INPUT (115=H) PRODUCES LLLL IN 
+THE OUTPUT REGISTER. 


OPERATIONS TABLE 


/oc CLK 115-10 Q3-Q0 OPERATION 


’ 

, 

; 

; 

: H xX x Z HI-Z 

; L Cc 10 =H 15 I0 INTERRUPT (HIGHEST PRIORITY INPUT) 
; L c Il = 14 Il INTERRUPT 
; L c Il = 13 I2. INTERRUPT 
; L Cc Tl = 12 13) INTERRUPT 
; L Cc Tl = il I4 INTERRUPT 
; L c Il =H 10 I5 INTERRUPT 
; L c Il = 9 I6 INTERRUPT 
; L c Il = 8 I7. INTERRUPT 
; L c Il = 7 I8 INTERRUPT 
; L Cc Il = 6 19 INTERRUPT 
? L Cc Il = 5 I10 INTERRUPT 
; L Cc Il = 4 Ill INTERRUPT 
; L Cc Tl = 3 I12 INTERRUPT 
; L c Il = 2 I13 INTERRUPT 
; L Cc Il = ae 114 INTERRUPT 
; L Cc Tl = 0 115 INTERRUPT (LOWEST PRIORITY INPUT) 


CLK I0 Il 12 13 14 15 16 17 I8 I9 GND 
f/OC 110 I11 112 Q3 Q2 Q1 QO 113 114 115 vcc 


EQUATIONS 


/Q0 /10* Il 

/10*/I1*/12* 13 

/TO*/T1*/12*/13*/T4* I5 

/TO®/T1*/T2*/13*/T4*/IS*/I6% 17 

/TOX/T1*/T2*/13%/T4*/I5*/16*/I7*/1B* 19 
/TO*/T1L*/12*/13*/T4R/TS*/16*/17*/18*/I9*/T10% 11 
/TO®/T1*/12*/13*/T4*/I5%/16%/17*/1B*/I9*/TIO*/I1LI*/I12* 113 
/TO*/TL*/T2*/13*/14*/15*/16*/17*/18*/19*/T1O*/TIL*/T12*/T13*/1144115 


++ete eee sei 


Figure 6. A 16-Input Registered Priority Encoder Design File 
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/10*/11* 12 

/10*/11*/12* 13 

/IO*/11*/12*/13*/14*/I5* 16 

/TO*/I1*/12*/13*/14*/I5*/16* 17 
/IO*®/T1*/12*/13*/14*/I5*/I6*/IT*/IB*/I9* T1O 
/TOR/T1*/L2*/13%/14*/I5*/16*/17*/18*/19*/I10* I11 
/TO®/TI*/I2*/13*/L4*/IS*/I6*/IT*/IB*/I9*/TLO*/ILI*/I12*/I13* 114 
/TOR/T1L*/T2%/T3*/T4R/TS*/IG*/IT*R/IB*/I9*X/TLO®/III*/I12*/113*/T14*115 


++ ee eesti 


/Q2 := /IO*/I1*/12*/13* 14 

/1O*/T1*/12*/13*/14* I5 

/10*/11*/12*/13*/14*/15* 16 

/TO®/T1I*/12*%/13*/14*/15*/16* 17 
/1O®/T1*/12*/13*/14*/15*/16*/17*/1B*/I9*/T1O*/I1I* I12 
/IO®/T1I*/T2%/13*%/14*/T5*/16*/17*/1B*/I9*/TIO*/ILVI*/IL2* 113 
/TO®/T1*/T 28/1 3*/L4*/I5*/T6*/IT*/1B*/I9*/TLO*/I1LI*/I12*/113% 114 
/IO*/T1*/T2*/13%/TA®/TS*/I6*/IT*/IB*/I9*X/TLON/TII*/IL2*/T13*/IT14*I15 


++eeteeti 


/Q3 


+eeetesei 


/IO*/TIL*/12*/13*/14*/IS*/I6*/IT* IB 
/TO*/TI*/12*/13*/T4*/I5*/16*/IT*/IB* 19 
/1O*®/T1*/12*/13*/14*/15*/16*/17*/IB*/I9* 110 
/IO*®/11*/12*/13%/T4*/15*/16*/17*/18*/I9*/T10* 1d 
/TO*®/T1*/T2*/13*%/T4*/15*/16*/I7*/IB*/I9*/TIO*/III* 112 

/TO*/T1*®/T 2/1 3*/14*/15*/I6*/I7*/1B*/I9*/TIO*/I1I*/I12* 113 
/TO®/T1*/T2*/13*/14*/I5*/16*/17*/IB*/I9*/TLO*/I1I*/I12*/113* 114 
/TO®/T1*/12*/13*/L4*/I5*/IG6*/IT*/IB*/I9*/TLO*/ILI*/I12*/I13*/114*115 


; SIMULATION NOT INCLUDED HERE 


Figure 6. A 16-Input Registered Priority Encoder Design File (Cont’d.) 
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INPUT SELECT LINES 


A B Cc D 
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Ce ee ne en ee ee Se op ne ee ee ee 
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OUTPUT LINES 
Q0 Q1 Q2 Q3 Q4 QS Q6 Q7 QB Qd QiO Ali Q1i2 Q1i3 Q1i4 Qis 









Ce ee eo 
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ast Om ms es si ss sa ks sk 
—~sOnF om somo er or or wr or os os 
—_— © — = os —* os |) —4 24 ot ok A os es ek 
opt oae esteem sn es mu rr or on 











Figure 7. The Truth Table of an Active-LOW 4-to-16 Decoder 


Decoders 


A decoder performs the reverse function of an encoder. It 
converts an n-bit code to one of its 2” unique items. It is a 
combinatorial circuit designed such that at most one of its several 
outputs will be asserted based upon the unique input codes. 


Adecoder may have as many outputs as there are possible binary 
input selection combinations. As shown in the truth table 
(Figure 7), only one output may be asserted at any time. When 
anew combination is applied, another output is asserted and the 
original output is returned to its non-asserted state. 


The Boolean logic equations can be directly derived from the truth 
table shown in Figure 7. The procedure is the same as explained 
in the previous section on PLD design methodology on page 2-27. 
The Boolean equations derived are shown in Figure 8. 


/Q0 = /D* /C * /B* /A 
/Ql = /D* /C* /B* A 
/Q2 =/D* /C* B* /A 
/Q3 =/D* /C* Be A 
/Q4 = /D* C* /B* /A 
/98 =/D* C* /B* A 
/206 = /D* C* B* /A 
/Q7 = /D* C* B* A 
/QB: = Di*./C.* (B® /K 
/Q9 = D* /C* /B* A 
/Q10 = D* /C* B* /A 
/Q1= D* /C* B* A 
/Ql2 = D* C* /B* /A 
/Q13 = D* C* /B* A 
/Q14= D* C* B* /A 
/15= D* C* B* A 


Figure 8. Decoder Boolean Logic Equations 


Notice from the truth table that there is no combination of inputs 
that wiil send all the outputs to their non-asserted state. Many 
designs actually need to be able to make all outputs inactive. This 
canbe done simply by putting enable lines in all of the output AND 
gates. Many such design modifications can be easily added once 
the basic Boolean equations have been derived, instead of 
redoing the truth table. Figure 9 shows the PALASM 2 software 
design file of one such 4-to-16 decoder withthe enable function. 


Probably the most commonly used decoders are the address 
decoders required by most microprocessors and bus interfaces. 
These also constitute the most common application of PLDs in 
digital designs. The design considerations for address decoders 
have been covered earlier in the PLD Design Methodology 
section, page 2-23. Later we will develop a general Boolean 
equation for an address decoder circuit when we discuss range 
decoders. 
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Title 4tol6 Decoder 
Pattern 4-16DEC.PDS 
Revision A 


Author Mehrnaz Hada 
Company Monolithic Memories, Santa Clara, CA 
Date 1/9/85 


CHIP Decoder PAL6L16 


Q0 Q1 Q2A BC D EN1 EN2 Q3 Q4 GND 
Q5 Q6 QO7 Q8 Q9 Q10 Qll Q12 Q13 Q14 Q15 vcc 


7The 4 to 16 decoder decodes four binary encoded inputs 
zinto one of 16 mutually exclusive outputs, whenever the 
;two enable lines EN] and EN2 are high. When one or both 
;0f the enable lines are low the outputs are all set to 


shigh values. 


EQUATIONS 

/Q0 = /D*/C*/B*/A* EN1* EN2 ;Decode 0000 
/Ql = /D*/C*/B* A* EN1* EN2 ;Decode 0001 
/Q2 = /D*/C* B*/A* EN1* EN2 :Decode 0010 
/Q3 = /D*/C* B* A* EN1* EN2 ;Decode 0011 
/Q4 = /D* C*/B*/A* EN1* EN2 ;Decode 0100 
/Q5 = /D* C*/B* A* EN1* EN2 ;Decode 0101 
/Q6 = /D* C* B*/A* EN]* EN2 :Decode 0110 
/Q7 = /D* C* B* A* EN]* EN2 ;Decode 0111 
/Q8 = D*/C*/B*/A* EN1* EN2 ;Decode 1000 
/Q9 = D*/C*/B* A* ENL* EN2 ;Decode 1001 
/Q10 = D*/C* B*/A* ENL* EN2 ;Decode 1010 
7Qll1 = D*/C* B* A* EN1* EN2 ;Decode 1011 
/Q12 = D* C*/B*/A* EN1* EN2 ;Decode 1100 
/Q13 = D* C*/B* A* EN1* EN2 ;Decode 1101 
/Q14 = D* Ce B*/A* EN1* EN2 ;Decode 1110 
/Q15 = D* C* B* A* EN1* EN2 ;Decode 1111 
SIMULATION 

TRACE ON DBC A QO Ql Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q10 


Qll Q12 Q13 Q14 Q15 


SETF /D /C /B /A EN1 EN2 
SETF A 
SETF B 

SETF C 

SETF D 

SETF /D 

SETF /C 

SETF /B 

SETF /A 

SETF /EN1 


SETF ENl /EN2 
SETP /EN1 


TRACE_OFF 


;Set outputs to high 


;Set outputs to high 
7Set outputs to high 


Figure 9. Design File of Four-to-SIxteen Decoder with Enable 


Encoder/Decoder Device Selection Considerations 


The general device selection considerations are listed in 
Figure 10. Based upon the number of inputs and outputs 
required, a device can be selected from the table in Figure 11 for 
combinatorial devices or Figure 12 for registered devices. 


Number of Input Pins 
Number of Output Pins 
Number of 1/O Pins 

Device Speed 

Device Power Requirements 
Number of Registers 
Number of Product Terms 
Output Polarity Control 


Figure 10. General Encoder/Decoder Device Selection 


Considerations 
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DEVICE TOTAL DEDICATED DEDICATED LOGICAL SPEED GRADES 
NUMBER INPUT OUTPUT PRODUCT t,, INns 
OF PINS PINS TERMS/ 
OUTPUT 
TTL 
10H8 20 
10L8 20 
12H6 20 
20 
20 
20 
20 
20 
20 
16L8 20 
16P8 20 
16RA8 20 
18P8 20 
6L16 24 
8L14 24 
12110 24 
14L8 24 
16L6 24 
18L4 24 
2012 24 
2001 24 
20L8 24 7 15, 25, 35 
20L10 24 3 15, 20, 25, 30 
20RA10 24 4 
20510 24 8-16 
22P10 24 8 
22XP10 24 8 20, 30, 40 
22RX8 24 8 25 
22V10 24 8-16 15, 25, 35 
32VX10 24 8-16 25, 30 
CMOS 
C16L8 20 10 6 7 
C20L8 24 14 6 7 
C22V10 24 12 10 8-16 
C29M16 24 5 16 8-16 
C29MA16 24 5 16 4-12 





ECL 


Figure 11. Combinatorial Programmable Logic Devices 
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DEVICE TOTAL DEDICATED DEDICATED 
NUMBER INPUT REGISTERED 
OF PINS PINS OUTPUT 
PINS 


LOGICAL SPEED GRADES 

PRODUCT FREQUENCYIN 
TERMS/ MHz 
OUTPUT 


MAX 


55.5, 37, 25, 16 
55.5, 37, 25, 16 
55.5, 37, 25, 16 
22.2 
22.2 
22.2 
20 


feo] 
= hwOOWODMA OO 
Ls) 


37, 25, 16 

37, 25, 16 

37, 25, 16 
22.2 
22.2 
22.2 


20RP10 
20RP8 
20RP6 
20RP4 
20XRP10 
20XRP8 
20XRP6 
20XRP4 


25, 37 
25, 37 
25, 37 
25, 37 
30, 22.2, 14 
30, 22.2, 14 
30, 22.2, 14 
30, 22.2, 14 


PADHALOONS tee a pene 
@ 
oo pre fo men 


~amwwmnonwowowa AA AROOO 


20RS10 
20RS8 
20RS4 


ss 
@2o 
eo 
—_ 
ao 


20RA10 33, 20 
22RX8 28.5 
22V10 40, 28.5, 18 

32VX10 25, 22.2 
32R16 


10H/020EV/G8 
10H20G8 
(Latched) 





Figure 12. Registered Programmable Logic Devices 
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Encoders typically require a large number of inputs and fewer 
outputs, whereas decoders typically require a large number of 
outputs and fewer inputs. We can see from these tables that the 
PAL device for the priority encoder application (Figure 6) should 
be the PAL20R4 and for the decoder application (Figure 7) it 
should be the PAL6L16. 


Another important device selection consideration for encoders 
and decoders is the number of product terms required for a 
design. A careful selection of code values (and priority assign- 
ments in priority encoders) can often reduce the required number 
of product terms. This can sometimes determine whether or not 
a design fits a device successfully. Figure 13 shows the truth 
tables of two simple partial 3-to-2 encoders. The product terms 
required for the two designs are different due to the different 


assignment of encoded bits. 
INPUTS OUTPUTS INPUTS OUTPUTS 








Xl = /A* /B* C Xl = /A* B* /C 
+ /A* /B* C 
XO = /A * B* /C xO = A * /B * /C 


+ /A* /B* C 


Figure 13. Two Encoders With Different Product Term 
Requirements 


Another way of looking at a decoder is as a logic function which, 
depending upon the select code applied, connects one data input 
to the selected outputs. Also known as a Demultiplexer, a 
decoder essentially connects an input to one of 2° outputs based 
upon n select code bits. The reverse logic function, which 
combines data from multiple sources to an output signal, is called 
a multiplexer and is discussed next. 





Multiplexers 


A multiplexer (sometimes referred to as a data selector) is a 
special combinatorial circuit, widely used in digital design. It is 
designed to gate one of several inputs to a single output. The 
input selected for connection to the output is controlled by a 
separate set of select inputs. 


The traditional use of a multiplexer is for “time division multiplex- 
ing” in data communication, when gating several data lines toa 
single data transmission line for short intervals of time. The data 
received is then demultiplexed by using a demultiplexer, de- 
scribed earlier. 


The design methodology employed for multiplexer design is the 
truth-table approach. As an example, we can look at a three- 
input-to-one-output (3:1) multiplexer, which uses two select sig- 
nals Aand B. Based on these two select bits, the data on one of 
the three inputs is sent to the output. The truth table is shown in 
Figure 14. 


OUTPUT 


SELECT INPUTS 


B A HCO 8 8=—«1i C1 1102 O1Y 














Figure 14. Truth Table for a Three-to-One Multiplexer 


Deriving the Boolean equation from this truth table is a straight- 
forward task. In this case no further minimization is possible. The 
Boolean equation is: 

/O1Y = ee * /R * /I1C0 

+ /B A * /T1C1 

+ B* /A * /T1C2 

The complete PALASM 2 design file for this design example 
showing four such three-to-one multiplexers is shown in 
Figure 15. 








ra | Monolithic HR Memories ra | 


2-43 





Combinatorial Logic Design 








TITLE QUAD 3:1 MULTIPLEXER 
PATTERN P7074 

REVISION ol 

AUTHOR COLI/VOLPIGNO 

COMPANY MONOLITHIC MEMORIES, INC. 
DATE 8/20/87 

; DESCRIPTION 


:THIS IS AN EXAMPLE OF A QUAD 3-TO-1 MULTIPLEXER USING A PAL14L4. SELECT 
;LINES A,B ARE ENCODED IN BINARY, WITH A REPRESENTING THE LSB. THE OUTPUTS 
;(Y) ARE ALL HIGH IF THE SELECT LINES ARE BOTH HIGH (B,A=H). 

OPERATIONS TABLE: 


INPUT OUTPUTS 


SELECT 

BOA Y 
LoL co 
L oH cl 
H L C2 
H #H H 


wa Se se we Se Ne Me se me Ne Ne 


CHIP QUAD MUX PAL14L4 


I1CO I1C1 11C2 12C0 12Cl I2C2 13C0 I3C1 13C2 GND 
14CO I4Cl I14C2 O4¥ O3¥Y O2Y O1Y B A vcc 


EQUATIONS 

/O1Y = /B*/A * /11C0 ;SELECT INPUT 1C0 
+ /B* A * /T1Cl ;SELECT INPUT 1Cl 
+ B*/A * /11C2 ;SELECT INPUT 1C2 

/O2Y = /B*/A * /12C0 ;SELECT INPUT 2C0 
+ /Be A * /T2Cl ;SELECT INPUT 2Cl 
+ B*/A * /12C2 ;SELECT INPUT 2C2 

/O3Y = /B*/A * /13C0 ;SELECT INPUT 3C0 
+ /B* A * /T3C1l ;SELECT INPUT 3Cl 
+ BR/A * /13C2 ;SELECT INPUT 3C2 

/O4¥ = /B*/A * /14C0 ;SELECT INPUT 4C0 
+ /B* A * /T4Cl ;SELECT INPUT 4Cl 
+ Be/A * /T4C2 ;SELECT INPUT 4C2 


; SIMULATION NOT INCLUDED HERE 


Figure 15. Quad 3:1 Multiplexer 
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The equations derived in the above example can be easily 
generalized for other multiplexers. The symbol for a general 
2°-inputs-to-one-output multiplexer is shown in Figure 16 where 
n select lines are used. 


INPUTS 


MULTIPLEXER 


Y 
OUTPUT 





So S1 Sn-1 
INPUT SELECT LINES 


405 16 


Figure 16. General Model of a 2°-to-1 Multiplexer 


The Boolean equations are: 
n=2 


Y = /S1*/SO* (IO) 
+ /S1* SO*(I1) 
+ S1*/SO0* (12) 
+ $S1* SO*(I3) 


/S2*/S1*/SO* (IO) 
/82*/S1* SO* (I1) 
/S2* S1*/SO* (12) 
/S2* S1* SO* (I3) 
S2*/S1*/SO* (14) 
S2*/S1* SO* (I5) 
S2* S1*/SO* (16) 
S2* S1* SO* (I7) 


+++ teeter 


Multiplexer Device Selection Considerations 


Multiplexers typically require more inputs than outputs, so the 
devices with a large number of inputs and I/Os are usually more 
useful. Careful consideration must also be given to the number 
of product terms available on each output. 





Several multiplexers are often used simultaneously to route 
multiple address and data bits, under the control of the same 
select lines. In such cases, multiple devices can be cascaded 
when the number of inputs and outputs exceeds device limits. 
Cascading is also possible for large multiplexers that do not fit in 
a single device. In such cases, the select bits should also be 
judiciously selected for each PLD, to minimize the number of 
product terms. 


Another common trick for designing a multiplexer is to connect a 
number of outputs together and control the output enables using 
the select bits to multiplex data. This multiplexing technique is 
used extensively in DRAM controllers for multiplexing address 
signals. Timing considerations for such designs include the 
output enable and disable times, which should be carefully 
selected to avoid output contentions. 


Comparators 
A comparator is a combinatorial circuit designed primarily to 


compare the relative magnitude of two binary numbers. Figure 17 
shows the truth table for a two-bit comparator. 


INPUTS OUTPUTS 





EQL LES GTR 
A=B A<B A>B 


0 
0 
0 
0 
0 
0 
0 
0 
1 
1 
1 
1 
1 
1 
1 
1 


ee oa Mo — a a a ole Molo) 
-=-32300-=00-400-~00 
“~O-0O0-0-0-0-0-0=0 
“O02 00H0C0000-c0000-4 
(ooo Ron Roo Rol a Solel a a ae! 
o--=-00--000-c0000 





Figure 17. Truth Table for a Comparator 


A basic comparator compares two numbers only for equality, 
and generates the EQL signal (indicating A=B). An extension, 
called a magnitude comparator, also generates the LES signal 
(indicating A<B) and GTR signal (indicating A>B). Based on this 
truth table, the equations for the three output signals EQL, LES 
and GTR can be easily derived. These equations can then be 
optimized by using Boolean algebra, Karnaugh maps or the 
minimization routine available with PALASM 2 software. 
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The final Boolean equations are: 


EQL = /A2 * /Al * /B2 * /B1 
+ /A2 * Al * /B2 * Bl 
+ A2* /Al * B2 * /B1 
+ A2* Al* B2* Bl 

LES = /A2 * /Al * /B2 * Bl 
+ /A2 * /Al * B2 * /B1 
+ /A2 * /Al * B2* BI 
+ /A2 * Al * B2 * /BL 
+ /A2 * Al * B2* Bl 
+ A2* /Al * B2* Bl 
= /Al * B2* BL 
+ /A2 * /Al * Bil 
+ /A2 * B2 

GTR = /A2 * Al * /B2 * /B1l 
+ A2 * /Al * /B2 * /B1 
+ A2* Al * /B2 * /B1 
+ A2 * /Al * /B2 * Bl 
+ A2* Al * /B2 * Bl 
+ A2* Al * B2 * /Bl 


Al * /B2 * /B1 
A2 * Al * /B1 
A2 * /B2 


++ Il 


Comparator Device Selection Considerations 


The number of product terms needed is directly related to the 
number of bits compared. For LES (less than) and GTR (greater 
than) functions, the number of product terms required depends 
upon the number of bits in the two operands compared, as well as 
their value. The LES and GTR equations can be rewritten as 
follows: 





























DEVICE TOTAL 
NUMBER 


OF PINS 


DEDICATED 
INPUT 
PINS 






PINS 














TTL 

16C1 20 

20C1 24 
20810 24 
22V10 24 
32VX10 





CMOS 


DEDICATED 
OUTPUT 


LES = B2 * /A2 
+ (B2 :+: /A2) * Bl * /AL 


GTR =. A2 * /B2 
+ (A2 :+: /B2) * Al * /B1 


These equations can then be extended for a general comparison 
of n-bit comparands as follows: 


LES = Bn * /An 
+ (Bn :+: /An) * Ba-1 * /An-1 
+ (Bn :+: /An) * (Bn-1 :+: /An-1) * Bn-2 * 
/An-2 
+ ... 
+ we 
+ we 
+ (Bn :+: /An) * (Bn-1 :+: /An-1) .. 
(B2 :+: /A2) * Bl * /AL 
GTR An * /Bn 


(An :+: /Bn) * An-1 * /Bn-1 
(An :+: /Bn) * (An-1 :+: /Bn-1) * An-2 * 
/Ba-2 


+ + Il 


t+ +eet 


(an t+: /Bn) * (An-1 :+: /Bn-1) ... 
(A2 :+: /B2) * Al * /B1 


The total number of product terms required for an n-bit compari- 
sonis 2"-1, Comparators require a large number of product terms 
and devices that offer many product terms can be used very ef- 
fectively. Figure 18 shows alist of devices that offer many product 
terms. It also includes registered devices, since registers are 
sometimes used to hold the comparator output signals asserted 
for long durations. 





SPEED GRADES 
t,, INns 


LOGICAL 
PRODUCT 
TERMS/ 
OUTPUT 














Figure 18. PLDs With Large Numbers of Product Terms 
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As is obvious from these equations, comparators require exclu- 


PAL16X4, which offers exclusive-OR functions. An example of a 
sive-OR functions, and can also be efficiently implemented inthe 


four-bit comparator implemented on a PAL16X4 is shown in 
Figure 19. 


PAL16X4 

P7031 

BETWEEN LIMITS COMPARATOR/REGISTER 
MMI SUNNYVALE, CALIFORNIA 


PAL DESIGN SPECIFICATION 
BIRKNER/COLI 07/12/81 


CLK LOAD CLEAR BO Bl B2 B3 


; NC /OC2 GND 
/OC1 /NE 


7EQ A3 A2 Al AO /LT /GT VCC 
; DESCRIPTION 


;THE DEVICE CONTINUOUSLY COMPARES THE BUS VALUE (B3-B0) WITH THE VALUE OF 
;THE REGISTER (A3-A0) AND REPORTS THE STATUS ON OUTPUTS LT, EQ, NE, AND GT: 





; * LT INDICATES THAT B IS_- LESS THAN A 
H * EQ INDICATES THAT B IS EQUAL TOA 
; * NE INDICATES THAT B IS NOT EQUAL TO A 
; * GT INDICATES THAT B IS GREATER THAN A 
H STATUS BUS REG 
;/OCl1 /0C2 CLK LOAD CLEAR LT EQ NE GT B3-BO A3-A0 OPERATION 
; H x x x x KOM. Xk x Z REG HI-Z 
; x H x x x 2.2 Oe x x STATUS HI-Z 
: X X L ame ane CMD Gas X A READ REG 
3 X x c H L xX xX X X B B LOAD REG 
3X x cC L L Xx xX xX X X A HOLD 
7 X x c X H Xx xX X X X L CLEAR REG 
Hae L Xx L L STATUS B A COMPARE 
jo---------------------------- --- - +--+ +--+ $$$ 5 5-5-5 === 
IF(OC2) LT = (A3*/B3) ;B3=L, A3=H 
+ (A3:*:B3) * (A2*/B2) ;B2=L, A2=H 
+ (A3:*:B3) * (A2:*:B2) * (Al*/B1) ;B1l=L, Al=H 
+ (A3:*:B3) * (A2:*:B2) * (Al:*:Bl) * (AQ*/BO) ;BO=L, AO=H 
IF(OC2) GT = (/A3*B3) ;B3=H, A3=L 
+ (A3:*:B3) * (/A2*B2) ;B2=H, A2=L 
+ (A3:*:B3) * (A2:*:B2) * (/A1*B1) ;Bl=H, Al=L 
+ (A3:*:B3) * (A2:*:B2) * (Al:*:Bl) * (/A0*BO) ;BO=H, AQ=L 
IF(OC2) EQ = (A3:*:B3) * (A2:*:B2) * (Al:*:Bl) * (AQ:*:BO) ;COMPARE EQUAL 
IP(OC2) NE = (A3:+:B3) + (A2:+:B2) + (Al:+:Bl) + (A0:+:BO) ;NOT EQUAL 
/A3 += (/A3)*/LOAD ;HOLD REG A3 
+ (/B3)* LOAD ;LOAD REG A3 
+ CLEAR ;CLEAR REG A3 
/A2 3= (/A2)*/LOAD ;HOLD REG A2 
+ (/B2)* LOAD ;LOAD REG A2 
+ CLEAR ;CLEAR REG A2 
/Al := (/Al1)*/LOAD ;HOLD REG Al 
+ (/Bl)* LOAD ;LOAD REG Al 
+ CLEAR ;CLEAR REG Al 
/A0 := (/A0)*/LOAD ;HOLD REG AO 
+ (/B0O)* LOAD 7;LOAD REG AO 
+ CLEAR ;CLEAR REG AO 


Figure 19. PAL16X4 Based Four-Bit Comparator 
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FUNCTION TABLE 


CLK /OCl1 /OC2 LOAD CLEAR B3 B2 Bl BO A3 A2 Al AO LT EQ NE GT 


; CONTROL BUS REG 

; /OC OPERATIONS BBBB AAAA 

7;CLK 1 2 LOAD CLEAR 3210 3210 
Cc LX xX H XXXX LLLL 
c LX H L LLLL LLLL 
X LL L L LLLL LLLL 
xX LL L L LLLH LLLL 
X LX L L XXXX LLLL 
Cc LX xX H XXXX LLLL 
C LX H L LHLH LHLH 
X LL L L LLLL LHLH 
X LL L L LHLH LHLH 
xX LL L L HHHH LHLH 
xX LX L L XXXX LHLH 
c Lx xX H XXXX LLLL 
C LX H L HLHL HLHL 
X LL L L LHLL HLHL 
X LL L L HLHL HLHL 
xX LL L L HLHH HLHL 
X LX L L XXXX HLHL 
Cc LX xX H XXXX LLLL 
c LX H L HHHH HHHH 
xX LL L L HHHL HHHH 
xX LL L L HHHH HHHH 
X LX L L XXXX HHHH 
c LX L L XXXX HHHH 
X HX xX x XXXX 2222 
X XH X xX XXXX XXXX 


--STATUS--- COMMENTS 

LT EQ NE GT (HEX VALUES) 

x xX xX X CLEAR REG 

x x x X LOAD REG (0) 

L HL OL COMPARE (0 EQ 0) 
L LH H COMPARE (1 GT 0) 
xX X xX X READ REG (0) 

x xX xX X CLEAR REG 

Xx X X X LOAD REG (5) 

ae Cane ies COMPARE (0 LT 5) 
L HL L COMPARE (5 EQ 5) 
L L HH COMPARE (F GT 5) 
Xx X X X READ REG (5) 

x xX xX X CLEAR REG 

xX X X X LOAD REG (A) 

H L HL COMPARE (4 LT A) 
Lb HL Lb COMPARE (A EQ A) 
L L H BH COMPARE (B GT A) 
xX X xX X READ REG (A) 

x X xX X CLEAR REG 

X X X X LOAD REG (F) 

H LH OL COMPARE (E LT F) 
L HL L COMPARE (F EQ F) 
xX X xX X READ REG (F) 

xX X X X HOLD (F) 

x xX xX X TEST HI-Z (/OC1=H) 
222 2 TEST HI-Z (/O0C2=H) 


Figure 19. PAL16X4 Based Four-Bit Comparator (Cont’d.) 


The values of the comparands themselves affect the number of 
product terms used. When the comparison is made with compar- 
ands which are power-of-two numbers, the number of product 
terms required can be reduced drastically. This essentially relies 
on the fact that when the lower bits of a comparand are all zeros 
only the highest bit needs to be compared, requiring only one 
product term. For example in a two-bit comparator, if A1 is zero 
and A2 is one, the equation for the greater-than function becomes 
very simple and requires only one product term: 


GTR = /B2 


The general equation for the GTR signal can also be simplified 
when comparing anumber B to a fixed power-of-two comparand 
Awith p least significant zeros. 


A = 000010000 ... 00 
n p 1 


GIR = /Bn * /Bn-1.. * /Bptl * /Bp 


This general GTR equation can also be considered as an equa- 
tion for comparing a number to a range of numbers extending 
from zero to numberA. Infact this trick is used very often by many 
system designers for address decoder functions. In PLD Design 
Methodology (page 2-22) the ROMCS1 signal is one such signal 
which is generated for the address range from (000000) hex to 
(OFFFFF) hex. For this design n=23, the comparand A=(OFFFFF 
+ 1)=100000 and p=21. Substituting in the general equation we 
get the same address decoder Boolean logic equation. 


ROMCS1 = /A23 * /A22 * /A21 


As such designs require few product terms and no XOR gates, 
they are efficiently implemented on standard combinatorial PLDs. 
Alist of such standard combinatorial PLDs is shown in Figure 20. 
Ageneral form of range comparators with two boundary compar- 
ands will be discussed later. 
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DEVICE © DEDICATED 
INPUT 


PINS 


20L8 
20L10 
20810 
22P10 
22XP10 
22RX8 
22V10 
32VX10 


DEDICATED 
OUTPUT 





LOGICAL SPEED GRADES 

PRODUCT t,, INns 
TERMS/ 

OUTPUT 


10, 15, 25, 35 


15, 25, 35 
15, 20, 25, 30 


7 
3 


@ 
+ 

— 

o 


20, 30, 40 


‘© © © 


oo 
_ —- 
oo 





Figure 20. Standard Combinatorial PLDs with Sum-of-Products Logic 


The third output signal is the EQL signal. The EQL Boolean 
equation tells us whether the two numbers are identical. Such 
information is useful not only in address decoders, but also in 
digital signal processing designs. This equation requires a large 
number of product terms. Acloser examination reveals that it is 
essentially an exclusive-OR function. 


EQL = /A2 * /B2 * (/Al * /B1 + Al * Bl) 
+A2* B2 * (/Al * /B1l + Al * Bl) 


EQL = (Al :*: Bl) * (A2 :*: B2) ; Exclusive-NOR 
3; function 


lnverting this: 


/EQL = (Al :+: Bl) + (A2 :+: B2) ; Exclusive-OR 
3 function 


This equation can be extended to give a general equation for 
»qual-to comparison for two n-bit comparands. 

'EQL = (An :+: Bn) 
(An-1 :+: Bn-1) 
(An-2 :+: Bn-2) 
(An-3 :+: Bn-3) 


t+eetett 


(al t+: Bl) 


This inverted equation is implementable in the sum-of-products 
form of the exclusive-OR functions, and can be easily expanded 
to the following: 


/EQL = Al * /B1 + /Al * Bl 
+ A2 * /B2 + /A2 * B2 
+ A3 * /B3 + /A3 * B3 
+ .. 
Fun 
+ An * /Bn + /An * Bn 


This gives us a general sum-of-products form of a comparator 
equation which is easily implemented in PAL devices. An n-bit 
comparator requires 2n product terms. An eight-bit comparator 
is implemented in the design shown in Figure 21. This design 
uses the PAL16C1, which is specifically designed for comparator 
applications, and provides a large number of product terms 
connected to EQL and /EQL (NE) outputs. All of the devices that 
have a large number of product terms are illustrated in Figure 18, 
and would be appropriate for such designs. 
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Title Octal Comparator 

Pattern OctComp.pds 

Revision A 

Author Mehrnaz Hada 

Company Monolithic Memories Inc., Santa Clara,CA 
Date 1/29/85 


;The octal comparator establishes when two 8-bit data 
;strings (A7-A0) and (B7-BO) are equivalent (EQ=H) or not 
sequivalent (NE=H). 

CHIP OctalCom PAL16Cl 


A7 AO BO Al Bl A2 B2 A3 B3 GND 
A4 B4 AS B5 EQ NE A6 B6 B7 VCC 


EQUATIONS 
NE = AO*/BO + /AO* BO sA0 :+: BO 
+ Al*/Bl + /Al* Bl :Al :+: Bl 
+ A2*/B2 + /A2* B2 3A2 s+: B2 
+ A3*/B3 + /A3* B3 7A3 :+: B3 
+ A4*/B4 + /A4* B4 ;A4 :+: B4 
+ AS*/B5 + /AS* BS 7A5 3+: BS 
+ A6*/B6 + /A6* B6 ;A6 :+: B6 
+ A7*/B7 + /A7* B7 ;A7 2+: BT 
SIMULATION 
TRACE_ON A7 A6 AS A4 A3 A2 Al AO NE 
B7 B6 BS B4 B3 B2 Bl BO 
SETF A7 /A6 /AS /A4 /A3 /A2 /Al /A0 ;A7=H, B7=L 
/B7 /B6 /BS /B4 /B3 /B2 /Bl /BO 
SETF /A7 A6 ;A6=H, B6=L 
SETF /A6 AS ;AS=H, BS=L 
SETF /A5 A4 ;A4=H, B4=L 
SETF /A4 A3 ;A3=H, B3=L 
SETF /A3 A2 ;A2=H, B2=L 
SETF /A2 Al ;Al=H, B1=L 
SETF /Al AO ;A0=H, BO=L 
SETF /A7 /A6 /AS /A4 /A3 /A2 /Al /A0 ;A7=L, B7=H 
B7 
SETF /B7 B6 ;A6=L, B6=H 
SETF /B6 BS 7A5=L, BS5=L 
SETF /B5 B4 ;A4=L, B4=H 
SETF /B4 B3 +A3=L, B3=H 
SETF /B3 B2 3;A2=L, B2=H 
SETF /B2 Bl ;Al=L, Bl=H 
SETF /Bl BO ;A0=L, BO=H 
SETF /BO ;Test all L's 
SETF A7? A6 AS A4 A3 A2 Al AO ;Test all H's 
B7 B6 BS B4 B3 B2 Bl BO 
SETF /A7 A6 /AS A4 /A3 A2 /Al AO ;Test even ones 
/B7 B6 /BS B4 /B3 B2 /Bl BO 
SETF A7 /A6 A5 /A4 A3 /A2 Al /A0 ;Test odd ones 


B7 /B6 B5 /B4 B3 /B2 Bl /BO 


TRACE_OFF 


Figure 21. Octal Comparator Design Example 
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Note that the EQL equation, as well as GTR and LES equations, 
rely upon the XOR function. Often the logic represented by the 
equations is implemented in two or more devices. There are 
several PAL devices that provide dedicated XOR capability, and 
can efficiently implement the logic. These devices, listed in Figure 
22, can be used for comparator applications. The PAL32VX10 
and the PAL22RX8 offer an architecture with a multiple number 
of product terms on one side of the XOR gate, which allows a 
number to be compared against multiple comparands. The de- 

















DEVICE TOTAL 
NUMBER 


OF PINS 


DEDICATED 
INPUT 
PINS 


DEDICATED 






OUTPUT 
PINS 








22XP10 





20XRP10 
20XRP8 
20XRP6 
20XRP4 


REGISTERED 





sign example in Figure 19 shows the XOR implementation of the 
LES, GTR and EQL functions. 


Let us analyze these equations further. The LES and GTR 
outputs indicate whether one number is greater than or less than 
another. In fact, these equations can also be judiciously com- 
bined to get a comparison of a range of numbers such as 
A>X>B. Such range comparisons are very useful for address 
decoder circuits. 






LOGICAL 
PRODUCT 
TERMS/ 
OUTPUT 


SPEED GRADES 
FREQUENCY IN 
MHz 















toy = 
20, 30, 40 








30, 22.2, 14 
30, 22.2, 14 
30, 22.2, 14 
30, 22.2, 14 


Figure 22. PAL Devices with XOR Capability for Comparators 
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Range Decoders 


Range decoders implemented as address decoders are one of 
the most commonly used applications of PLDs in digital systems. 
Agood example is the address decoder illustrated earlieron page 
2-22, in the PLD Design Methodology section. Range decoders 
compare a number (address) to a given range of comparands 
(addresses). One way to arrive at the range decoder Boolean 
equations is to use the traditional truth table approach. Another 
way is to use the Boolean equations generated earlier in the 
comparator section for greater-than and less-than functions. To 
decode a range of three-bit numbers from B to A, we must 
compare another number X such that A>X>B. The Boolean 
equations for the GTR (A>X) and LES (B<X) functions are 
illustrated below: 


GIR = A3 * /X3 
+ (A3 :+: /X3) * A2 * /X2 
+ (A3 :+: /X3) * (A2 :+: /X2) * Al * /X1 


LES = X3 * /B3 
+ (X3 :+: /B3) * X2 * /B2 
+ (X3 :+: /B3) * (X2 :+: /B2) * X1 * /B1 


Combining these two equations can give us a range signal which 
willbe asserted only when Ais greater than X and X is greater than 
B. The combined Boolean equation follows: 


RANG = ( A3 * /X3 
+ (A3 :+: /X3) * A2 * /X2 
+ (A3 :+: /X3) * (A2 :+: /X2) * Al * /X1 ) 
* 
( X3 * /B3 
+ (X3 i+: /B3) * X2 * /B2 
+ (X3 :+: /B3) * (X2 :+: /B2) * X1 * /B1 ) 


Using Boolean algebra we get the following equation: 


RANG = 
(A3 :+: /X3) * (X3 :+: /B3) 
+ (A3 :+: /X3) * (X3 :+: /B3) 
+ (A3 s+: /X3) * (A2 :+: 
+ (X3 :+: /B3) * (X2 :+: /B2) 
+ (A3 :+: /X3) * A2 * /X2 * X3 * /B3 
+ (X3 :+: /B3) * A3 * /X3 * X2 * /B2 


* (A2 s+: /X2) * Al * /X1 * X2 * /B2 

* (X2 :+: /B2) * A2 * /X2 * X1 * /B1 
/X2) * Al * /X1 * X3 * /B3 

* AB * /X3 * X1 * /Bl 


The general equation for n-bit comparands can also be obtained by extending these equations. 


RANG = ( An * /Xn 
+ (An :+: /Xn) * An-1 * /Xn-1 
+ (An :+: /Xn) * (An-1 :+: /Xn-1) * An-2 * /Xn-2 
+. 
+o 
+ (An :+: /Xn) * (An-1 :+: /Xn-1) .. (A2 :+: /X2) * Al * /X1) 
* 
(Xn * /Bn 
+ (Xn :+: /Bn) * Xn-1 * /Bn-1 
+ (Xn :+: /Bn) * (Xn-1 :+: /Bn-1) * Xn-2 * /Bn-2 
Fae 
2 ee 
+... 
+ (Xn :+: /Bn) * (Xn-1 :+: /Bn-1) .. (X2 :+: /B2) * X1 * /B1) 
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The number of product terms required is clearly very large and 
can easily exceed one hundred for an eight-bit range comparator. 
Most microprocessors have addresses which exceed 16 bits. In 
order to fit the design on a PAL device, one commonly used 
technique is to select the address range defined by A and B such 
that the range extends from address B+1 toA-1, whereA and B+1 
ara power-of-two numbers. Because the address space is 
aligned on the power-of-two boundaries, a number of bits of the 
address comparands willbe zero. Whenimplementedin Boolean 
equations, this substantially reduces the number of product terms 
required. 


The maximum number of product terms required for a three-bit 
range decoder shown above, with any comparand values, is 28. 
If the address is chosen from 2 to 3, resulting in A=4 and B=1, then 
only one product term will be required. 


RANG = /X3 * X2 


Similarly for a range from one to five, B = 1 and A=6 (a multiple- 
of-two), and the number of product terms required is only two. 


RANG = /X3 * X2 
+ X3 * /X2 


Thus a careful selection of range boundaries allows such logic 
functions to be implemented easily in PLDs. Such reduction in 
logic obviously also holds true for discrete implementations. Most 
address decoders are designed with address ranges with 
boundaries that are power-of-two numbers, and require few prod- 
uct terms for implementation. 


For a power-of-two address range, the comparand Awould be a 
power-of-two; 2, 4 or 8. These are numbers whose least signifi- 
cant bits are all zeros. Similarly, comparand B will be a power-of- 
two number (minus one); 1,3, and 7. These are numbers whose 
least significant bits are allones. Substituting these in the general 
equation for range comparators we arrive at: 


A = 0000100000000000 
n p 1 


B = 0000000000011111 
n q 1 


RANG = /Xn * /Xn-1 *..% /Xp *(Xp-1 + Xp-2 + .. +Xq) 


This is a general equation for a power-of-two range comparison. 
In the example on page 2-22, the ROMCS2 signal addresses the 
range from 100000 to 1FFFFF, in which case B=OFFFFF and 
A=2FFFFF. Here n=23, p=22 and q=21. The address decode 
equation for the ROMCS2 signal can be arrived at by substituting: 


ROMCS2 = /A23 * /A22 * A21 

This is the same equation as was found from the truth table. 
3uch designs are very common for address decoder applica- 
ions. These do not require any XOR gates, and can be imple- 


nented in standard combinatorial PLDs with only sum-of-prod- 
Icts logic. A list of such PLDs is shown in Figure 20. 


Adders/Arithmetic Circuits 


Digital systems are designed to carry out a variety of arithmetic 
instructions on binary numerical data. A good example is the 
ALU (Arithmetic Logic Unit) used in all digital computers. The 
basic function of an ALU is that of an adder performing addition 
on two binary numbers. A binary adder takes two inputs, adds 
them, and generates the binary sum. A full adder is a one-bit 
adder with carry-in and carry-out; this is the basic building block 
of any adding circuit. The truth table of such an adder is shown 
in Figure 23. 


INPUTS 





OUTPUTS 








—_= ee Ss A OO OC 
—_~o?-o-0O+-0 
=-=oo 0-0 
=—=a st tO OK" OO O80 





2-3-0000 





Figure 23. Truth Table for a Full Adder 


This truth table is then used to form the Boolean equations in the 
manner described earlier. 


Y= A* /B * /Cin 
+ /A * /B * Cin 
+ A* B* Cin 
+/A* B* /Cin 


Cout = A * Cin 
+A*B 
+B * Cin 


Bigger binary adders can be made by cascading these full 
adders. Each carry-out is directed to carry-in for the next stage. 
Such adders are known as Ripple Adders. 


Combinatorial PAL devices are ideal for this purpose, since they 
also provide internal feedback. Thus one strong consideration in 
such designs is the internal feedback capability of the device, in 
addition to other general device selection considerations. 


These ripple adders have an advantage in that they can be 
cascaded to any length possible. But since the carry-out from the 
least significant bit has to travel all the way to the highest 
significant bit, this can take along time, making such large adders 
inefficient. Adders with built in carry-look-ahead circuitry can 
save time by simultaneously generating the carry-in signal for all 
of the bits. 
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Rewriting the equations for a full adder from above gives: 
YO = AO :+: BO :+: Cin 

where the carry-out signal is: 

cO = AO * BO + (AO + BO) * Cin 


Extending these equations for an n-bit carry-look-ahead adder, 
we can directly get the following equations: 


YO = AO :+: BO :+: Cin 
Yl = Al :+: Bl :+: CO 
where 


CO = AO * BO 
+(AO + BO) * Cin 


Y2 = A2 :+: B2 :+: Cl 
where 


Cl = Al * Bl 
+(Al + Bl)* (AO * BO) 
+(Al + B1)*(AO + BO) * Cin 


Y3 = A3 :+: B3 :+: C2 
where 


C2 = A2 * B2 
+(A2 + B2)*(Al * Bl) 
+(A2 + B2)* (Al + B1)* (AO * BO) 
+(A2 + B2)*(Al + B1)* (AO + BO) * Cin 


In general 
Yn = An :+: Bn :+: Cn-1 


and Cn-1 = An-1 * Bn-1 
+(An-1 + Bn-1) * (An-2* Bn-2) 
+... 
+... 
+(An-1 + Bn-1)* .. * (AO + BO) * Cin 
and finally the carry-out is: 
Cn An * Bn 
(An + Bn) * (An-1 * Bn-1) 
(An + Bn) * (An-1 + Bn-1) * (An-2 * Bn-2) 


++ e+ till 


(An + Bn) *.. * (AO + BO) * Cin 


These equations are essentially a combination of the traditional 
generate and propagate logic for ALU design. 


Adder Device Selection Considerations 


The number of product terms required for implementing a carry- 
look-ahead adder is enormous. The carry-out function alone for 
a four-bit carry-look-ahead adder requires over 36 product terms 
in the sum-of-products form. For asingle-level AND-OR implem- 
entation the number of product terms required for the most 
significant bit Y3 is 28. This will not fit into any of the available 
standard combinatorial devices. A PLD with this many product 
terms per output will not only be slow, but will also allow very low 
silicon utilization. 


Special circuitry, known as bit-pair decoding, has been provided 
onthe PAL16X4 to allow implementation of such functions with up 
to six productterms. Designed primarily for arithmetic operations, 
the PAL16X4 device has an architecture which implements three 
different stages of logic. The first is the bit-pair decoding between 
an input and a feedback. The second is the standard AND-OR 
array logic. The third is the XOR gate provided on selected 
outputs. 


Although adders are strictly a combinatorial function, the 
PAL16X4 provides four registered outputs. Not only are these 
registers used for storing the result, but they are also used 
temporarily to hold one of the two operands being added. The 
other operand is provided directly at the inputs. To keep the 
discussion simple we will not discuss how the first operand is 
loaded into the registers. This load function can be implemented 
by using an extra product term on each output. 


The logic trick lies in the bit-pair decoding function. All of the bits 
of the first operand in the registers (A) and the second operand at 
the inputs (B) are bit-pair decoded. As illustrated in Figure 24, the 
results of this bit-pair decoding are A + B, A+ /B, /A+ B, and 
/A+/B. These outputs are then fed to the AND array as inputs. 


BIT-PAIR 
! DECODING 


A+B 





FEEDBACK 


>l > 
+ + 
wo ol 


woo e---------4 
>! 
+ 
ol 


Ln we ee ee oe ee ee a 405 24 


Figure 24. Bit-Pair-Decoding Function 
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Sixteen AND combinations of these four inputs can then be 
formed on every product term of the AND-OR array. These are 
shown in Figure 25, and include the standard true and comple- 
ments of both the bits as well as XOR, XNOR and various other 
combinations. This bit pair decoding essentially provides an extra 
two-level AND-OR logic level before the AND-OR array. The cost 
as well as extra propagation delay of the extra logic level is 
minimal, since the array size does not increase. 


The equations forthe adder can obviously benefit from multi-level 


levels of logic. The next level of logic is implemented in the 
standard AND-OR array. Every product term of this AND-OR 
array can combine one of the sixteen possible functions of 
different inputs/feedbacks of the device. 


The product terms are then combined together through an OR 
gate to implement the CARRY-OUT function, shown in Figure 26. 
For adder outputs YO, Y1, Y2, and Y3, the product terms are 
combined through the XOR gate, as shown in Figure 27. This 
XOR gate on the outputs is the final logic level on the device. 


logic. The bit-pair decoding is used to implement the first two 
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Figure 25. Sixteen Possible Input Logic Combinations 
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BIT PAIR 
DECODING 


BIT PAIR 
DECODING 


BIT PAIR 
DECODING 


A3* B3 

(A3+B3) * A2* Bo 

(A3 +B) * (Az+Ba2) *Ay* By 

(A3 +Bs3) * (Az + Ba) * (A1+B1) * Ao* Bo 
(A3+ Bsa) * (A2+Bo) * (A1+B1) * (Ao* Ba) * Cin 


CARRY-OUT 








CARRY-OUT= A3* B3 
(Ag+ Bg) * Az *Ba 
(Ag+ Bs) *(Ao+ Bo) *A;* By 405 26 
(A3+ Ba) * (A2+ Ba) * (A1+By) * Ao * Bo 
(A3+ Bg) * (A2+ Ba) * (Ai+B1) * (Ao * Bo) *Cin 


Figure 26. Implementation of CARRY-OUT Function 
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(A2+ Ba) * (A1+B 1) * (Ao* Bo) 
(A2+Bo) * (A1+By4) * (Ao* Bo) * Cin 
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Y3 = Agi+: Bg 

+: ((A2 * Bo) 

+(A2+ Ba) *(A1* Bi) 

+(A2 + Ba) * (Ay + By) * (Ao* Bo) 

405 27 


+(Az+ Bo) * (Ay + By) * (Ao* Bo) * Cin) 


Figure 27. Relationship Between Adder Boolean Equation and PAL16X4 Device Logic 
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Thus the most significant bit of the output can be implemented Figure 28 illustrates an example of one such adder, which also 
using only five product terms; the carry-out function can be includes the logic required for loading the registers with one ofthe 
implemented using only six product terms. All of this is possible operands. 


because the PAL16X4 offers a modified programmable array with 
hardware dedicated to bit-pair decoding at the inputs and an XOR 
gate at the outputs. 


PAL16X4 PAL DESIGN SPECIFICATION 
PNEW J. ENGINEER 9/15/87 
ADDER 


MMI SUNNYVALE, CALIFORNIA 


CLK LOAD ADD BO Bl B2 B3 NC NC GND 
CIN NC NC A3 A2 Al AO G P_ vcc 


; DESCRIPTION 


;THE DEVICE FUNCTIONS AS AN ADDER. IT FIRST STORES ONE OPERAND IN THE FOUR- 
7BIT OUTPUT REGISTER UNDER THE CONTROL OF LOAD SIGNAL. USING FEEDBACK AND A 
7;SECOND OPERAND AT THE INPUTS, IT ADDS THE TWO UNDER CONTROL OF SIGNAL ADD. 
7;THE RESULTS ARE STORED BACK IN THE OUTPUT REGISTERS. THE DEVICE ALSO 
7;GENERATES THE PROPAGATE AND GENERATE SIGNALS WHICH ALLOW A NUMBER OF 
;SIMILAR DEVICES TO BE CASCADED. 





IF(VCC) P= (A3 + B3) ; PROPAGATE SIGNAL 
* (A2 + B2) 
* (Al + Bl) 
* (AO + BO) 
IF(VCC) G = (A3*B3) ; GENERATE SIGNAL 
+ (A3 + B3) * (A2*B2) 
+ (A3 + B3) * (A2 + B2) * (Al*B1) 
+ (A3 + B3) * (A2 + B2) * (Al + Bl) * (AO*BO) 


CARRYO.EQU ADD * CIN 


CARRY1.EQU ADD * (AO*BO) 
+ ADD * (AO + BO) * CIN 


CARRY2.EQU ADD * (Al*B2) 
+ ADD * (Al + Bl) 
+ ADD * (Al + Bl) 


* 


(A0*BO) 
(AO + BO) * CIN 


* 


CARRY3.EQU ADD * (A2*B2) 
+ ADD * (A2 + B2) * (A1*Bl) ‘ 
+ ADD * (A2 + B2) * (Al + Bl) * (AO*BO 
+ ADD * (A2 + B2) * (Al + Bl) * (AO + BO) * CIN 
/A3 := (/A3)*/LOAD * /ADD ;HOLD REG A3 
+ (/B3)* LOAD ;LOAD’ REG B3 
+ (A3:*:B3)*AD ;ADD A3 AND B3 
s+: CARRY3 
/A2 := (/A2)*/LOAD * /ADD ;HOLD REG A2 
+ (/B2)* LOAD ;LOAD REG B2 
+ (A2:*:B2)*ADD ;ADD A2 AND B2 
s+: CARRY2 
7Al := (/Al)*/LOAD * /ADD ;HOLD REG Al 
+ (/B1l)* LOAD ;LOAD REG Bl 
+ (Al:*:B1)*ADD ;ADD Al AND Bl 
:+: CARRY1 
i /AO := (/A0)*/LOAD * /ADD ;HOLD REG AO 
+ (/BO)* LOAD ;LOAD REG BO 
+ (AO:*:BO)*ADD ;ADD AO AND BO 
:+: CARRYO 


FUNCTION TABLE 


7 NOT INCLUDED HERE 


Figure 28. A Four-Bit Adder Implemented in a PAL16X4 
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Latches 


PAL devices are often used to implement latches. One of the 
most common uses for latches is as temporary storage for data 
or addresses. PLD-based latches are often used in address 
decoders to assert the decoded signal for long durations. These 
latches are also very useful for asynchronous digital designs, and 
are used often for contro! and arbitration functions. 


A latch is essentially a simple combinatorial circuit in which the 
output is a function of inputs and feedback. The most commonly 
used latch is the D-type latch. When the control signal latch- 
enable (LEN) is HIGH, the latch is in the “transparent mode” and 
the input signal /D is available at the outputs. When the LEN 
signal is LOW, the input data is latched on the outputs and is 
retained until LEN goes back HIGH. In atypical address decoder, 
the input will be a combination of various address signals, 
decoded as explained earlier for range comparators. The latch- 
ing signal in most microprocessors is called AS (address strobe) 
or ALE (address latch enable). 


The truth table for a latch can be derived directly from this 
functional description, and is shown in Figure 29. 


INPUTS OUTPUTS 


0 1 0 | 
1 1 1 
X 0 /Q (previous) 


Figure 29. Truth Table of a Simple Latch 










The Boolean equations for this latch can be directly derived from 
the truth table: 


/Q = /D * LEN 
+ /Q * /LEN 


The logic implementation for this latch is shown in Figure 30. 


LEN 


o! 


405 30 


Figure 30. A Transparent Latch 


Hazards 


Even when a combinatorial circuit has been designed correctly, 
it may still have erroneous outputs due to “hazards”. Hazards 
exist because physical circuits do not behave ideally. Combina- 
torial complementary output functions based on the same inputs 


are prime candidates for such hazards. As the input changes, the 
two outputs will not respond simultaneously and change state at 
the same instant. Although this will not change the steady-state 
output of the circuit, it may cause a spurious pulse or a “glitch”. 
Such hazards are even more dangerous in latches, where the 
glitch can cause incorrect data to be latched. 


There are two types of hazards, static and dynamic. Static 
hazards occur when the steady-state output of combinatorial 
logic is not supposed to change due to an input transition, but a 
momentary change does occur. Such a glitch can be further 
classified as a static 1 or a static O hazard as shown in Figure 31. 


= LU 


STATIC 0 HAZARD STATIC 1 HAZARD 


405 31 
Figure 31. Static Hazards 


Dynamic hazards involve situations where the the steady-state 
output is supposed to change due to an input transition. The 
hazard occurs when the transient output changes several times 
before settling. Figure 32 shows dynamic hazards. 


405 32 
Figure 32. Dynamic Hazards 


A Karnaugh map is a very good way of detecting hazard condi- 
tions. When trying to detect a static 0 or static 1 hazard, only the 
mapping of the zeros and the ones, respectively, are required. 
For example, the latch equations in Figure 30 can be mapped to 
a Karnaugh map shown in Figure 33. The relationship between 
the Karnaugh mapping and the Boolean equation product terms 
is also illustrated. 


DLEN 





00 01 11 10 
COVERING 
HAZARD 


EN =) 408 33 


Figure 33. Karnaugh Map for Transparent Latch Design 
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The possibility of a hazard exists when the signal LEN changes. 
Initially, when D and LEN are HIGH, the output Q is also HIGH. 
When LEN switches to LOW, it is possible for the output to go 
LOW momentarily. This is because when LEN goes LOW the first 
product term is disabled, and to maintain the output HIGH the 
second product term should be enabled exactly at the same 
instant. Due to the uneven gate delays or routing conditions on 
the board, these two events will not take place simultaneously. 
This is a static 1 hazard. It can also be identified directly in the 
Karnaugh map by the two adjacent but disjoint sets of ones, 
grouped together to form a product term each. 


The hazard conditions can be easily avoided in the PLDs by 
providing an extracover productterm. This product term is shown 
with a dotted line in the Karnaugh map. This third product term 
will Keep the output asserted during the transition of the LEN 
signal, when the control changes from the first product term to the 
second. The modified Boolean equation is shown above. 





TITLE CLEAN OCTAL LATCH 
PATTERN P7096 

REVISION A 

AUTHOR VINCENT COLI 

COMPANY MONOLITHIC MEMORIES, INC. 
DATE 03/10/83 


CHIP OCT_LAT PAL201L10 


/Q = /D * LEN 
+ /Q * /LEN 
+ /D * /Q (Cover product term) 

Devices on which latches are implemented need to provide 

output feedback. Alldevices with I/O pins provide this necessary 

feedback. The only other consideration for selecting a device 

would be the provision of sufficient number of product terms for 

addressing the needs of glitch-free and testable design. 


Acomplete latch design implemented in a PAL20L10 is shown in 
Figure 34. Although the design is now glitch-free, the cover 
product term introduces further complications in the testability of 
this circuit. This design does not have the additional circuitry to 
address the testability requirements. Details on designing for 
testability are discussed on page 3-108. 





G NC DO D1 D2 D3 D4 D5 D6 D7 NC GND 
/OC NC Q7 Q6 Q5 Q4 Q3 Q2 Q1 QO NC VCC 


;DESCRIPTION 


;THIS PAL16L8 IMPLEMENTS AN 8-BIT LATCH FUNCTION WITH THREE-STATE OUTPUTS. 
;THE LATCH PASSES EIGHT BITS OF DATA (D7-D0) TO THE EIGHT OUTPUTS (Q7-Q0) 
7;WHEN LATCH ENABLE IS TRUE (G=HIGH). THE DATA IS LATCHED WHEN LATCH ENABLE 
;IS FALSE (G=LOW). THE OUTPUTS WILL BE DISABLED (HI-Z) WHEN OUTPUT ENABLE 
;IS TRUE (/OC=TRUE) REGARDLESS OF ANY OTHER INPUTS. 


;/OC G D7-DO Q7-Q0 COMMENTS 

7 4H xX > 4 Z HI-Z 

7 L L x Q LATCH OUTPUT 
7 L H D D LOAD LATCH 


;THIS DESIGN SHOWS HOW TO IMPLEMENT A "CLEAN" LATCH SINCE THERE ARE NO 
;OUTPUT GLITCHES AS THE DEVICE CHANGES STATE. 


Figure 34. Clean Octal Latch Design 
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EQUATIONS 


/Q0 


++i 


G*/DO 
/G*/Q0 
/D0*/Q0 


QO.TRST = OC 


/Ql 


++ 


G*/D1 
/G*/Q1 
/D1*/Q1 


Ql1.TRST = OC 


/Q2 


+ 
+ 


G*/D2 
/G*/Q2 
/D2*/Q2 


Q2.TRST = OC 


/Q3 


+ +0 


G*/D3 
/G*/Q3 


/D3*/Q3 


Q3.TRST = OC 


/Q4 


+ + Il 


G*/D4 
/G*/Q4 
/D4*/Q4 


Q4.TRST = OC 


/Q5 


++ iil 


G*/D5 
/G*/Q5 
/D5*/Q5 


Q5.TRST = OC 


/Q6 


+ 
+ 


G*/D6 
/G*/Q6 
/D6*/Q6 


Q6.TRST = OC 


/Q7 


+ +N 


G*/D7 
/G*/Q7 
/D7*/Q7 


Q7.TRST = OC 


;LOAD LATCH (Q0) 


;LATCH OUTPUT 


;COVER ALWAYS HIGH 


;OUTPUT ENABLE 


;LOAD LATCH (Q1) 


;LATCH OUTPUT 


;COVER ALWAYS HIGH 


;OUTPUT ENABLE 


7;LOAD LATCH (Q2) 


7LATCH OUTPUT 


;COVER ALWAYS HIGH 


;OUTPUT ENABLE 


;LOAD LATCH (Q3) 


;LATCH OUTPUT 


;COVER ALWAYS HIGH 


;OUTPUT ENABLE 


;LOAD LATCH (Q4) 


;LATCH OUTPUT 


;COVER ALWAYS HIGH 


;OUTPUT ENABLE 


;LOAD LATCH (Q5) 


;LATCH OUTPUT 


;COVER ALWAYS HIGH 


;OUTPUT ENABLE 


;LOAD LATCH (Q6) 


;LATCH OUTPUT 


;COVER ALWAYS HIGH 


;OUTPUT ENABLE 


;LOAD LATCH (Q7) 


;LATCH OUTPUT 


;COVER ALWAYS HIGH 


;OUTPUT ENABLE 


; SIMULATION NOT INCLUDED HERE 


HAZARD 


HAZARD 


HAZARD 


HAZARD 


HAZARD 


HAZARD 


HAZARD 


HAZARD 


Figure 34. Clean Octal Latch Design (Cont'd.) 
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In the previous section we discussed combinatorial designs, called flip-flops. A bank of these flip-flops forms a register, 
circuits whose outputs are totally independent of any system although individual flip-flops are often called registers. Monolithic 
clock. In this section we will discuss sequential circuits, where Memories and Advanced Micro Devices offer a number of PLDs 
outputs store their previous values until a new clock is applied. which provide registered outputs. The table in Figure 1 lists all of 
The storage elements which retain the previous output values are the PLDs which provide registered outputs. 


DEVICE TOTAL DEDICATED DEDICATED LOGICAL SPEED GRADES 
NUMBER INPUT REGISTERED PRODUCT FREQUENCY IN 

OF PINS PINS OUTPUT TERMS/ MHz 
PINS OUTPUT f 


MAX 





55.5, 37, 25, 16 
55.5, 37, 25, 16 
55.5, 37, 25, 16 


oK@OO WO © © © 


8 
8 
8 
8 
8 
8 
8 
8 
9 


8 
6 
4 
8 
6 
4 
0-8 
4 
4 
8 8 
6 8 
4 8 
10 4 
8 4 
4 4 
10 
8 
6 
4 
10 
8 
6 
4 
10 
8 S 
4 és 


v 
_ 





37, 25, 16 
37, 25, 16 


20RP10 
20RP8 
20RP6 
20RP4 
20XRP10 
20XRP8 
20XRP6 
20XRP4 


30, 22.2, 14 
30, 22.2, 14 
30, 22.2, 14 


OM: OFM 
awmnnaownwnwna on 





19.2 
19.2 
19.2 


20RS10 
20RS8 
20RS4 





33, 20 
28.5 
40, 28.5, 18 
25, 22.2 
16 





PLS167 Total 48 
PLS168 Total 48 


Figure 1. Registered PLDs 
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DEVICE TOTAL DEDICATED 
NUMBER INPUT 


OF PINS PINS 














8 
6 
4 
8 
6 
4 


C22V10 
C29M16 
C29MA16 


10H20EV/EG8 
10020EV/EG8 





DEDICATED 
REGISTERED 
OUTPUT 
PINS 


LOGICAL 
PRODUCT 
TERMS/ 
OUTPUT 


SPEED GRADES 
FREQUENCY IN 











owmwonwowneo 


Figure 1. Registered PLDs (Cont‘d.) 


Before we discuss purely registered designs, let us take a look at 
designs which combine both registered and combinatorial por- 
tions. Registered and combinatorial outputs are often mixed on 
asingledevice. There can be two distinct designs, one registered 
and one combinatorial (often glue logic) combined on a single 
device for higher integration. There may also be a design 
requirement where registered outputs need to be decoded using 
combinatorial logic. 


There are a number of devices which provide both registered and 
combinatorial outputs. Some devices provide programmable 


register bypass, which allows outputs to be programmed as 
registered or combinatorial. Figure 2 shows a list of PLDs with 
both registered and combinatorial outputs. Figure 3 shows alist 
of PLDs with programmable register bypass. 


In most design software packages the output registers are 
signified by the “:=” assignment symbol, as opposed to the “=” 
sign for acombinatorialoutput. This helps easily identify registers 
in each equation. In devices which provide outputs configurable 
as either registered or combinatorial, this sign is also used by the 
software to configure the outputs. 
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DEVICE TOTAL DEDICATED DEDICATED Iie) LOGICAL SPEED GRADES 
NUMBER INPUT REGISTERED PINS PRODUCT FREQUENCY IN 
OF PINS PINS OUTPUT TERMS/ MHz 

PINS OUTPUT Le 
2 8 55.5, 37, 25, 16 
4 8 55.5, 37, 25, 16 
2 8 22.2 
4 8 22.2 
8-0 4 20 
4 8 14 
4 8-12 33.3, 28.5 
2 8 37, 25, 16 
4 8 37, 25, 16 
2 4 
6 4 


20RP8 
20RP6 
20RP4 

20XRP8 
20XRP6 
20XRP4 





20RA10 
22RX8 
22V10 
32VX10 
32R16 
























30, 22.2, 14 
30, 22.2, 14 
30, 22.2, 14 


33, 20 
28.5 
40, 28.5, 18 
25, 22.2 
16 





























20 8 6 2 8 
20 8 4 4 8 
a i 6 2 8 
4 4 8 
C29M16 24 - 16 8-16 20, 15 
C29MA16 24 - 16 4-12 20, 15 
ECL 
Latched 
10H20G8 24 12 0-8 8-0 4-8 N.A. (latch) 
10H/020EG8 24 12 0-8 8-0 8-12 N.A. (latch) 
10H/020EV8 24 12 0-8 8-0 8-12 125 
Figure 2. PLDs with Both Registered and Combinatorial Outputs 
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DEVICE TOTAL DEDICATED DEDICATED vo LOGICAL SPEED GRADES 
NUMBER INPUT REGISTERED PINS PRODUCT FREQUENCY IN 
OF PINS PINS OUTPUT TERMS/ MHz 
PINS OUTPUT fe 
TTL 
16RA8 20 
23S8 33.3, 28.5 
20RA10 33, 20 
22RX8 28.5 
22V10 40, 28.5, 18 
32VX10 25, 22.2 
32R16 16 
CMOS 
C22V10 24 12 0-10 10-0 8-16 33.3, 20 
C29M16 24 5 16 8-16 20, 15 
C29MA16 24 5 16 4-12 20, 15 
ECL 
Latched 
10H20G8 24 12 0-8 N.A. (latch) 
10H/020EG8 24 12 0-8 N.A. (latch) 
10H/020EV8 24 12 0-8 125 











Figure 3. PLDs with Programmable Register Bypass 


General Device Selection 
Considerations 


The same set of general device selection considerations dis- 
cussed in the PLD design methodology section (page 2-21) apply 
to registered designs. The list of items which must be considered 
is repeated in Figure 4 for convenience. A device can be 
conveniently selected from the table of registered PLDs 
(Figure 1) based upon the specific input and output requirements. 


. Number of input pins 

. Number of output pins 

. Number of I/O pins 

. Device speed 

Device power requirements 
. Number of registers 

. Number of product terms 

. Output polarity control 


ONOMAONM= 


Figure 4, General Device Selection Considerations 


Maximum Frequency 


For registered designs, speed is aparameter which needs careful 
consideration. Most combinatorial designs use the propagation 
delay (top) for ensuring that enough time is allowed for the data 
fromthe inputs to appear at the outputs. In registered designs the 
effects of the clock must be taken into account. This is reflected 


in the maximum frequency (f,,,,) parameter, The flexibility inher- 
ent in PLD design provides a choice of configurations from which 
different f,,,, Parameters can be calculated. 


In the first type of design, the PLD is used for a stand-alone 
registered design. In order to decide the next logic level of the 
registers, the present logic level needs to be available at the 
inputs of the registers before they are clocked (Figure 5). Under 
these conditions the clock period is limited by the internal delay 
from the flip-flop outputs through the internal feedback and logic 
to the flip-flops inputs (t, +to-). For this configuration: 


fyayt/(t,, + t,,.) : designated the internal £,.,. 


CLK 








406 01 


FMAx INTERNAL: W(t yt tor) 


Figure 5. Internal f,,,, 
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The second type of system configuration is when a number of 
logic devices with registers, including PLDs, are clocked with a 
commonclock. This is probably the most prevalent configuration. 
In this case the registered outputs are sent off-chip back to the 
device inputs or to the inputs of a second device. The slowest 
path required to set up an input (Figure 6) is the sum of the clock- 
to-output delay and the input setup time for the external signals 
(t,, + to,,)- The same speed grade devices are typically clocked 
together, which means that thet, and t,, , of all the devices will 
usually be the same. 


Thus for this configuration: 


=1/(t, +t 


fax cunt 

This f,,,, is the maximum frequency when external feedback is 
used or when several equivalent-speed devices are used. This 
fuay IS designated the external f,,,,. The tables for registered 
devices show this f,,,, parameter. 


The third type of design is a simple data path application. In this 
case, input data is presented to the flip-flop and clocked; no 
feedback is employed (Figure 7). In this case, the period is limited 
by the sum of data setup time and data hold time (t, +t,). 
However, the minimum clock period (t,,,+t,,, ) is usually a stricter 
limit. Thus the third f designated “f no feedback’ will be 
the lesser of: 


MAX’ MAX’ 


17 tt. oA) 1/ (t,, + t,.) 


Filp-flop Types 


There are four basic types of flip-flops; S-R, J-K, T and the popular 
D-type. Details on the various types of flip-flops are provided in 
the logic reference (page 6-8). In this section we will assume that 
the reader is familiar with these flip-flops. 


Almost all registered PLDs provide the basic D-type flip-flops. 
D-type flip-flops are the simplest to design with and will be used 
throughout this section. Some PLDs provide the capability of 
configuring output registers as either D, T, J-K or S-R. Configur- 
able flip-flops in some cases can reduce the number of product 
terms required for certain designs. The effect of the configurable 
flip-flops will be discussed wherever relevant. 


Synchronous vs Asynchronous 


Registered designs can be easily classified into two categories; 
synchronous and asynchronous. In synchronous designs the 
clock inputs of all the registers are tied together to a common 
clock. This is the most commonly used type of registered design. 
With asynchronous designs, the flip-flops’ clock inputs may not be 
tied together, and the clocks may be gated oreven driven by other 
flip-flops. We will first discuss synchronous registered designs 
and then asynchronous registered designs. 


CLK 





LOGIC is REGISTER 


(SECOND CHIP) 


f wax EXTERNAL: 1(tsu+ tork) 


406 02 


Figure 6. External f,,, 


406 G 





CLK 


V 


REGISTER 


tax NO FEEDBACK: 1/(tsu+ th) OR W(twn + twi) 
Figure 7. f,,, with No Feedback 
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Synchronous Registered Designs 


Synchronous registered designs are used for two major func- 
tions: data handling and control. Registered synchronous de- 
signs for data handling include counters and shift registers. There 
are various types of counters, among them binary counters, 
modulo counters, Johnson counters, and Gray-code counters. 
These counters are differentiated by the sequence of values 
through which the counter travels. A binary counter is the 
simplest form of a counter, and is used most often for data 
functions. Any system requiring a regular count uses a binary 
counter. Modulo, Gray-code and Johnson counters are also used 
for control. 


Allcounters are actually subsets of alargerclass of digital designs 
called state machines. State machines are discussed in detail in 
the next chapter of this handbook. 


CONTROL 
INPUTS NEXT 
STATE 
(COUNT) 
DECODER 


COMBINATORIAL 
LOGIC 


Counters 


Counters are the most commonly used sequential circuits. Aset 
of registers which cycles through a predetermined, unvarying 
sequence is called acounter. Ageneral model of a synchronous 
counter is illustrated in Figure 8. This shows a common clock to 
all the flip-flops, whose outputs are fed back to a combinatorial 
logic array called the next-state (count) decoder. The next count 
is generated by this logic based upon the present count and 
control inputs. Most PLDs use the standard sum-of-products 
form of array for this logic. , 


The relationship between a four-bit counter and its signal timing 
diagram is illustrated in Figure 9. The counters can also be 
represented by state diagrams (Figure 10). The state diagrams 
are bubble-and-arrow diagrams, with each bubble representing a 
count value and each arrow a transition from one count to the 
next. More detail on state diagrams is given in the next chapter 
on state machine design. For counters, the state diagrams are a 
convenient representation tool and will be used in the discussion 
when necessary. 


CLK 





OUTPUTS 


406 04 


Figure 8. General Model of a Counter 


CLK 


Xp 


Xy 


XQ 


STATE 0 1 2 3 4 5 6 7 8 


(COUNT) 














406 05 


Figure 9. Timing Diagram of a Four-bit Binary Counter 
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STATE 0 
0000 
STATE 1 
0001 














STATE 2 
0010 










SIXTEEN STATES NUMBERED FROM 0-15 


STATE 4 
0100 













STATE 10 
1010 





STATE 6 
0110 





STATE 9 
1001 





STATE 8 
1000 








406 06 

Figure 10. State Diagram of a Four-bit Binary Counter 
Binary Counters PRESENT STATE NEXT STATE 
Let us examine a four-bit binary counter. The truth table (also x3 X2 X1~ XO XO 


called the transition table) for such a counter is given in Figure 11. 
The table lists the next state values of all the output registers 
based upon their present values. 














or-t-]/00+f+?t0 0O2+00++_0 
o-o+o0+-c0+-toe+-c0+0=0—- 


0 
0 
0 
0 
0 
0 
0 
0 
1 
1 
1 
1 
1 
1 
1 
1 


“~]@4-4 0000442440000 
—"w?}\S/ 00+? 00 0+00(+00 
—~o-OoO-"OoO-0+0-+0+0+-0 
onAreste Ss oooclmOlcw Vt lc Ht HK HOO FD 





Figure 11. The Truth Table for a Four-bit Binary Counter 
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We derive Boolean equations for each bit directly from the above 
truth table by collecting all the product terms where outputs are 
asserted HIGH (ones). This yields: 


X3:= /X3 * X2* Xl * XO 
+ X3 * /X2 * /X1 * /X0O 
+ X3 * /X2 * /X1 * XO 
+ X3 * /X2 * X1 * /X0 
+ X3 * /X2 * X1* XO 
+ X3 * X2 * /X1 * /X0O 
+ X3 * X2 * /X1 * XO 
+ X3 * X2 * X1 * /X0O 
X2 := /X3 * /X2 * X1 * XO 
+ /X3 * X2 * /X1 * /X0 
+ /X3 * X2 * /X1 * XO 
+ /X3 * X2 * X11 * /XO 
+ X3 * /X2 * X1* XO 
+ X3 * X2 * /X1 * /XO 
+ X3 * X2 * /X1 * XO 
+ X3 * X2 * X1 * /X0 
Xl:= /X3 * /X2 * /X1 * XO 
+ /X3 * /X2 * X1 * /X0 
+ /X3 * X2 * /X1 * XO 
+ /X3 * X2 * Xl * /X0 
+ X3 * /X2 * /X1 * XO 
+ X3 * /X2 * X1* /X0 
+ X3 * X2 * /X1 * XO 
+ X3 * X2 * X1 * /X0O 
XO := /X3 * /X2 * /X1 * /X0 
+ /X3 * /X2 * X1 * /XO 
+ /X3 * X2 * /X1 * /X0O 
+ /X3 * X2 * X1 * /xX0 
+ X3 * /X2 * /X1 * /X0 
+ X3 * /X2 * X1 * /X0 
+ X3 * X2 * /X1 * /XO 
+ X3 * X2 * X11 * /x0 


These Boolean equations are for devices with active-HIGH 
outputs. These equations can be inverted for devices with active- 
LOW outputs. The Boolean equations for active-LOW devices 
can also be directly derived from the truth table by collecting allthe 
product terms where the active-LOW outputs (zeros) are 
asserted. 


Manipulating the equations with Boolean algebra we obtain the 
Boolean logic equations: 


XO := /X0 

X1 := X1 :+: XO 

X2 := X2 2+: (X1 * XQ) 

X3 := X3 s+: (X2 * X1 * XO) 


Similarly for active-LOW output devices 
(since /(A :+: B) = /A :+: B): 


These equations could also be obtained from the Boolean equa- 
tions developed for an adder in the combinatorial design section 
on page 2-54. 


Rewriting the equations for an adder: 
XO = AO :+: BO :+: Cin 


Xl = Al :+: Bl :+: CO 
;where 
cO = AO * BO +(AO + BO) * Cin 


X2 = A2 :+: B2 :+: Cl 
;where 
Cl = Al * Bl +(Al + B1)*(AO * BO) 
+(Al + B1)*(AO + BO) * Cin 


X3 = A3 :+: B3 :+: C2 
;where 
C2 = A2 * B2 +(A2 + B2)*(Al * B1) 
+(A2 + B2)* (Al + B1)* (AO * BO) 
+(A2 + B2)*(Al + B1l)*(AO + BO) * Cin 


Assuming one of the operands in the adder is the number itself 
and the second operand is one (X3-X0 = A3-A0, B3-BO = 0001 
and Cin = 0) we get the following equations for a counter: 


/X0 

X1 s+: X0 

X2 3+: (X1 * XO) 

X3 s+: (X2 * X1 * XO) 


* 
a 
1oboo iu 


These are of course the same equations as the ones derived 
directly from the truth table. The equations for a binary counter 
are very regular. The general equation for an n-bit binary counter 
can be directly expressed: 


Xn := Xn z+: (Xn—1 * Xn-2 ... XO) 


For devices with active-LOW outputs the general Boolean equa- 
tions can be derived by inverting both sides of the equation: 


[Xn :=/Xn t+: (Xn-1 * Xn-2 ... XO) 


These equations represent a binary UP counter. Counting 
backwards for a DOWN counter, the Boolean equations can be 
similarly generated, either from the truth table or from the adder 
Boolean equations. The general equation for a DOWN 
counter is: 


Xn := Xn 3+: (/Xn-1 * /Xn-2 ... XO) 


This equation is for active-HIGH outputs. For active-LOW output 
devices the Boolean equation for a DOWN counter is: 


/Xn = /Xn r+: (Xn-1 * /Xn-2 ... /X0) 


A four-bit binary counter which combines both UP and DOWN 
capability is illustrated in Figure 12. This design is implemented 
in a registered PLD and requires at least six product terms for the 
most significant output bit. The design logic has been imple- 
mented in sum-of-products form. It uses a control input (UP) 
which selects between the “count up” and “count down” product 
terms, allowing the counter to count up or down. 





7X0 := XO 

/X1 := /X1 :+: XO 

/X2 := /X2 :+: (X1 * XO) 

/X3 3= /X3 s+: (X2 * X1 * XO) 
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Further control functions can be added to these counter equa- another input signal (LOAD). When the LOAD signal is HIGH the 
tions directly either at the truth-table stage or in the equations. For counter is loaded with the input data, and when the LOAD signal 
example, a load data function is required in most counters. This is LOW the counting is resumed. 


allows registers to be loaded with a count under the control of 





Title 4Bit_ Counter 

Pattern 4cnt.pds 

Revision B 

Author Mehrnaz Hada, Bill Hollon, Ali Sebt 
Company Monolithic Memories Inc. Santa Clara, CA 
Date 1/14/85 


CHIP 4BitCoun PAL16RP4 


CLK UP AI BI CI DI CLR LOAD NC GND 
7/OC NC NC DC BA NC NC VCC 


;The 4-bit counter counts up or down and has the clear and 
;load capability. The clear operation overrides count and 
;load. The counter counts up when CLR=low, LOAD=low, and 
;UP=high. It counts down whenever CLR=low, LOAD=low, and 


;UP=low 
EQUATIONS 
A 3:= /A*/B*/C*/D*/UP* /LOAD*/CLR ;When CLR=l, A=0. 
+ /A* B* Ck D* UP*/LOAD*/CLR ;Else it will count 
+ A* Bt /UP*/LOAD*/CLR ;UP or DOWN. 
+ A*/B* D* /LOAD*/CLR 
+ Ak  /Ck UP* /LOAD* /CLR 
+ Ax c*/D* /LOAD* /CLR 
+ LOAD*/CLR* AI ;New value is loaded 
;when LOAD=1, CLR=0. 
B r= /B*/C*/D* /UP*/LOAD* /CLR ;When CLR=1, B=0. 
+ /B* C* D* UP*/LOAD*/CLR ;Else it will count. 
+ B* C*/D* /LOAD* /CLR 
+ B*/C* UP* /LOAD* /CLR 
+ B* D*/UP*/LOAD*/CLR 
+ LOAD*/CLR* BI ;New value is loaded 
;when LOAD=1, CLR=0. 
CoG /C*/D* /UP* /LOAD* /CLR ;When CLR=1, C=0. 
+ /C* D* UP*/LOAD*/CLR ;Else it will count. 
+ C*/D* UP*/LOAD*/CLR 
+ c* D*/UP*/LOAD* /CLR 
+ LOAD*/CLR* CI ;New value is loaded 
;when LOAD=1, CLR=0. 
D : /D* /LOAD* /CLR ;Count 


+H 


LOAD*/CLR* DI ;New value is loaded 
swhen LOAD=1, CLR=0. 


Figure 12. Design File for a Four-bit Counter 
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SIMULATION 


TRACE ON AI BI CI DI LOAD CLR UP ABCD 


SETF LOAD /CLR AI BI CI DI OC 


CLOCKF CLK 


SETF CLR 
CLOCKF CLK 


SETF /CLR UP /LOAD 


FOR I:= 1 TO 16 DO 
BEGIN 

CLOCKF CLK 
END 


SETF LOAD /CLR /UP AI BI CI DI 


CLOCKF 
SETP /LOAD 
POR I:= 1 TO 16 DO 
BEGIN 
CLOCKF CLK 
END 


SETF LOAD CLR AI /BI CI /DI 
CLOCKF CLK 


SETF /OC 


TRACE_OFF 


;Load all registers 
;to HIGH and count up 


:Clear all registers 


;Start Counting up 


;Count up 16 clock 
;cycles 


;Load all registers 
;to HIGH and count 
;down 

;Count down 16 clock 
;cycles 


;Test setting LOAD 
;and CLR on at the 
;Same time. 


Figure 12. Design File for a Four-bit Counter (Cont’d.) 


Binary Counter Device Selection Considerations 


One major device selection consideration is the logic require- 
ment. The binary counter Boolean equations make use of 
exclusive-OR functions in the output. In most of the registered 
PLDs shown in Figure 1, the XOR functions are implemented in 
their sum-of-products logic form. This usually requires a large 
number of product terms. We have seen a design example fora 
four-bit UP/DOWN binary counter which requires up to six prod- 
uct terms. Most standard PAL devices provide eight product 
terms per output. However, for larger counters, a large number 
of product terms is required. Figure 13 shows alist of PLDs which 
provide more than eight product terms per output. 


Some PLDs provide a dedicated XOR gate on the outputs (Figure 
14). This allows an AND-OR-XOR implementation of the Boolean 
logic, and consequently requires fewer product terms. Anine-bit 
binary counter design is illustrated in Figure 15. This design has 
been implemented in the PAL20X10, which provides an XOR 
gate. This design cannot be implemented in standard PAL 
devices. 


Cascading Binary Counters 


Situations are occasionally encountered in digitalsystem designs 
where very long counters are required. Few PLDs provide 
greater than ten outputs to implement such counters. Binary 
counters can be easily cascaded into two or more devices to 
construct such large counters. The design of long counters is very 
simple. These are designed as simple binary counters with a 
count enable control. The less significant counters generate an 
extra output signal at the penultimate count. These signals are 
ANDed together to form the ccunt enable signal for the higher- 
order counter. For a down counter the reverse scheme is 
implemented. 


Cascading counters is a lot simpler than cascading adders 
because the carry-look-ahead circuitry is not required. The only 
thing to rememberis that the more significant counter toggles only 
when the penultimate count of all of the less significant counters 
is reached. 


Notice that the nine-bit counter has a built-in mechanism for cas- 
cading. This count enable output signalis represented by CO and 
is easily implemented in the PLD. This is just another example 
of how such modifications are easy to handle in a PLD design, 
unlike standard SSI/MS]! designs. 
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DEVICE TOTAL DEDICATED DEDICATED vO LOGICAL SPEED GRADES 
NUMBER INPUT REGISTERED PINS PRODUCT FREQUENCY IN 
OF PINS PINS OUTPUT 

PINS 


TERMS/ MHz 


OUTPUT a 


20RS10 8-16 19.2 
20RS8 2 8-16 19.2 
20RS4 8-16 19.2 
22V10 24 12 0-10 10-0 8-16 40, 28.5, 18 
32VX10 24 12 0-10 10-0 8-16 25, 22.2 
PLS167 24 14 Total 48 33 
PLS168 24 12 Total 48 33 


-pisios | 05 | Total 48 48 











CMOS 


C29M16 24 16 -16 20, 15 
C29MA16 24 16 -12 20, 15 


10H20EV/EG8 24 12 0-8 8-0 8-12 125 
10020EV/EG8 24 12 0-8 8-0 8-12 125 


Figure 13. Registered PLDs with More Than Eight Product Terms Per Output 



































DEVICE TOTAL DEDICATED | DEDICATED vo LOGICAL SPEED GRADES 
NUMBER INPUT REGISTERED PINS PRODUCT FREQUENCY IN 
OF PINS PINS OUTPUT TERMS/ MHz 
PINS OUTPUT ies 
TTL 
4 8 14 
- 4 22.2 
2 4 22.2 
4 10 6 4 22.2 
20XRP10 24 10 - 8 30, 22.2, 14 
20XRP8 24 10 2 8 30, 22.2, 14 
20XRP6 24 10 4 8 30, 22.2, 14 
20XRP4 24 10 6 8 30, 22.2, 14 
22RX8 24 14 0-8 8-0 8 28.5 
32VX10 24 12 0-10 10-0 8-16 25, 22.2 


Figure 14. Registered PLDs for Binary Counter Design with XOR Gate Output 
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Title 9BitCounter 

Pattern 9BitCnt.pds 

Revision A 

Author Mehrnaz Hada 

Company Monolithic Memories Inc., Santa Clara, CA 
Date 1/28/85 


;The 9-bit synchronous counter has parallel load, increment, 
sand hold capabilities. The carry out pin (/CO) shows how to 
;implement a carry out using a register by anticipating one 
;count before the terminal count if counting and the terminal 
;count if loading. 


;Operations Table 


3 {OC CLK /LD D8-D0 Q8-00 Operation 
: H xX xX xX Z HI-Z 
; L L xX X Q Hold 
; L Cc L D D Load 
: L c H x Q PLUS 1 Increment 


CHIP 9BitCoun PAL20X10 


CLK DO Dl D2 D3 D4 D5 D6 D7 D8 /LD GND 
/OC /CO Q8 Q7 Q6 Q5 Q4 Q3 Q2 Ql QO VCC 


EQUATIONS 

/Q0 := /LD*/Q0 ;Hold QO 
+ LD*/DO jLoad DO (LSB) 
t+: /LD ;Count 

/Q1 := /LD*/Q1 ;Hold Ql 
+ LD*/D1 ;Load Dl 
7+: /LD* QO ;Count 

/Q2 := /LD*/Q2 ;Hold Q2 
+  LD*/D2 ;Load D2 
z+: /LD* QO* Ql ;Count 

/Q3 := /LD*/Q3 sHold Q3 
+ LD*/D3 ;Load D3 
s+: /LD* QO* Q1l* Q2 ;Count 

/Q4 3:= /LD*/Q4 :Hold Q4 
+ LD*/D4 ;Load D4 
st: /LD* QO* Q1* Q24% Q3 ;Count 

/Q5 := /LD*/Q5 Hold Q5 
+ LD*/D5 ;Load D5 
s+: /LD* QO* Q1l* Q2* Q3* Q4 sCount 

/Q6 := /LD*/Q6 ;Hold Q6 
+ LD* /D6 ;Load D6 
st: /LD* QO* Q1* Q2* Q3* Q4* Q5 sCount 

/Q7 := /LD*/Q7 3;Hold Q7 
+ LD*/D7 ;Load D7 
s+: /LD* QO* Q1* Q2* Q3* Q4* Q5* Q6 ;Count 

/Q8 := /LD*/Q8 :Hold Q8 
+ LD*/D8 ;Load D8 (MSB)} 
r+: /LD* QO* Q1* Q2* Q3* Q4* Q5* Q6* Q7 ;Count 

CO := /LD*/QO* Q1* Q2* Q3* Q4* Q5* Q6* Q7* QB ;Carry out 


:(Anticipate Count) 
+  LD* DO* Dl* D2* D3* D4* D5* D6* D7* DB = ;Carry out 
; (Anticipate Load) 
Figure 15. Design File for a Nine-bit Counter 
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TRACE ON /OC /LD D8 D7 D6 DS D4 D3 D2 Dil DO 
/CO Q8 Q7 Q6 Q5 Q4 Q3 Q2 Ql QD 


SETF OC LD /D8 /D7 /D6 /D5 /D4 /D3 /D2 /D1 /DO 


CLOCKF CLK 


SETF /LD 
CLOCKF CLK 


SETF LD /D8 
CLOCKF CLK 


SETF /LD 
CLOCKF CLK 


SETF LD /D8 
CLOCKF CLK 


SETF /LD 
CLOCKF CLK 


SETF LD /D8 
CLOCKF CLK 


SETF /LD 
CLOCKF CLK 


SETF LD /D8 
CLOCKF CLK 


SETF /LD 
CLOCKF CLK 


SETF LD /D8 
CLOCKF CLK 


SEIF /LD 
CLOCKF CLK 


TRACE_OFF 


/D7 


/D7 


/D7 


/D7 


D7 


/D6 /D5 /D4 /D3 /D2 /D1 DO 


/D6 /D5 /D4 /D3 D2 Dl DO 


/D6 /DS /D4 D3 D2 D1 DO 


/D6 /DS5 D4 D3 D2 D1 DO 


D6 D5 D4 D3 D2 D1 DO 


3; Load 


; Increment 


;Load 


;Increment 


;Load 


; Increment 


;Load 


; Increment 


; Load 


; Increment 


;Load 


; Increment 


Figure 15. Design File for a Nine-bit Counter (Cont’d.) 
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Filp-flop Selection 


Until now, all of the designs have been implemented in devices the functionality of the J-K and T-type registers. For the J-K 
with D-type flip-flops. What happens if the counter design is register the output is asserted when the J input goes HIGH, and 
implemented in a device which allows both J-K and T-type the output is unasserted when the K input goes HIGH. Toggle 
registers? The Boolean logic equations for such a design can be type registers require the T input to be asserted for every change 
derived from the truth table. This requires advance knowledge of in the output level. 


PRESENT STATE NEXT STATE 
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Figure 16. Truth Table for D, J-K and T-type Flip-flops 


Figure 16 shows the truth table for both aJ-K and a T-type register Which flip-flop is most efficient depends on the relative number of 


implementation for a binary counter. Deriving and optimizing the transitions or holds required. As a counter traverses from one 
equations from the table, we get the following results: count (state) to another, every output either makes a “transition” 

(changes logic level) or “holds” (stays at the same logic level). 
X3-J := /X3 * X2 * X1 * XO Small counters in general require more transitions and fewer 
X3-K := X3 * X2 * X1 * XO holds. As the designs get larger, the higher-order bits require 


fewer transitions and more holds. 
X2-J := /X2 * X1 * XO 


X2-K := X2 * X1 * XO D-type flip-flops use up product terms only for active transitions 
from logic LOW level to HIGH level, and for logic HIGH level holds 
X1-J := /X1 * XO only. J-K and T-type flip-flops use up product terms for both LOW- 
X1-K := X1 * XO to-HIGH and HIGH-to-LOW transitions, but eliminate hold terms. 
Generally, the requirements of transition and hold terms depends 
X0-J := /X0 upon the count sequence selection. D-type flip-flops are more 
X0-K := X0 efficient for small designs. Conversely J-K and T-type flip-flops 
can be more efficient for large designs, which require more hold 
X3-T := X2 * X1 * XO terms. 
X2-T := X1 * XO 
X1-T := XO Acomparison of product term requirements of 2-, 3-, 4- and 5-bit 
X0-T := 1 binary counters can be representative for other types of counters 
and state machines. The table in Figure 17 shows the transition 
As wecansee from these equations, the number of productterms terms and the hold terms required for these counters. For aJ-K 
used for J-K and T-type implementations is smaller than the type flip-flop implementation, after optimizing, total product terms 
number of product terms required for a D-type implementation required are 4, 6, 8, and 10 respectively. The D-type implemen- 
(Figure 12). tation requires 3, 6, 10, and 15 respectively, and is relatively less 


efficient for large counters. 
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BINARY COUNTER TRANSITIONS HOLDS D PRODUCT J-K PRODUCT 
TERMS TERMS 
6 


2 3 
14 10 6 
30 34 10 
98 15 





T PRODUCT 
TERMS 
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Figure 17. Product Term Requirements for Configurable Flip-flops 


There area number of PLDs which offer the capability of program- 
mable D, J-K and T-type registers. Figure 18 shows a list of all 
PLDs which offer configurable flip-flops. The J-K and T-type flip- 


DEVICE 


TOTAL DEDICATED 
NUMBER INPUT 
OF PINS PINS 





DEDICATED 
REGISTERED 
OUTPUT 
PINS 





flops are constructed from a basic D-type flip-flop using an extra 
XOR gate per flip-flop. 


LOGICAL SPEED GRADES 

PRODUCT FREQUENCY IN 
TERMS/ MHz 
OUTPUT 





Figure 18. Registered PLDs with Configurable Flip-flops 


Modulo Counters 


The number of unique states a counter traverses is generally 
referred to as the modulus. A typical n-bit binary counter has a 
maximum modulus of 2". It is often necessary to introduce signal 
delays into the logic design to meet timing requirements. This 
makes it possible to allow for bus-skew, access time or differential 
propagation delays between devices along two different signal 
paths. Atypical example of this is the introduction of wait states 
to allow for access times of different memory elements. Counters 
and delay lines are commonly used to introduce the delay. 
Counters in PLDs have the added advantage of programmability 
to select the required delay. Such applications where precise 
timing duration control is required usually use modulo counters 
with a non-power-of-two modulus. Other applications of modulo 
counters include waveform generators and arbiters. 






PRESENTSTATE | —=————si«|_—sCNEXT STATE 
Q3 G2 Qi Qo 

















“=“O0000000 
oo-==-=-0000 
oo-=00=-=900 
“o-0-0-0-0 
o--0000000 
oo00o]=-=-4000 
ecCoo-=-00-+-0 
o-0+-/0?0+-0- 





Figure 19. Truth Table for a BCD Counter 





A good example of a modulo counter is a BCD counter. Such a 
counter is usefulin applications where the computer's outputs are 
generated using a decimal system. While a four-bit binary 
counter can count up to sixteen, the BCD counter terminates the 
count at the modulus of 10. 


Modulo counters can be designed in a variety of ways. One direct 
way is to use the truth table to implement acount to a modulus and 
directly derive the equations from it. The truth table for a BCD 
count (from zero to nine) is shown in Figure 19. 


Now let us consider what happens if the device accidentally 
powers up in one of the count values from ten to fifteen. These 
are illegal counts (states) and, for a good design, a mechanism 
must be built into the equations to allow it to recover back into a 
legal state. What we actually need is to consider the truth table 
in Figure 20 in conjunction with the one in Figure 19 for deriving 
the Boolean equations. 


PRESENT STATE NEXT STATE 


Q3 Q2 Q1 QO ee Q3 Q2 Qi ao 





Figure 20. Truth Table for Illegal State Recovery to 
Count Zero 
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A state diagram for the BCD counter is shown in Figure 21. For 
active-LOW outputs, the Boolean equations can be derived 
directly from the truth table and optimized using Karnaugh maps 








The equation can be reduced to the following: 


/Q3 := /Q3 * /Q2 





or the PALASM 2 software minimizer. + /Q3 * /Q1 
+ /Q2 * QO 
The Boolean equation for Q3 is: + 93 * Ql 
+ Q3 * Q2 
/Q3 = /Q3 * /Q2 * /Q1l * /Q0 
+ /Q3 * /Q2 * /Q1l * QO Similar Boolean equations can be generated for Q2, Q1 and QO. 
+ /Q3 * /Q2 * Q1 * /Q0 Figure 22 shows the circuit diagram of a loadable dual BCD 
+ /Q3 * /Q2 * Ql * QO counter implemented in a PAL16R8. Figure 23 shows the design 
+ /Q3 * Q2 * /Ql * /Q0 file for such a counter. 
+ /Q3 * Q2 * /Ql * QO 
+ /Q3 * Q2* Ql * /Q0 
+ Q3 * /Q2 * /Q1 * QO 
+ Q3 * /Q2 * Ql * /Q0 
+ Q3 * /Q2 * Ql * Q0d 
+ Q3 * Q2 * /Q1l * /Q0 
+ Q3 * Q2 * /Ql * QO 
+ Q3 * Q2 * Q1 * /Q0 
+ Q3 * Q2 * Q1 * QO 
406 07 
CEB 
(OB 
a = 
: INPUT Fd 
Da BUFFERS 
a Peale 
CEA 406 08 
LDA 
CLOCK 
Figure 22. Circuit of a Duai BCD Counter 
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TITLE DUAL BCD COUNTER. 
PATTERN 00 

REVISION ol. 

AUTHOR CHRIS JAY. 

COMPANY MMI SANTA CLARA, CA. 


DATE 17 JULY 1987 
CHIP BCD PAL16R8 


;THE PAL16R8 HAS BEEN PROGRAMMED AS A DUAL BINARY 
;CODED DECIMAL COUNTER. REGISTERS MAY BE LOADED FROM 
;DATA INPUTS DO ~ D3, THE LDA ENABLES DATA ON 

7DO ~ D3 TO BE SYNCHRONOUSLY LOADED INTO Q4 - Q7. 
;LDB ENABLES DATA ON DO - D3 TO BE LOADED INTO 
+REGISTERS QO - Q3. TWO COUNT ENABLE INPUTS CEA 
;AND CEB ENABLE A BINARY CODED DECIMAL COUNT IN 
;BOTH COUNTERS. THE COUNTERS RESET IF POWERED UP 
;IN A HEXADECIMAL STATE, SO IF USED IN A STAND 
;ALONE MODE THE COUNTERS WILL NOT LOCK UP INTO 
7AN INVALID STATE. 


, 


;PIN 1 2 3 4 5 
CLK /LDA CEA DO Dl 
;PIN 6 7 8 9 10 
D2 D3 CEB /LDB GND 
;PIN 11 12 13 14 15 
/OE Qo Ql Q2 Q3 
;PIN 16 17 18 19 20 
Q4 Q5 Q6 Q7 vcc 
STRING RESA *Q5*Q7 + Q6*Q7' ;RESET FROM 
STRING RESB 'Q1*Q3 + Q2*Q3' ; STATES 
EQUATIONS ; 
/Q0 := Q0*/LDB*CEB ;COUNT EQUATION 
+ RESB ;RESET FROM ILLEGAL 
+ /DO*LDB*/CEB ;LOAD DO 
+ /Q0*/CEB*/LDB ;HOLD QO 
/Ql := Q3*/LDB*CEB ;COUNT EQUATION 
+ Q0*Q1*/LDB*CEB ; 
+ /Q0*/ Q1*/LDB*CEB : 
+ RESB ;RESET FROM ILLEGAL 
+ /D1*LDB*/CEB ;LOAD Dil 
+ /Q1*/CEB*/LDB ;HOLD Q1 
/Q2 := Q3*/LDB*CEB ;COUNT EQUATION 
+ /Q0*/Q2* /LDB*CEB ; 
+ /Q1*/Q2*/LDB*CEB ; 
+ Q0* Q1*Q2*/LDB*CEB ; 
+ RESB ;RESET FROM ILLEGAL 
+ /D2*LDB*/CEB ;LOAD D2 
+ /Q2* /CEB* /LDB ;HOLD Q2 
/Q3 15 /Q3*/Q0*/LDB*CEB ;COUNT EQUATION 
+ Q0*/Q2*/LDB*CEB : 
+ /Q1*/Q3*/LDB*CEB ; 
+ RESB ;RESET FROM ILLEGAL 
+ /D3*LDB*/CEB ;LOAD D3 
+ /Q3*/CEB*/LDB ;HOLD Q3 
/Q4 := Q4*/LDA*CEA ;COUNT EQUATION 
+ RESA ;RESET FROM ILLEGAL 
+ /DO*LDA*/CEA ;LOAD DO 
+ /Q4*/CEA*/LDA ;HOLD Q4 


’ 


Figure 23. Design File for a Dual BCD Counter 
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/Q5 Q7*/LDA*CEA 
Q4*Q5*/LDA*CEA 
/Q4*/Q5*/LDA*CEA 
RESA 

/D1*LDA* /CEA 
/Q5*/CEA*/LDA 


++ eesti 


/Q6 Q7*/LDA*CEA 
/0Q4*/Q6*/LDA*CEA 
/Q5*/Q6* /LDA*CEA 
Q4*Q5*Q6*/LDA*CEA 
RESA 

/D2*LDA* /CEA 
/Q6*/CEA* /LDA 


++e+eeesti 


/Q7 /Q7*/Q4* /LDA*CEA 
Q4*/Q6*/LDA*CEA 
/Q5*/Q7*/LDA*CEA 
RESA 
/D3*LDA*/CEA 
/Q7*/CEA* /LDA 


+ ee Se itt 


SIMULATION 

TRACE_ON CLK Q4 Q5 Q6 Q7 
Q0 Ql Q2 Q3 LDA 
LDB DO Dl D2 D3 
CEA CEB OE 

:PRLDF QO Ql Q2 Q3 Q4 Q5 Q6 Q7 

SETP /CLK OE 

CLOCKF CLK 

SETF /CLK OE DO Dl D2 /D3 

SETF LDA LDB /CEA /CEB 

CLOCKF CLK 

SETF /LDB /LDA 

CLOCKF CLK 

SETF CEA /LDA 


FOR I := 1 T0 10 DO 
BEGIN CLOCKF CLK 
END 

SETP /CEA CEB /LDB 
FOR I := 1 T0 10 DO 
BEGIN CLOCKF CLK 
END 


SETF /CEB LDB /DO Dl /D2 D3 
CLOCKF CLK 

SETF /LDB 

CLOCKF CLK 

SETF LDB DO Dl /D2 D3 
CLOCKF CLK 

SETF /LDB 

CLOCKF CLK 

SETF LDB /DO /Dl D2 D3 
CLOCKF CLK 

SETF /LDB 

CLOCKF CLK 

SETF LDB DO /D1 D2 D3 
CLOCKF CLK 

SETF /LDB 

CLOCKF CLK 

SETF LDB /DO Dl D2 D3 
CLOCKF CLK 

SETF /LDB 

CLOCKF CLK 

SETF LDB DO Dl D2 D3 
CLOCKF CLK 

SETF /LDB 

CLOCKF CLK 

TRACE OFF 


;COUNT EQUATION 
;RESET FROM ILLEGAL STATE 


;LOAD D1 
;HOLD Q5 


;COUNT EQUATION 
;RESET FROM ILLEGAL STATE 


;LOAD D2 
;HOLD Q6 


;COUNT EQUATION 


. 
ry 


;RESET FROM ILLEGAL STATE 
;LOAD D3 
;HOLD Q7 


’ 


;TRACE ALL SIGNALS 


: 
’ 


se se Ne se te 


;SET DATA INPUTS SELECT 
;REGISTERS QO - Q3 PERFORM 
;SYNCHRONOUS LOAD. 

;SELECT REGISTERS Q4 ~ Q7 
;PERFORM SYNCHRONOUS LOAD. 
;ENABLE Q4 - Q7 AS A BCD 


;COUNTER, CLOCK FOR 10 
;CLOCK CYCLES 


; 

;ENABLE QO - Q3 AS A BCD 
;COUNTER, CLOCK FOR 10 
;CYCLES 


; 
;LOAD ILLEGAL STATES 
;TO TEST COUNTER RESET 


: 


, 
' 
, 
, 
e 
, 
' 
’ 
: 
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Figure 23. Design File for a Dual BCD Counter (Cont’d.) 
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Modulo Counter Device Selection Considerations 


We have illustrated a counter which counts from zero to a fixed 
modulus. The same technique can be applied for acounter which 
counts down from a maximum power-of-two number to a fixed 
modulus, or even a counter which counts from one modulus to 
another. The important considerations will be the number of 
product terms used. 


The registered PLDs used for modulo counters are similar to the 
ones selected for other counters. Since the counts used are 
binary, devices with J-K or T-type flip-flops or XOR gates will help 
optimize the number of product terms used. The product term 
usage also depends upon the modulus selected. Generally a 
power-of-two or a multiple-of-two modulus will require fewer 
product terms. 


Another factor for flip-flop selection is the illegal states. D-type 
flip-flops are generally better suited for illegal state recovery than 
the J-K or T-type flip-flops. This is because when no product term 
is asserted, the D-type flip-flops rasetto zero. Designers using J- 
K or T-type flip-flops must design-in illegal state recovery. 


Certain devices allow the use of a synchronous RESET product 
term for modulo counters. The idea is to use a minimal number 
of product terms to build a binary counter which counts up to a 
power-of-two number. However, this counter is RESET to zero 
using the synchronous RESET product term when the desired 
modulusis reached. It then begins counting afresh from zero, and 
the procedure is repeated. Similar operation can also be 
achieved with a synchronous PRESET product term for a down 
counter. Figure 24 shows a table of all the PLDs that provide the 
RESET and PRESET product terms. 






































DEVICE TOTAL DEDICATED DEDICATED vO LOGICAL SPEED GRADES 
NUMBER INPUT REGISTERED PINS PRODUCT FREQUENCY IN 
OF PINS. PINS OUTPUT TERMS/ MHz 
PINS OUTPUT fs 
a A EC I SS 
TTL 
16RA8 20 
2388 33.3, 28.5 
20RA10 33, 20 
22RX8 28.5 
22V10 40, 28.5, 18 
32VX10 25, 22.2 
eae | 
CMOS 
C29M16 24 20, 15 
C29MA16 24 20, 15 
ECL 
10H20EV/EG8 24 12 0-8 8-0 8-12 125 
10020EV/EG8 24 12. 0-8 8-0 8-12 125 
Figure 24. PLDs Which Provide RESET or PRESET Product Terms 
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Using synchronous RESET and PRESET product terms allows 
the counter to recover from illegal states. Notice that the logic 
product terms in the counter are designed for a complete binary 
count. If the counter powers up in any illegal state (as shown in 
Figure 25), it will continue the count until the terminal count and 
return to zero, where the correct modulo count will begin. This 
illegal state recovery will take an unpredictable number of clock 
cycles, and you may wish to design a more systematic recovery 
system. 


Cascading Modulo Counters 


For large modulo counters, the technique of generating Boolean 
equations from the truth tables is very tedious and time consum- 
ing. Another approach for designing modulo counters is to divide 
itinto two smaller modulo counters. In addition to simplifying the 
design, this approach usually helps optimize the number of 
product terms, 





SYNCHRONOUS 
RESET PRODUCT 
TERM ASSERTED 


As an example, a modulo-360 counter can be directly imple- 
mented with nine register bits. However, instead of implementing 
this as a straight 9-bit counter, we can implement this as two 
counters: one four-bit counter (counting from zero to 14) and 
another five-bit counter (counting from zero to 23). Together the 
two counters count up to 360. Shown in Figure 26 the terminal 
count output MOUT is asserted when the count reaches 360. 


The design requires nine inputs, nine outputs, one clock pin, one 
LOAD pin, one RESET and one MOUT (module output signal) 
pin. Note that no extra flip-flops or pins were needed. Obviously, 
the count values of this counter are not the same as a straight 
modulo-360 counter. Actually, this is what contributes to the 
optimization of the number of product terms used. The PLPL 
design file of this counter is shown in Figure 27. 


ILLEGAL STATES 


Figure 25. ABCD Counter Using Synchronous RESET Product Term 


Q24 (4) (3) (2) (1) (0) 


A (8:4) —+ 
pe = MODULO 24 
* § BITS 





406 09 
Q15 (3) (2) (1) (0) 
M_OUT 
MODULO 15 
4 BITS 
406 1 


Figure 26. A Modulo-360 Counter In PAL22V10 
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DEVICE MODULO_360 COUNTER (PAL22V10) 


PIN CLK = 1 RST = 2 

LOAD = 3 

A[8:0] = 4:11,13 

Q24[4:0] = 18:14 

Q15({3:0] = 22:19 

M_OUT = 23; “ASSERTED WHEN COUNT REACHES 360" 
BEGIN 

IF (RST) THEN ARESET() ; "COUNTER INITIALIZATION" 


IF (LOAD) THEN BEGIN 
Q24[4:0] := A[8:4]) ; “INITIALIZATION COUNT" 
Q15{3:0] := A[3:0] ; “INITIALIZATION COUNT" 
END; 
ELSE BEGIN 
IF (Q24[4]*/Q24[3]* Q24[2]* Q24[1]* Q24[0)* 
Q15(3)* Q15{2)* Q15[1}*/Q15[0})) THEN M OUT = 1; 
CASE (Q24{4:0]) BEGIN "MODULO 24" 
#B00000) BEGIN 
Q15{3:0}] := Q15[3:0} ; 
Q24(4:0] := #B00001 ; 
END ; 


#B00001) BEGIN 





Q15[3:0]) := Q15[3:0] ; 
Q24[4:0] := #B00010 ; 
END ; 
#B00010) BEGIN 
Q15[3:0] := Q15{3:0) ; 
Q24[4:0] := #B00011 ; 
END ; 
#B00011) BEGIN 
Q15(3:0] := Q15[3:0] ; 
Q24[4:0] := #B00100 ; 
END ; 
#B00100) BEGIN 
Q15[3:0] := Q15[3:0) ; 
Q24[4:0] := #B00101 ; 
END ; 
#B00101) BEGIN 
Q15[3:0] := Q15{3:0) ; 
Q24[4:0) := #B00110 ; 
END ; 
#B00110) BEGIN 
Q15[3:0) := Q15[3:0) ; 
Q24[{4:0] := #B00111 ; 
END ; 
#B00111) BEGIN 
Q15[3:0) := Q15[3:0] ; 
Q24[4:0) := #B01000 ; 
END ; 
#B01000) BEGIN 
Q15[3:0] := Q15[3:0] ; 
Q24[4:0] := #B01001 ; 
END ; 
#B01001) BEGIN 
Q15[3:0] := Q15[3:0] ; 
Q24[4:0] := #B01010 ; 
END ; 
#B01010) BEGIN 
Q15[3:0] := Q15[3:0) ; 
Q24{4:0] := #B01011 ; 
END ; 
#B01011) BEGIN 
Q15[3:0] s= Q15[3:0] ; 
Q24[4:0] := #B01100 ; 


END ; 
Figure 27. Design File of a Modulo-360 Counter 
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#B01100) BEGIN 
Q15[3:0]} := Q15[3:0] 
Q24[(4:0}] := #B01101 ; 


END ; 

#801101) BEGIN 
Q15[3:0] := Q15{3:0] ; 
Q24[4:0) := #B01110 ; 


END ; 


#801110) BEGIN 
Q15[3:0] 
Q24[4:0} 


Q15{3:0] ; 
#B01111 ; 


not 


END ; 


#B01111) BEGIN 


Q15[3:0] := Q15[3:0] ; 
Q24[4:0] := #B10000 ; 
END ; 
#B10000) BEGIN 
Q15{3:0) := Q15[3:0] ; 
Q24[4:0) s= #B10001 ; 


END ; 


¥B10001) BEGIN 


Q15[{3:0] := Q15[3:0] ; 
Q24[4:0] := #B10010 ; 
END ; 
#B10010) BEGIN 
Q15(3:0) := Q15[3:0] ; 
Q24[4:0] := #B10011 ; 


END ; 


#B10011) BEGIN 
Q15(3:0]) := Q15[3:0) ; 
Q24[4:0}] := #B10100 ; 
END ; 


#B10100) BEGIN 
Q15(3:0]) := Q15[3:0) ; 
Q24[4:0] := #B10101 ; 
END ; 


#B10101) BEGIN 


Q15(3:0) := Q15{3:0] ; 
Q24[4:0]) := #B10110 ; 
END ; 
#B10110) BEGIN 
Q15[(3:0] := Q15{3:0) ; 
Q24(4:0) := #B10111 ; 


END ; 


#B10111) BEGIN 
Q24[3:0] := #B00000 ; 
CASE (Q15[{3:0] BEGIN "MODULO 15" 
} := 


#B0000) Q15[3:0 = #B0001 
#B0001) Q15[3:0]) := #B0010 
#30010) Q15{3:0)] := #B0011 
#B0011) Q15(3:0) := #B0100 
#B0100) Q15{3:0] := #B0101 
#B0101) Q15[3:0]) := #B0110 
#B0110) Q15[{3:0) := #B0111 
#80111) Q15[3:0]) := #B1000 
#B1000) Q15(3:0) := #B1001 
#B1001) Q15[3:0] := #B1010 
#B1010) Q15{3:0) := #B1011 
#B1011) Q15[3:0] := #B1100 
#B1100) Q15[3:0) := #B1l111 
#B1101) Q15[3:0] := #B1110 
#B1110) Q15[3:0]) := #B0000 
END ; 
END ; 
END ; 


END ; 
END. 


Figure 27. Design File of a Modulo-360 Counter (Cont'’d.) 
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TEST VECTORS 


IN CLK RST LOAD A[8:0]) ; 





OUT Q24[4:0] Q15[3:0] M_ouT ; 
BEGIN 
"CLK RST LOAD A[8:0] Q24[{4:0] Q15[3:0]) M_OUT" 
X 1 X X X X XX X X X X LULLUL LLLUUL L; 
c 0 1 101011100 HLHU4HSH HHLL L; 
c 0 0 X X X X X¥ KX X XX HLHHL HHLUL L; 
Cc 0 0 X X X X X X KX K X HLHHH HHLL L; 
c 0 0 X X X X XX X XX LLLULUtL H HLH L; 
c 0 1 012111110 LHHH4H HHH L L; 
c 0 0 X X X X X X KX X X HLLULUL HHH L L; 
c 0 0 X X X X X X X KX X HLLULH HHHUL L; 
Cc 0 0 X X X XX X X X X HLLUHL HHHL L; 
Cc 0 0 X X X X X XX X X HLLHH HHHL L; 
c 0 0 X XX X XX X XX HUHLL HHHL L; 
Cc 0 0 X X X X X X X X X HLHLH HH HL L; 
c 0 0 X X X X X X X X X HLHHL H HHL L; 
c 0 0 X X X X X X X X X HLHHH HHHL H; 
Cc 0 0 X XX X X¥ XX XX LLLLUuL LULL L; 
c 0 0 X X X X XX XX X LLLUH LLLUL L; 
Cc 0 0 X X X X X X KX X X LULULHL LLLUL L; 
END. 


Figure 27. Design File of a Modulo-360 Counter (Cont'’d.) 


PLS Devices for Modulo Counters 


Monolithic Memories offers afamily of PLS devices. PLS devices also provide a complement array term which is very useful for 
provide programmable AND and programmable OR arrays. designing modulo counters. The available PLS devices are listed 
These devices offer S-R type output and buried flip-flops. They in Figure 28. 


DEVICE TOTAL DEDICATED DEDICATED LOGICAL SPEED GRADES 
NUMBER INPUT REGISTERED PRODUCT FREQUENCY IN 
OF PINS OUTPUT TERMS/ MHz 
OUTPUT f 


PLS105 Total 48 
PLS167 Total 48 
PLS168 Total 48 





Figure 28. PLS Devices 
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The complement array is an OR array term, the complement of 
which is fed back as an input to the AND array of the device 
(Figure 29). The complement term can be used for both illegal 
state recovery and programmable modulo counter designs. 


lilegal State Recovery 


For any counter going through a sequence of legal counts 
(states), at least one product term is always active as long as the 
counter never needs to hold a count; such terms determine the 
next legal state. When the counter powers up in any invalid state, 
none of the product terms for deciding the next state will be active. 
When none of the product terms connected to the complement 
array is active, the complement array is asserted. 


The complement array is usually connected to all of the product 
terms of the counter. Thus when the count is legal, the comple- 
ment array output is always LOW (Figure 30). When the counter 
is in an illegal state, the complement array detects it and is 


DEVICE 
INPUTS 


STATE 
(COUNT) 


asserted HIGH. This signal can then be usedto reset the registers 
to some legal state. One advantage of the complement array is 
that it allows recovery to any state predetermined by the designer. 


Programmable Modulus Counter 


The complement array term can instead be used for designing 
programmable modulus counters. Acounter is implemented with 
the device registers. Different product terms are programmed 
with different modulus values. These product terms are under the 
control of input signals. At any one time only one of the product 
terms is kept active. The complement array is connected to all 
such terminal count product terms. Since all of these product 
terms are LOW the complement array is initially HIGH and is used 
as a count enable for the counter. When the count reaches the 
terminal value the complement array term goes LOW disabling 
the normal count product terms. Simultaneously the terminal 
count product term becomes active, resetting the counter to 
begin counting afresh. 






COMPLEMENT 


ye ARRAY TERM 





DEVICE 
OUTPUTS 


406 11 


Figure 29. The Architecture of the Complement Array Term 
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DEVICE 
INPUTS 


HIGH WHEN IN AN 
ILLEGAL STATE 


HIGH WHEN IN 
LEGAL STATE 





RECOVERY PRODUCT 


LEGAL 
STATE 
TERM WHICH RESETS 
PRODUCT 
TERMS OUTPUT/STATE REGISTERS 
DEVICE 


OUTPUTS 





STATE 
REGISTERS 













406 12 


DEVICE 
INPUTS 


NORMAL COUNT 
PRODUCT TERMS 


COUNT ENABLE GOES LOW 
WHEN MODULUS IS REACHED 


LOW WHEN COUNTING, 
HIGH AT MODULUS 





PRODUCT TERMS 
WITH DIFFERENT 
COUNT VALUES 


PROGRAMMED 
DEVICE 


OUTPUTS 





REGISTERS 


06 13 
Figure 31. Programmable Modulo Counter Using Complement Array Term 
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Counters with Encoding 


Until now we have discussed counters which generate binary 
output sequences, Most peripherals require a predetermined 
sequence of control signals. Custom control sequences can be 
generated by decoding the binary sequence with combinatorial 
logic. Figure 32 shows a general model of a counter with 


CONTROL 
INPUTS 


DECODER 





OUTPUT 
-———} DECODER 





combinatorial output decoding circuitry. This combinatorial circuit 
modifies the counter bits and generates output signals in the 
manner required for peripheral timing and control. Since these 
circuits require extra combinatorial logic, they are not very effi- 
cient. They are also more susceptible to hazards and output 
glitches. 


COMBINATORIAL 
LOGIC ee 
js 


OUTPUTS 





406 14 


Figure 32. Counter with an Output Decoder 


It is possible to have a different output coding for a four-bit 
counter, as shown in Figure 33. This code, called Gray code, 
allows only one output bit to toggle for each newcount value. This 


BINARY CODE GRAY CODE 


X3 X2 X1~ XO G3 G2 G1 G0 




















Ase oe eV V4 QODODADACOAOO 
aA ewe pe oOoOoOooHfW-K =a AaB COCO CO 
-m~ oO Of-f]/00+-++-00-=300 
-"o+-'o-0+-0+/0-;0+0+0 
“a osteo or ort I OOO ODOOOOO 
CODOH24 nnn uUuUu D000 
oot-—4—s--0000223++00 
owes oo-t+-00++;00-+-0 





Figure 33. Generating Gray Code from a Binary Code 


COMBINATORIAL 
LOGIC 


CONTROL 
INPUTS 





{ 
NEXT 1 OUTPUT 
STATE |! “PRE” 
DECODER ' DECODER 


code can be easily derived from a four-bit binary counter code 
(also shown in Figure 33) using an output decoder. 


We can derive the Boolean equations for the combinatorial output 
decoder from the truth table. The equations are: 


G3 = X3 

G2 = X3 :+: X2 
Gl = X2 :+: X1 
GO = X1 :+: XO 


A more efficient and easier technique for generating control 
signals is to implement the decode circuitry before the registers. 
This alternative is shown in Figure 34. This essentially generates 
a non-standard counter with state values which are not a binary 
progression. Itcan also be considered to be a counter where the 
product terms for a binary count and encoding the outputs have 
been combined. 


Many different codes can be generated using such techniques. 


We will limit ourselves to the ones which are most commonly 
used: Gray-code counters and Johnson counters. 


CLK 






406 15 


Figure 34. Counter with Combined Next State Generation and Output Encoding Circuit 
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Gray-Code Counters The Boolean logic equations for a Gray-code counter are: 

Gray-code counters are often used in digital designs for control X3 r= /X3_* X2 * /X1 * /X0 

timing functions. The primary advantage of Gray-code counters X3 * X2 * /X1 * /XO 

stems from the characteristic that only one output bit changes x3 .* X2 * /X1 * XO 

value for every clock cycle. These output signals can then easily X3 * X2* Xl * XO 

be decoded using a combinatorial decoder without any risk of x3 .* X2* X1 * /XO 

hazards. Gray-code counters are used extensively as system X3 * /X2 * X1 * /XO 

clocks, since the different output bits provide different clock X3_* /X2 * X1* XO 

phases without the risks of hazards. Gray-code is also used in X3 * /X2 * /X1 * XO 
high-speed data communication applications, where data is 

transmitted from one part of the system to another, and where the X2 i= /X3 * /X2 * X1 * /X0 

error susceptibility increases with the number of bit changes /X3.* X2 * Xl * /X0 

between adjacent numbers in asequence. These are also used /k3 * X2* X1* XO 

for such specialized applications as shaft encoders and real-time /X3 * X2 * /X1 * XO 

process control. /X3 * X2 * /X1 * /XO 

X3 * X2 * /X1 * /X0 

The implementation of a Gray-code counter is very simple. A x3 * X2 * /X1 * XO 

truth table can be derived from the transition table as is done for x3 * X2* X1* XO 
a binary counter. The Boolean equations can then be directly 

derived from the truth table. The truth table for the Gray-code X1 i= /X3 * /X2 * /X1 * XO 

counter is shown in Figure 35. /X3 * /X2 * Xl * XO 

/X3 * /X2 * X1 * /X0 

/X3 * X2* X1 * /XO 

PRESENT STATE ee 

X3 X2 x1 XO Xx3 * X2 * X1 * /XO 

X3 * /X2 * X1 * /X0O 

XO := /X3 * /X2 * /X1 * /XO 

/X3 * /X2 * /X1 * XO 

/X3 * X2 * X1 * /X0O 

/X3 * X2 * X1* XO 

X3 * X2 * /X1 * /XO 

X3 * X2 * /X1 * XO 

X3 * /X2 * X1 * /X0 

x3 * /X2 * X1* XO 


0 
0 
0 
0 
0 
0 
0 
0 
1 
1 
1 
1 
1 
1 
1 
1 


(oe koko a a a a a a oo olo) 
cCoHm]=]240000--=-00 
O©C-3=0C03-4A00O=A==-00--0 
O©A424424u44 Un CFC CD0000 
CODDOAABAAuuununi D900 
COOA=44-DdODOAaAaMAO 
CO]-=-00- 0000-4 





Figure 35. Truth Table for a Four-bit Gray-code Counter 
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Johnson Counters 


A Johnson counter is part of a family of counters known as “ring 
counters.” These counters are used for special applications 
where code symmetry is desired. Ring counters are also often 
used for timing purposes, since all of the outputs are essentially 
a series of pulsas. This code symmetry also allows use of the 
fewest possible product terms with a D-type register. Devices 
such as the Logic Cell Array (LCA), which provide a small amount 
of logic per cell, can implement Johnson counters very easily. 


Johnson counters are also known as circular-shift counters. The 
sequence for afive-stage Johnson counter is shown in Figure 36. 
As can be seen in the truth table, the counter first fills up with 1's 
from left to right and then it fills up with zeros again. Note from the 
output sequence that only one of the Johnson counter bits 
changes for every clock period, like the Gray-code counter. One 
major advantage of the Johnson counter is that it can be readily 
decoded with small two-input NAND gates and hence is suitable 
for high-speed applications. 


Note that the five-stage sequence has a table of 10 legal states 
and 22 illegal states (Figure 37). In general, an n-bit Johnson 
counter will produce a modulus of 2n. Figure 38 shows the state 
diagram of the five-bit counter. 


Legal States 


PRESENT STATE NEXT STATE 
Q4 Q3 Q2 Qi Qo Q4 Q3 Q2 Qi a0 






a ooo Molo) 
O---4-4--4ce000 
oofn4n4-s4000 
eooon44440 0 
COD0OA424440 
--4-40C0000- 
—~---=-4 90000 
OfA2644u C000 
oo-4--4--4--4000 
eoooaA444400 








Figure 36. Five-bit Johnson Counter Truth Table 


The implementation of a Johnson counter is relatively straight- 
forward, and is the same regardless of the number of stages. 
When D-type flip-flops are used, the Q output of each flip-flop is 
connected to the D input of the following stage. The single 
exception is the Q output of the last stage, whichis complemented 
and connected to the D input of the first stage. 





IHegal States 


PRESENT STATE NEXT STATE 
Q4 Q3 Q2 Qi Qo Q4 Q3 Q2 Qi Qo 


0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 


=-42-4-900CD0DDCOOH244444F C00 
=~O00]4]24=4000424243 0000-40 
O-=0-2=002==30-00-24=00-00- 
=~32=024240-20-0-0-0-0-00-00 
ooo Ro Ro RoR No ko Ro MoM Ro Ro No Moo Mo —o-RoRon) 
oo Moo Ro Roo No Roo Moo Ro Molo Ro Noone) 
oe Mo No Mo Roe Roo Ro Ro Mo Roo MoM—-Ko Rolo RoRoM-) 
(ooo No Ro Mo Roo Ro Ro Mo Roo Moo Ro Roo R-Rolo N=) 
oo R=Ro Ro Reo Re Ro Ro Ro Mo Ro RoRoNo Ro Ro Rolo No—oN-) 


Figure 37. Illegal States for a Five-bit Johnson Counter 
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Figure 38. State Diagram of a Five-bit Johnson Counter 


One disadvantage of the counter is the number of invalid (or 
illegal) states. The invalid states increase exponentially with the 
length of the counter. The bigger the counter becomes, the 
greater are its chances of entering an illegal state. Johnson 
counters are very susceptible to illegal states, and can “hang up” 
very easily. Noise or improper use can cause this counter to end 
upin anillegalstate. Therefore a design with illegal state recovery 
circuitry is always recommended. 





2-88 


21 mononthic Ht) memories & 


Registered Logic Design 





Figure 39 shows a nine-bit Johnson counter which can be derived and one clock pin. The design incorporates the product terms 
by directly extending the design of afive-bit Johnson counter. The required for illegal state recovery. The PLPL design file for this 
design file adds a LOAD function; it requires nine input pins, nine counter is shown in Figure 40. 


output pins, one RESET pin, one LOAD pin, one invalid status pin 


406 17 





Figure 39. Block Diagram of a Nine-bit Johnson Counter 
DEVICE NINE _BIT JOHNSON COUNTER (AmPAL22V10) 





PIN CLK = 1 
RST = 2 
A[8:0] = 3:11 
Q(8:0) = 14:21,23 
VF = 22 "VALID STATE FLAG" 
LOAD = 13 ; 
BEGIN 
IF (RST) THEN ARESET(Q[8:0)) ; "COUNTER INITIALIZATION" 
IF (LOAD) THEN Q[8:0) := A(8:0] ; “LOAD NEW VALUE" 
ELSE BEGIN 
CASE (Q[8:0)) 
BEGIN 
#B000000000) Q[8:0]) := #B100000000 ; 
#B100000000) Q{8:0] := #B110000000 ; 
#B110000000) Q[{8:0) := #B111000000 ; 
#B111000000) Q[8:0]} := #B111100000 ; 
#B111100000) Q[8:0} := #B111110000 ; 
#B111110000) Q[8:0} := #B111111000 ; 
#B111111000) Q[8:0}] := #B111111100 ; 
#B111111100) Q{8:0) := #B111111110 ; 
#B111111110) Q[8:0] := #B111111111 ; 
#B111111111) Q[8:0] := #B011111111 ; 
#BO11111111) Q(8:0] := #B001121111 ; 
#B001111111) Q[{8:0] := #B000111111 ; 
#B000111111) Q[8:0] := #B000011111 ; 
#B000011111) Q{8:0] := #B000001111 ; 
#B000001111) Q[8:0] := #B000000111 ; 
#B000000111) Q[8:0] := #B000000011 ; 
#B000000011) Q[8:0] := #B000000001 ; 
#B000000001) Q[8:0] := #B000000000 ; 
END ; 
END ; 
CASE (Q[8:0)) 
BEGIN 


#B000000000) VF 
#B100000000) VF 
#B110000000) VF 
#B111000000) VF 
#B111100000) VF 
#B111110000) VF 
#B111111000) VF 
#B111111100) VF 
#B111111110) VF 
#Bl11111111) VF 
#B011111111) VF 
#B001111111) VF 
#B000111111) VF 
#B000011111) VF 
#B000001111) VF 
#B000000111) VF 
#B000000011) VF 
#B000000001) VF 

END ; 


PRP RP RP EPP RPP PRP PRP RPP Ree 
Wh eRe We: peu, Ge: Ne Re ee Se we ER RO ee aie 
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END. 


Figure 40. Design Filo for a Nino-bit Johnson Counter 
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TEST_VECTORS 


IN CLK RST LOAD A[8:0}; 
OUT Q[8:0] VF; 

BEGIN 

w L 

"C RO 

"LSA AAA AA A QQ 
"KTD 876543210 8 7 
X 1X XXX XXX XxX xX LL 
Col il1lid’iijiiadj’jildilil HH 
CoO XXxXXXxXxXxXxX Lu 
CoO XXX xXXXxXxXxX LL 
C00 XxX XxX XxX xX xXx LL 
co1li101010101 H L 
Coo XX xXXXXXXxX LL 
C00 XXXXXxXX XxX HL 
coi 1231111110 HH 
coo. XXXxXxXxXXxX xX H H 
coo XX XX XXX xX xX LH 
col 000000001 LL 
C00 XxXXxXXXX XxX LL 
END. 


Cem m rer oeeae AO 


wo 
Oo 
w 
nN 
HO 
°o 
"m< 


"RESET" 
"LOAD" 
"COUNT" 


=e 


"COUNT" 

"LD INVALID ST" 
"INVALID ST" 
"RECOVERY" 
"LOAD" 

"COUNT" 


remem m eee ee eee 
Cernomm eer oer ooe 
rermmorerere rome 
remnant mee eee 
Crm mm ero mnam oe 
CUM MOOCerR oer ee 
AGE Gaeta oe 


Be se we se Se Ne Ne Ne Be Be 


Figure 40. Design File for a Nine-bit Johnson Counter (Cont’d.) 


Shift Registers 


A Shift Register is a special digital circuit often used as a primary 
building block in digital computer systems. It is closely related to 
a ring counter. Its fundamental usage is for temporary data 
storage and bit-wise data manipulation for advanced arithmetic 
and multiplication operations. Shift registers are also frequently 
used in communications, for converting parallel byte-wide data 
from the microprocessor to a serial data bit-stream for transmis- 
sion. An example of a serial data link controller on page 2-453 
shows such a shift register design. Shift registers are also used 
in graphics systems for serializing parallel data for use by the 
display monitor. Anumber of examples of video shift registers are 
included in the graphics section (page 2-257). 


The fundamental purpose of a shift register (Figure 41) is to shift 
data from one flip-flop to another. There are several types of shift 
registers. They are classified by the way in which incoming data 
is received (parallel or serial), and how outgoing data is transmit- 
ted (parallel or serial). 





CONTROL 
SIGNALS 


LEFT SERIAL DATA 
IN AND OUT (LIRO) 


Qo Qy Az Q3 Q4 As Ag OQ 
PARALLEL DATA OUT 


PARALLEL DATA IN 
Dp Dy Dz D3 Dg Ds Dg D7 


SHIFT REGISTER 


In the following example, we will discuss a simple universal shifter 
which provides both serial and parallel input and output functions. 
Depending upon the control signals 10 and I1, the data is shifted 
from one flip-flop to another in the left or the right direction. These 
inputs also contro} when the new parallel data is loaded onto the 
registers. When shifting left or right, serial data can be received 
and transmitted on serial pins LIRO and RILO. Since the flip-flop 
outputs appear on the output pins at all times, the parallel output 
data is always available. The truth table is shown in Figure 42. 


The Boolean logic equations can be directly derived from the truth 
table, and are shown in the PAL device design file in Figure 43. 


Shift registers can be modified to suit various system design 
requirements. This universal shift register can be used for serial 
in/serial out, parallel in/parallel out, serial in/parallel out and 
parallel in/serial out functions. 


CLOCK 








RIGHT SERIAL DATA 
OUT AND IN (RILO) 


406 18 


Figure 41. A Shift Register Block Diagram 
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;Retain Data 
:Shift Right 
Shift Left 
‘Load Data 





Figure 42. The Truth Table for a Universal Shift Register 


TITLE UNIVERSAL _SHIFT_ REGISTER 
PATTERN SHIFT. PDS 

REVISION 01 

AUTHOR JOE ENGINEER 

COMPANY MONOLITHIC MEMORIES, INC. 
DATE 09/12/87 


CHIP UNIV SHI. PAL20X8 


CLK I0 DO D1 D2 D3 D4 D5 D6 D7 T1 GND 
/OC RILO Q7 Q6 Q5 Q4 Q3 Q2 Ql QO LIRO vcc 


; DESCRIPTION 
;THIS PAL DEVICE IS AN 8-BIT SHIFT REGISTER WITH PARALLEL LOAD AND 


;HOLD CAPABILITY. TWO FUNCTION SELECT INPUTS (I0,I1) PROVIDE ONE OF 
7FOUR OPERATIONS WHICH OCCUR SYNCHRONOUSLY ON THE RISING EDGE OF THE 


;CLOCK (CLK). THESE OPERATIONS ARE: 

' /OC CLK I1 10 D7-DO Q7-Q0 OPERATION 

i H X xX x x Z HI-Z 

; L c L L xX L HOLD 

; L Cc L H xX SR(Q) SHIFT RIGHT 
; L Cc H L xX SL(Q) SHIFT LEFT 
; L Cc H H D D LOAD 


;TWO OR MORE OCTAL SHIFT REGISTERS MAY BE CASCADED TO PROVIDE LARGER 
;SHIFT REGISTERS. RILO AND LIRO ARE LOCATED ON PINS 14 AND 23 
;RESPECTIVELY, WHICH PROVIDES FOR CONVENIENT INTERCONNECTIONS WHEN 
;TWO OR MORE OCTAL SHIFT REGISTERS ARE CASCADED TO IMPLEMENT LARGER 
;SHIFT REGISTERS. 


EQUATIONS 

/Q0 := /11*/10*/Q0 ;HOLD QO 
+ /11* 10*/Q1 ;SHIFT RIGHT 
it: I1*/I0*/LIRO ;SHIFT LEFT 
+  I1l* 10*/D0 ;LOAD DO 

/Q1 := /11*/10*/Q1 ;HOLD Ql 
+ /I1* 10*/Q2 ;SHIFT RIGHT 
rt: I1*/10*/Q0 ;SHIFT LEFT 
+ Ii* I10*/D1 ;LOAD Dl 

/Q2 := /11*/10*/Q2 ;HOLD Q2 
+ /11* 10*/Q3 ;SHIFT RIGHT 
rt: T1*/I0*/Q1 ;SHIFT LEFT 
+ I1* 10*/D2 ;LOAD D2 

/Q3 =  /1T1*/10*/Q3 ;HOLD Q3 
+ /11* 10*/Q4 ;SHIFT RIGHT 
it: I1*/10*/Q2 ;SHIFT LEFT 
+ T1* 10*/D3 ;LOAD D3 

/Q4 := /11*/10*/Q4 ;HOLD Q4 
+ /11* 10*/05 ;SHIFT RIGHT 
zt: I1*/10*/Q3 ;SHIFT LEFT 
+ I1l* 104/D4 ;LOAD D4 

/Q05 := /11*/10*/Q5 ;HOLD Q5 
+ /T1* 10*/Q6 ;SHIFT RIGHT 
z+: I1*/10*/Q4 ;SHIFT LEFT 
+ T1* I0*/D5 ;LOAD DS 


Figure 43. Octal Shift Register Design File 
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/Q6 := /11*/10*/Q6 
/11* 10*/Q7 
T1*/10*/Q5 
I1* I0*/D6 


/11*/10*/Q7 
/11* I10*/RILO 
11*/10*/Q6 
I1* I10*/D7 


++i 


/Q7 : 


++i 


/LIRO = 
LIRO.TRST = 


/RILO = 
RILO.TRST = 


; SIMULATION NOT SHOWN HERE 


;HOLD Q6 
;SHIFT RIGHT 
;SHIFT LEFT 
;LOAD D6 


;HOLD Q7 
;SHIFT RIGHT 
;SHIFT LEFT 
;LOAD D7 


;LEFT IN RIGHT OUT 


;RIGHT IN LEFT OUT 


Figure 43. Octal Shift Register Design File (Cont'd.) 


Barrel Shifters 


In most data processing systems, some form of data shifting or 
rotation is necessary. In typical computer systems, the shifter is 
located at the output of the ALU, and usually requires a single- 
cycle shift and add function (Figure 44). For such applications as 
floating-point arithmetic or string manipulation, ordinary shift 
registers are inefficient, since they require n clock cycles for an n- 
bit shift. 


INPUT DATA BUS 










REGISTER 
FILE 


a 


SHIFT 
DISTANCE 


SHIFTER 





406 19 


OUTPUT DATA BUS 


Figure 44. Typical ALU Architecture 


Aspecialized shift register, called a “barrel shifter,” is used to shift 
(or rotate) data by any number of bits in a single clock cycle. The 
name “barrel shifter” is used because of the circular nature of the 
shift operation. The storage registers on the output of the shifter 
are used in this architecture to pipeline the data operation, 
increasing throughput. The three-state buffer on the output 
registers is also useful for providing an interface to the data bus. 


The design of a barrel shifter proceeds in the same manner as a 
regular shift register. The truth table is drawn, and the Boolean 
equations are then written based upon the truth tables. An eight- 
bit barrel shifter requires at least eight data inputs, eight regis- 
tered data outputs, three control lines to specify the shift distance, 
a clock input and an output enable that controls the three-state 
buffer on the register output. 


Figure 45 shows the block diagram for an eight-bit registered 
barrel shifter, while Figure 46 shows the truth table. The regis- 
tered barrel shifter requires a total of 14 inputs and 8 outputs. This 
function can be easily implemented in an AMPAL22V10. Figure 
47 shows the PLPL design file for this eight-bit barrel shifter 
function. 


D7 Dg Ds Dg Dg Dz Dy Do 


S 
S RST 
y AmPAL22V10 
Se OE 
CLK y 
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Figure 45. Block Diagram of an Eight-bit Barre! Shifter 
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Figure 46. Truth Table for an Elght-bit Barrel Shifter 


DEVICE EIGHT BIT BARREL SHIFTER 
PIN CLK Sry 
D [7:0] = 2:9 /Q{7:0] = 
SEL2 2:13 SELl1 = 
OE = 14 RSY = 
BEGIN 
IF (RST) THEN ARESET(); 
IF (OE) THEN ENABLE(Q[7:0]); 
IF (/SEL2 * /SEL1 * /SELO) THE 
IF (/SEL2 * /SEL1 * SELO) THE 
IF (/SEL2 * SEL1 * /SELO) THE 
IF (/SEL2 * SEL1 * SELO) THE 
IF ( SEL2 * /SEL1 * /SELO) THE 
IF ( SEL2 * /SEL1 * SELO) THE 
IF ( SEL2 * SEL1 * /SELO) THE 
IF ( SEL2 * SEL1 * SELO) THE 
END. 


"FUNCTION TABLE SPECIFICATION" 


TEST _VECTORS 


IN CLK RST D[7:0] SEL2 SEL1 SELO; 


I_O OE; 

OUT Q(7:0]; 

BEGIN 

"CLK RST D[7:0] SEL2 SEL1 
Cc 1 00000000 1 0 
Cc 0 11111111 0 1 
Cc 1 XXXXXXXX xX x 
c 0 XXXXXXXX xX 4 
c 0 00000000 0 0 
c 0 11111111 0 0 
c 0 01111111 1 0 
Cc 0 01111111 0 1 
Cc 0 01111111 1 1 
END 


SE 


ooocox x OF 


(PAL22V10) 
23:16 
11 SELO = 10 
15; 


"SHIFTER INITIALIZATION" 
“ENABLE OUTPUTS" 


N Q(7:0) := D[7:0); 
N Q[7:0]) := D[6:0], D7]; 
N Q(7:0) := D[5:0), D[7:6]; 
N QO(7:0)] := D[4:0], D(7:5]; 
N Q{7:0] := D({3:0], D[7:4); 
N Q(7:0)] := D[2:0], D[7:3]; 
N Q(7:0] := D{1:0], D[7:2]; 
N Q(7:0)] := D[O), D[7:1]; 
LO OE Q[7:0} " 

0 22222222; 

1 HHHHHHHH; 

1 LLLLLLLL; 

0 ZZZZZ222; 

1 LLLLLLLL; 

L HHHHHHHH; 

1 HHHHLHHH; 

Al HHHHHHLH; 

1 HHLHHHHH; 


Figure 47. Design File for an Eight-bit Barrel Shifter 


Gray-code, Johnson Counter and Shift Register Device 
Selection Considerations 


Gray-code counters, Johnson counters and shift registers are not 
very logic-intensive; the number of product terms required is 
minimal. The D-type flip-flops provide the most efficient imple- 
mentations, allowing these designs to be easily implemented in 
most PAL devices. 


Since Gray-code counters are often used as system clocks, very 
high speed PAL devices provide the highest resolution clocks. 


Barrel shifters are very logic-intensive and require many product 
terms, since data from allthe inputs needs to be accessible at any 
output. Registered PLDs with a large number of product terms 
are ideal for barrel shifters. Large barrel shifters can also be 
partitioned into a number of PLDs. 
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Asynchronous Registered Designs 


Until now we have discussed strictly synchronous registered 
designs, where a common system clock is used. In asynchro- 
nous registered designs, a common clock is not used. The 
register clock may be generated by the output of another register, 
orby alogical combination of various other signals. Such designs 
are usually slow for such applications as timing generation, 
because when the output of one register is used to clock another, 
multiple delays are encountered before all the register outputs 
stabilize. On the other hand, designs can be very fast for 
asynchronous applications such as bus arbitration and control, 
where a fast response to a bus signal can be provided without 
waiting for a common system clock. 


Although asynchronous designs are easier to visualize, they 
present larger problems in implementation. Combinatorial haz- 


RESET 


ard conditions can cause false clocking of registers, destroying 
the logic intended by the designer. The designer also needs to 
worry about race conditions when clocking a number of registers 
simultaneously. Careful design analysis is strongly recom- 
mended before implementing any asynchronous design. 


Ripple counters are probably the easiest examples of such 
asynchronous designs. Figure 48 shows the logic diagram of a 
five-bit binary ripple counter. These counters clearly have the 
advantage of design simplicity. The output from one stage is fed 
as the clock to the next stage. However, this results in a slower 
counting rate, since the clock signals need to propagate through 
all five registers before the next count is reached. 


The ripple counter design illustrated below is implemented in a 
PAL20RA10. Figure 49 shows the design file for this counter. 


CK SET 





EXTRA CIRCUIT REQUIRED 
FOR MODULO 20 COUNTER 


Figure 48. A Five-bit Ripple Counter 406 21 


TITLE 
PATTERN Upcount.pds 
REVISION 01 

AUTHOR Joe Engineer 


Dual 5-Bit Counter 


COMPANY Monolithic Memories, Inc. 


DATE 11/12/87 


CHIP Upcount PAL20RA10 


PL CKA CKB ENA ENB NC NC NC NC NC_ INIT GND 
OE QB4 QB3 QB2 QB1 OBO OA4 OA3 QA2 QAl QAD VCC 


EQUATIONS 
; The First Counter 


/QA4 := QA4 * /ENA ; Toggle if lower MSB 

is zero 
+ /QA4 * ENA ; Hold count 

QA4 .CLKEF = /QA3 3 Toggle 

QA4.RSTF = INIT ; Initialize 

QA4.TRST = VCC 

/QA3 := QA3 * /ENA ; Toggle when QA2 zero 
+ /QA3 * ENA ; Hold count 

QA3.CLKF = /QA2 3; Toggle 

QA3.RSTF = INIT ; Initialize 

QA3.TRST = Vcc 


Figure 49. Design File for a Five-bit Ripple Counter 
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/QA2 := QA2 * /ENA ; Toggle when QAl zero 
+ /QA2 * ENA ; Hold count 

QA2.CLEF = /QAl 3; Toggle 

QA2.RSTF = INIT ; Initialize 

QA2.TRST = vcc 

/QA1 := QAl * /ENA ; Toggle when QAO zero 
+ /QAl * ENA ; Hold count 

QA1.CLEF = /QA0 ; Toggle 

QA1.RSTF = INIT ; Initialize 

QAl1.TRST = VCC 

/QA0 := QAO * /ENA ; Toggle LSB 
+ /QAO * ENA ; Hold count 

QAO.CLKF = CKA 7 Toggle 

QAO.RSTF = INIT 7 Initialize 

QA0.TRST = VCC 


; The Second Counter 





/QB4 := QB4 * /ENB ; Toggle if lower MSB is zero 
+ /QB4 * ENB ; Hold count 

QB4.CLKF = /0B3 # Toggle 

QB4.RSTF = INIT ; Initialize 

QB4.TRST = VCC 

/QB3 := QB3 * /ENB ; Toggle when QB2 zero 
+ /0B3 * ENB ; Hold count 

QB3.CLKF = /QB2 3 Toggle 

QB3.RSTF = INIT ; Initialize 

QB3.TRST = VCC 

/QB2 := QB2 * /ENB ; Toggle when QB1 zero 
+ /QB2 * ENB ; Hold count 

QB2.CLKF = /QB1 3; Toggle 

QB2.RSTF = INIT ; Initialize 

QB2.TRST = VCC 

/QB1 := QB1 * /ENB ; Toggle when QBO zero 
+ /QB1 * ENB”) ; Hold count 

QB1.CLKF = /QB0 ; Toggle 

QB1.RSTF = INIT ; Initialize 

QB1.TRST = VCC 

/QB0 := QB0 * /ENB ; Toggle LSB 
+ /QB0 * ENB) ; Hold count 

QBO.CLKF = CKB ; Toggle 

QBO.RSTF = INIT ; Initialize 

QBO.TRST = VCC 

SIMULATION 


TRACE_ON /PL QA4 QA3 QA2 QA1 QAO 984 OB3 QB2 QB1 OBO 


SETF PL INIT /CKA /CKB /OE 
SETF INIT CKA CKB 

SETF /INIT /CKA /CKB 

CHECK /QA4 /QA3 /QA2 /QA1 /QAO 
CHECK /QB4 /0QB3 /QB2 /QB1 /QBO 


Figure 49. Design File for a Five-bit Ripple Counter (Cont’d.) 
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SETF CKA CKB 

CHECK /QA4 /QA3 /QA2 /QA1 QAO 
CHECK /QB4 /QB3 /QB2 /QB1 OBO 
SETF /CKA /CKB 


SETF CKA CKB 

CHECK /QA4 /QA3 /QA2 QA1 /QAO 
CHECK /QB4 /QB3 /QB2 QB1 /QBO 
SETF /CKA /CKB 


SETF CKA CKB 

CHECK /QA4 /QA3 /QA2 QA1 QAO 
CHECK /QB4 /QB3 /QB2 QB1l OBO 
SETF /CKA /CKB 


SETF CKA CKB 

CHECK /QA4 /QA3 QA2 /QA1 /QA0 
CHECK /QB4 /QB3 QB2 /QB1 /QBO 
SETF /CKA /CKB 


SETF CKA CKB 

CHECK /QA4 /QA3 QA2 /QA1 QAO 
CHECK /QB4 /QB3 QB2 /QB1l QBO 
SETF /CKA /CKB 


TRACE_OFF 


Figure 49. Design File for a Five-bit Ripple Counter (Cont'd.) 


There are additional independent SET and RESET controls inthe Asynchronous Designs Device Selection Considerations 
PAL20RA10 which can be used to restart the count. If a modulo 

counteris desired, these SET and RESET signals can be used as The device selection for asynchronous designs is easy. As the 
described above. Figure 48 shows the implementation of a clock signals require logic, only PLDs which allowimplementation 
modulo-20 counter which is RESET when output bits Q4 and Q2 of Boolean logic on the clock signals are useful. Figure 50 shows 
are both HIGH. Since the RESET is implemented with a product alist of devices which provide programmable clock signals. They 
term, the extra AND gate shown can be implemented directly also provide programmable SET and RESET product terms, and 
within the PAL device. the capability of bypassing the register. 





















DEDICATED DEDICATED LOGICAL SPEED GRADES 















INPUT REGISTERED PRODUCT FREQUENCY IN 
PINS OUTPUT TERMS/ MHz 
PINS OUTPUT fax 
, 16RA8 20 8 0-8 8-0 20 
20RA10 24 10 0-10 10-0 33, 20 
CMOS 


Figure 50. Programmable Logic Devices for Asynchronous Designs 
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Other Applications of 
Registered PLDs 


Registered PLDs are used for a number of miscellaneous appli- 
cations which are not covered by the synchronous and asynchro- 
nous design applications discussed up to now. Inthe subsequent 
discussion we will cover a few of these applications: 


* Frequency dividers 
* Addressable registers 


Frequency Dividers 


Standard synchronous counters provide the basic capability of 
dividing an input frequency. Asingle register of a PAL device will 
let us divide by two. If we stack these registers, a binary counter 
provides symmetrical division by 2,4,8,16, etc. This divider has 
been a standard for years, and the PAL device has always been 
an excellent choice for such applications. 


One unique application of PAL devices is for dividing input 
frequencies by odd numbers. This has been done historically by 
designing a counter which cycles modulo an odd number, and 
decoding the specific states of the counter. The disadvantage of 
this approach is that the output is not symmetrical and the duty 
cycle is not 50%. 


Let’s examine a simple divide-by-five counter. This counter can 
be implemented using three flip-flops that start at zero and reset 
at four, resulting in a five-state counter. The table in Figure 51 
shows the outputs of the three individual flip-flops. 


Present State Next Stata 
Q2 Qi Qo Q2 Qi Qo 


State zero to one. 
State one to two. 

State two to three. 
State three to four. 
State four to Zero. 

























o-0O000 
oo+-+-0 
oo-0oO-- 





Figure 51. Truth Table for a Five-bit Counter 


The Boolean equations are: 


Q2 := /Q2 * Ql * QO ;MSB bit 
Ql := /Q1 * Q0 + Ql * /Q0 
QO := /@2 * /Q0 7; LSB bit 


The waveforms for this divider are shownin Figure 52. Notice that 
the Q2 output goes HIGH for one state and that this output is one- 
fifth of the input frequency, but it is a 20% duty cycle. Q1 is active 
for two states; it provides the same frequency, but with a 40% duty 
cycle. If we want a 50% duty cycle we are going to have to divide 
a State in half. 


To provide the 50% duty cycle, the two edges should be evenly 
spaced in the count sequence, one edge in the middle of state two 
and one at the beginning of state zero. The first edge can be 
formed by logically “ANDing” state_2 with the falling edge of the 
clock. The second edge can be formed by decoding state zero. 
edge_1 = /clock * /Q2 * Q1 * /Q0 ;edge between 
jstates two and 
;three 

edge 2 = /Q2 * /Q1 * /Q0 edge at state 
7 Zero 


The logical “OR” of these two equations will provide the needed 
rising edges. To provide a clean output, this signal should clock 
another output register. 


The next step in the design is to pick the appropriate PAL device 
to fit this design. Our biggest concern is that we need the 
capability of clocking the counter at one speed and the output flip- 
flop at another. To do this, we cannot use a PAL device that has 
a dedicated clock pin; we need an architecture that allows pro- 
grammable clocks. The PAL20RA10, PAL16RA8 and 
AmPALC29MA16 have this feature. This design uses the 
PAL20RA10 device. 


Since the clock signal requires two product terms (one for each 
edge), and the PAL20RA10 clock signal has only one product 
term, this implementation is not possible. Another technique is to 
use the independent asynchronous SET and asynchronous 
RESET product terms of the PAL20RA10 output register. AHIGH 
on the SET product term asserts the register output, and a HIGH 
onthe RESET product term unasserts the register output. Due to 
the asynchronous nature of the product terms some adjustment 
in timing is required. The SET product term is asserted when in 
state 0 (Q2=0, Q1=0 and Q0=0), and the RESET product term is 
asserted when between states two and three. 


OUTPUT.SET = /clock * /Q2 * Q1 * /Q0 ;set between 
sstates 2 & 3 
;reset at 
;state zero 


OUTPUT.RESET = /Q2 * /Q1 * /Q0 
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STATE 0, STATE 1, STATE 2, STATE 3, STATE 4, STATE 0, STATE 1 





Qo 


Q, 


Q2 


OUTPUT 





SET 


RESET 


406 22 


Figure 52. Waveform for a Frequency Divider 
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TITLE DIVIDE BY FIVE EXAMPLE 
PATTERN DIVBY5.PDS 

REVISION A 

AUTHOR BARRY SEIDNER 

COMPANY MONOLITHIC MEMORIES INC., SANTA CLARA, CA 
DATE 7/25/1987 


CHIP DIVIDER PAL20RA10 


PL CLK 13 14 I5 16 17 I8 19 110 111 GND 
OE Q14 Q15 Q16 Q17 Q18 Q19 Q20 /Q21 /Q22 /Q23 vCcc 


EQUATIONS 
;The following equations delineate the three bit counter of our 


;state machine driver. It has a total count of five starting 
;at zero and counting thru four, then resetting to zero. 


Q23:= Q21 * Q22 * /Q23 ;MSB OF STATE MACHINE 
Q23.CLKF=CLK 7CLOCK ON RISING EDGE OF CLK PIN 
Q23.TRST=VCC ;OUTPUT ALWAYS ON 

Q22:=/Q22 * Q21 + Q22 * /Q21 ;BIT 1 OF STATE MACHINE 
Q22.CLKF=CLK ;CLOCK ON RISING EDGE OF CLK PIN 
Q22.TRST=VCC ;OUTPUT ALWAYS ON 

Q21:=/Q21 * /Q23 7;LSB OF STATE MACHINE 
Q21.CLKF=CLK ;CLOCK ON RISING EDGE OF CLK 
Q21.TRST=VCC ;OUTPUT ALWAYS ON 





;The last cell of our divide by five counter is the actual 

sOutput of divider. It divides by setting and resetting at the 
;appropriate time in the count sequence. The output will stay high 
;for count zero, one, and the first half of two-then go low for 
;the second half of two, thru three and four. 


Q20.SETF = /Q23 * Q22 * /Q21 * /CLK 7SET ON COUT THREE AND 1/2 
Q20.RSTF = /Q23 * /Q22 * /Q21 +RESET ON COUNT ZERO 
Q20.CLKF = GND ;CLOCK INPUT NOT USED 

Q20 _ t= GND ;DATA INPUT NOT USED 
Q20.TRST = VCC ;OUTPUT ALWAYS ON 
SIMULATION 


;This simulation will run the divider through 
;several passes of the count to verify operation. 


TRACE ON OE PL CLK Q23 Q22 Q21 Q20 


;Initialization of all inputs. Before the simulator can be 
;executed, all of the inputs to the PAL device should be 
;set to a specific value. This section of simulator 
;declaration sets three input variables inactive. 


SETF PL /OE /CLK ;SET PRELOAD FALSE 
7;SET OUTPUT ENABLE 
;SET CLOCK LOW 


;The PRLDF statement will set the registers to a known 
;state at the beginning of our simulation. This line sets 
;us to state zero. 


PRLDF /Q23 /Q22 /Q21 Q20 

;The FOR loop statement exemplifies an easy way to perform 
;repetitive functions. In this example, the "RA" cell does 
znot have a dedicated clock input so this routine will 
:generate 30 repetitive clocks to drive our design. 


FOR I:=1 TO 30 DO 


BEGIN 
SETF CLK ;SET CLOCK HIGH 
SETF /CLK ;SET CLOCK LOW 
END 


;End of simulation 
TRACE OFF 


Figure 53. Design File for Frequency Divider 
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Addressable Registers 


Addressable registers are commonly-used MSI functions, often 
implemented in PAL devices. Addressable registers are used as 
building blocks for digital computers. Depending upon the ad- 


TITLE 16-BIT ADDRESSABLE REGISTER 

PATTERN ADREG16. PDS 

REVISION A 

AUTHOR John Birkner 

COMPANY Monolithic Memories Inc. Santa Clara, CA 
DATE 2/11/85 


The 16-bit addressable register loads one of 16 registers 
selected by ADDR(0..3) with data input, DATA. 


CHIP ADREG16 PAL32R16 
QO Ql Q2 Q3 /El NC NC AO Al VCC A2 A3 DATA NC /PRLD2 CLK2 


Q4 Q5 Q6 Q7 QB QI Q1O Ql1l /E2 NC NC NC NC GND NC NC NC NC /PRLDL CLK} 
Ql2 Q13 Q14 Q15 


dress input one of the many flip-flops in a register is loaded with 
the input data. All other flip-flops in the register retain their 


previous values. An example of a 16-bit addressable register is 
shown in Figure 54. 


Ql4 := AO *Ql4 shold 
+ fal *Q14 shold 
+ /42 *Qla thold 
+ /A3*Q14 shold 
+ /AO* Al* A2* AJ*DATA tload 

Qis t= /AO *Qis shold 
+ far *Q15 shold 
+ /A2 *Q15s shold 
+ /A3*Q15 thold 
+ AO* Al* A2* AJ*DATA sload 

SIMULATION 


TRACE_ON QO Ql Q2 Q3 Q4 QO5 Q6 Q7 Q3 Q9 QIO Qli Ql2 Ql3 Ql4 Qi5 
AO Al A2 A3 DATA 


SETF El E2 /DATA /PRLD1 /PRLD2 
SETF /AO /Al /A2 /A3 
CLOCKF CLK1 CLK2 


SETF AO /Al /A2 /A3 
CLOCKF CLK1 CLK2 


SETF /AO Al /A2 /A3 
CLOCKF CLK1 CLK2 


SETF AO Al /A2 /A3 
CLOCKF CLK1 CLK2 


SETF /AO /Al A2 /A3 
CLOCKF CLK1 CLK2 


SETF AO /Al A2 /A3 
CLOCKF CLK1 CLK2 


SETF /AO Al A2 /A3 
CLOCKF CLK1 CLK2 


SETF AO Al A2 /A3 
CLOCKF CLK1 CLK2 


SETF /AO /Al /A2 A3 
CLOCKF CLK1 CLK2 


SETF AO /Al /A2 A3 
CLOCKF CLK1 CLK2 


SETF /AO Al /A2 A3 
CLOCKF CLK1 CLK2 


SETF AO Al /A2 A3 
CLOCKF CLK1 CLK2 


SETF /AO /Al A2 A} 
CLOCKF CLK1 CLK2 


SETF AO /Al A2 A3 
CLOCKF CLK1 CLK2 


SETF /AO Al A2 A3 
CLOCKF CLK1 CLK2 


SETF AO Al A2 A} 
CLOCKF CLK CLK2 


SETF DATA 


EQUATIONS 
Qo z= AO *Q0 thold 
+ Al #Q0 thold 
+ a2 *Q0 rhold 
+ A3*Q0 shold 
+ /RO*/A1*/A2*/A3*DATA jload 
Ql := /A0 *Ql shold 
+ Al *Ql rhold 
+ A2 *Ql shold 
+ A3*Q1L shold 
+ AO*/A1L*/A2*/A3*DATA zload@ 
Q2 s= AO #Q2 thold 
+ /AL *Q2 yhold 
+ A2 #Q2 shold 
+ A3*Q2 shold 
+ /AO* Al*/A2*/A3*DATA zload 
Q3 t= /AO *Q3 rhold 
+ fAL *Q3 zyhold 
+ A2 *Q3 shold 
+ A3*Q3 shold 
+ AO* Al*/A2*/A3*DATA zload 
Q4 s= AO *Q4 thold 
+ Al *Q4 shold 
+ fA2 #Q4 thold 
+ A3*Q4 jhold 
+ /AO*/AL*® A2*/A3*DATA zload 
Qs z= /A0 *Q5 shold 
+ Al *Q5 shold 
+ /A2 *Q5 shold 
+ A3*Q5 rhold 
+ AO*/Al1* A2*/A3*DATA sload 
Q6 := AO *Q6 thold 
+ fAlL #Q6 thold 
+ JAZ *Q6 shold 
+> A3*Q6 shold 
+ /AO* Al* A2*/A3*DATA tload 
Q7 := /AO0 *Q7 shold 
+ 7AL *Q7 shold 
+ JA2 *Q7 shold 
+> A3*Q7 shold 
+ AO* Al* A2*/A3*DATA tload 
Qs m= AO *Q8 shold 
+ Al *Q8 rhold 
+ A2 *Q8 shold 
+ /A3*Q8 shold 
+ /A0O*/AL*/A2* AZ*DATA sload 
Qo t= /A0 *Q9 shold 
+ Al *Q9 shold 
+ A2 *Q9 shold 
+ LA3*Q9 vhold 
+ AO*/A1*/A2* AIJ*DATA zload 
Qld t= AO *Q10 shold 
+ Jal *Q10 rhold 
+ A2 *Ql0 shold 
+ SA3*QIO rhold 
+ /AO* Al*/A2* A3*DATA pload 
Qll te /AO *Qll ghold 
+ /Al *Qll shold 
+ A2 *Qll shold 
+ /AI*QIL shold 
+ AO* Al*/A2® AJ*DATA load 
Ql2 t= AO *Ql2 shold 
+ Al *Ql2 shold 
+ fA2 *Ql2 thold 
+ ZAI*Q12 thold 
+ /AO*/A1* A2* AJ*DATA sload 
Qu3 := /A0 *Q13 rhold 
+ Al *Q13 shold 
+ fA2 *Q13 shold 
+ /A3*Q13 thold 
+ AO*/Al*® A2* A3*DATA sload 


SETF /AO /Al /A2 /A3 
CLOCKF CLK1 CLK2 


Page: 12 
g 9 egcegcg cgcgegegeg cgcgegcgeg cgcegcge 
Qo XXXXLLLLLL LLLLLLLLLL LLLLLLLLLL LLLLLLH 
Ql XXXXXXLLLL LLLLLLLLLL LLLLLLLLLL LLLLLLL 
Q2 XXXXXXXXLL LLLLLLLLLL LLLLLLLLLL LLLLLLL 
Q3 XXXXXXXXXX LLLLLLLLLL LLLLLLLLLL LLLLLLL 
a4 XXXXXXXXXX XXLLLLLLLL LLLLLLLLLL LLLLLLL 
Qs XXXXXXXXXX XXXXLLLLLL LLLLLLLLLL LLLLLLL 
Q6 XXXXXXXXXX XXXXXXLLLL LLLLLLLLLL LLLLLLL 
Q7 XXXXXXXXXX XXXXXXXXLL LLLLLLLLLL LLLLLLL 
Q8 XXXXXXXXXX XXXXXXXXXX LLLLLLELLL LLLLLLL 
Qo XXXXXXXXXX XXXXXXXXXX XXLLLLLLLL LLLLLLL 
QlLO XXXXXXXXXX XXXXXXXXXX XXXXLLLLLL LLLLLLL 
QLL  XXXXXXXXXX XXXXXXXXXX XXXXXXLLLL LLLLLLL 
Ql2 XXXXXXXXXX XXXXXXXXXX XXXXXXXXLL LLLLLLL 
Ql3) XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX LLLLLLL 
Ql4  XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXLLLLL 
Ql’5 XXXXXXXXXX XXXXXXXXXK XXXXXXXXXX XXXXLLL 
ao XXLLLHHLLH HLLHHLLHHL LHHLLHHLLH HLLHHLL 
Al XXLLLLLHHH HLLLLHHHHL LLLHHHHLLL LHHHHLL 
A2 XXLLLLLLLL LHHHHHHHHL LULLLLLHHH HHHHHLL 
A3 XXLLLLLLLL LLLLLLLLLH HHHHHHHHHH HHHHHLL 
DATA LLLLLLLLLL LLLLLLLLLL LELLLLLLLL LELLLHH 


Figure 54. A Sixteen-bit Addressable Register Design File 
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Introduction 


State machine designs are widely used for sequential control 
logic, which forms the core of many digital systems. State ma- 
chines are required in a variety of applications covering a broad 
range of performance and complexity; low-level control of micro- 
processor-to-VLSI-peripheral interfaces, bus arbitration and 
timing generation in conventional microprocessors, custom bit- 
slice microprocessors, data encryption and decryption and trans- 
mission protocols are but a few examples. 


Typically, the details of contro! logic are the last to be settled in the 
design cycle, since they are continuously affected by changing 
system requirements and feature enhancements. Program- 
mable logic is a forgiving solution for contro! logic design because 
it allows easy modifications to be made without disturbing PC 
board layout. Its flexibility provides an escape valve that permits 
design changes without impacting time-to-market. 


A majority of registered PAL device applications are sequential 
control designs where state machine design techniques are 
employed. As technology advances, new high-speed and high- 
functionality devices are being introduced which simplify the task 
of state machine design. Abroad spectrum of different function- 
ality-and-performance solutions is available for state machine 
design. In this discussion we will examine the functions per- 
formed by state machines, their implementation on various de- 
vices, and their selection. Finally, we will implement a state 
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machine design and go through all of the stages involved in a 
design tutorial. 


What Is a State Machine? 


A state machine is a digital device which traverses through a 
predetermined sequence of states in an orderly fashion. Astate 
is a set of values measured at different parts of the circuit. A 
simple state machine can consist of PAL-device-based combina- 
torial logic, output registers, and buried (state) registers. The 
state in such a sequencer is determined by the values stored in 
the buried and/or output registers. 


A general form of a state machine can be depicted as a device 
shown in Figure 1. In addition to the device inputs and outputs, 
astate machine consists of two essential elements: combinatorial 
logic and memory (registers). This is similar to the registered 
counter designs discussed on page 2-66, which are essentially 
simple state machines. The memory is used to store the state of 
the machine. The combinatorial logic can be viewed as two 
distinct functional blocks: the next state decoder and the output 
decoder (Figure 2). The next state decoder determines the next 
state of the state machine while the output decoder generates the 
actual outputs. Although they perform two distinct functions, 
these are usually combined into one combinatorial logic array as 
in Figure 1. 







OUTPUTS 


Figure 1. Block Diagram of a Simple State Machine 
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Figure 2. State Machine, with Separate Output & Next State Decoders 
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The basic operation of a state machine is twofold: 


1. Ittraverses through a sequence of states, where the next state 
is determined by next state decoder, depending upon the 
present state and input conditions. 


2. It provides sequences of output signals based upon state 
transitions. The outputs are generated by the output decoder 
based upon present state and input conditions. 


Using input signals for deciding the next state is also known as 
branching. In addition to branching, complex sequencers provide 
the capability of repeating sequences (looping) and subroutines. 
The transitions from one state to another are called control 
sequencing and the logic required for deciding the next states is 
called the transition function (Figure 2). 


The use of input signals in the decision-making process for output 
generation determines the type of a state machine. There are two 
widely-known types of state machines: Mealy and Moore 
(Figure 3). Moore state machine outputs are a function of the 
present state only. In the more general Mealy-type state ma- 
chine, the outputs are functions of both the state and the input 
signals. The logic required is known as the output function. For 
either type, the control sequencing depends upon both states and 
input signals. 


Most practical state machines are synchronous sequential cir- 
cuits which rely on clock signals to trigger the state transitions. A 
single clock is connected to all of the state and output edge- 
triggered flip-flops, which allows a state change to occur on the 
rising edge of the clock. Asynchronous state machines are also 
possible, which utilize the propagation delay in combinatorial 
logic for the memory function of the state machine. Such 
machines are highly susceptible to hazards, hard to design and 
are seldom used. In our discussion we will focus solely on 
sequential state machines. 
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State Machine Applications 


State machines are used in a number of system control applica- 
tions. A sampling of a few of the applications, and how state 
machines are applied, is described below. 


As sequencers for digital signal processing (DSP) applications, 
state machines offer speed and sufficient functionality without the 
overkill of complex microprocessors. For simple algorithms, such 
as those involved in performing a Fast Fourier Transform (FFT), 
a state machine can control the set of vectors that are multiplied 
and added in the process. For complex DSP operations, a pro- 
grammable DSP may be better. On the other hand, the program- 
mable DSP solution is not likely to be as fast as the dedicated 
hardware approach. 


Consider the case of a video controller. it generates addresses 
for scanning purposes, using counters with various sequences 
and lengths. But instead of implementing these as actual count- 
ers, the sequences involved can be “unlocked” and implemented, 
instead, as state machine transitions. And there is an advantage 
beyond mere economy of parts. A count can be set or initiated, 
then left to take care of itself, freeing the microprocessor for other 
operations. 


In peripheral control the simple state machine approach can be 
very efficient. Consider the case of run-length-limited (RLL) code. 
Both encoding and decoding can be translated into state ma- 
chines, which examine the serial data stream as it is read, and 
generate the output data. 


Industrial control and robotics offer further areas where simple 
control functions are required. Such tasks as mechanical posi- 
tioning of a robot arm, simple decision making, and calculation of 
a trigonometric function, usually do not require the high-power 
solution of microprocessors with stacks and pointers. Rather, 
what is required is a device that is capable of storing a limited 
number of states and allows simple branching upon conditions. 
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Figure 3. The Two Standard State Machine Models 
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Data encryption and decryption present similar problems to those 
encountered in encoding and decoding for mass media, only here 
it is desirable to make the scheme not so obvious. A program- 
mable state machine device with a security fuse is ideal for this 
because memory is internally programmed and cannot be ac- 
cessed by someone tampering with the system. 


Functions Performed 


Allofthe system design functions performed by controllers can ba 
categorized as one of the following state machine functions: 


¢ Arbitration 

¢ Event monitoring 

¢ Multiple condition testing 
¢ Timing delays 

* Control signal generation 


Later we will take a design example and illustrate how these 
functions can be used when designing a state machine. 


State Machine Theory 


Let us take a brief look at the underlying theory for all sequential 
logic systems, the finite state machine (FSM), or simply state 
machine. 


Those parts of digital systems whose outputs depend on their 
past inputs as well as their current ones can be modeled as finite 
state machines. The “history” of the machine is summed up inthe 
value of its internal state. When a new input is presented to the 
FSM, an output is generated which depends on this input and the 
present state of the FSM, and the machine is caused to move into 
a new state, referred to as the next state. This new state also 
depends on both the input and present state. The structure of an 
FSM is shown pictorially in Figure 2. The internal state is stored 
in a block labelled “memory”. As discussed-earlier, two combina- 
torial functions are required: the transition function, which gener- 
ates the value of the next state, and the output function, which 
generates the state machine output. 


State Diagram Representation 


The behavior of an FSM may be specified in graphical form as 
shown in Figure 4. This is called a state diagram, or state 
transition diagram. Each bubble represents a state, and each 
arrow represents atransition between states. Inputs which cause 
the transitions are shown next to each transition arrow. 
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Figure 4. State Machine Representation 


Control sequencing is represented in the state transition diagram 
as shown in Figure 5. Direct control sequencing requires an 
unconditional transition from state Ato state B. Similarly condi- 
tional control sequencing shows a conditional transition from 
state C to either state D or state E, depending upon input 
signal I1. 





b. Conditional Control 
Sequencing 


a. Direct Controt 
Sequencing 


407 06 
Figure 5. Control Sequencing 


For Moore machines the output generation is represented by 
assigning outputs with states (bubbles) as shown in Figure 6. 
Similarly, for Mealy machines conditional output generation is 
represented by assigning outputs to transitions (arrows), as was 
shown in Figure 4. More detail on Mealy and Moore output 
generation is given later. 
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a.Moore Machine b. Mealy Machine 
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Figure 6. Output Generation 


For this notation, there is a specification uncertainty as to which 
signals are outputs or inputs, as they both occur on the drawing 
next to the arrow in which they are active. This is usually resolved 
by separating the input and output signals names with a line 
(Figures 4 & 6). Sometimes an auxiliary pin list detailing the logic 
polarity and input or output designations is also used. 


State transition diagrams can be made more compact by writing 
onthe transitions not the input values which cause the transition, 
as in Figure 4, but a Boolean expression defining the input 
combination or combinations which cause this transition. For 
example, in Figure 7, some transitions have been shown for a 
machine with inputs “START”, “X1” and “X2”. In the transition 
between states 1 and 2, the inputs X1 and X2 are ignored (thatis, 
they are “don't cares”) and thus do not appear on the diagram. 
This saves space and makes the function more obvious. 


START 
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Figure 7. State Transition Diagram with Mnemonics 
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There can be a problem with this method if one is careless. The 
state transitions in Figure 8 show what can happen. There are 
three input combinations, {l0, 11, 12, 13} = {1011}, {1101} and 
{1111}, which make both (A0*/l2 + 13) and (10*I1 + 10°12) true. 
Since a transition to two next states is impossible, this is an error 
in the specification. It must either be guaranteed that these input 
combinations never occur, or the transition conditions must be 
modified. In this example, changing (l0*l1 + 10°12) to (I0*I1 + 
10*12)*/I3 would solve the problem. 
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Figure 8. State Diagram with Conflicting 
Branch Conditions 


State Transition Table Representation 


Asecond method for state machine representation is the tabular 
form known as the state transition table, which has the format 
shown in Figure 9. Along the top are listed all of the possible input 
bit combinations and internal states. Each row gives the next 
state and the next output; the table thus specifies the transition 
and output functions. This type of table, however, is not suitable 
for specifying practical machines in which there is alarge number 
of inputs, since each input combination defines a row of the table. 
With 10 inputs for example, there would have to be 1024 rows! 
A modified version of this table is often used directly for program- 
mable logic sequencer (PLS) device design. 


PRESENT | INPUTS NEXT STATE OUTPUTS 
STATE GENERATED 
Tas Dw) ae | oe 


Figure 9. A State Transition Table 






Flowcharts 


Another popular notation is based on flowcharts. In this notation, 
states are represented by rectangular boxes, and alternative 
state transitions are determined by strings of diamond-shaped 
boxes. The elements may have multiple entry points, but in 
general have only one exit. The state name is written as the first 
entry inthe rectangular state box. Any Moore outputs present are 
written next in the state box, with a caret (“*”) following those that 
are unregistered. The state code assignment, if it is known, is 
written next to the upper right corner of the state box. Decision 
boxes are diamond or hexagonal shaped boxes containing either 
an input signal or a logic expression. Two exits labelled “0” and 
“1” lead to either another decision box, a state box, or a Mealy 
output. The rounded oval is used for Mealy machine outputs. 
Once again, a caret (“4”) follows those outputs that are unregis- 
tered. Allof the boxes may need to be expanded to accommodate 
a number of output signals or a larger expression. 





The use of these symbols is shown in Figure 10. Each path 
through the decision boxes from one state to another defines a 
particular combination or set of combinations of the input vari- 
ables. Apath does not have to include all input variables; thus it 
accommodates “don't cares”. These decision trees take more 
space than the expressions would, but in many practical cases, 
state machine controllers only test a small subset of the input 
variables in each state and the trees are quite manageable. Also, 
the chain of decisions often mirrors the designer’s way of thinking 
about the actions of the controller. Itis important to note thatthese 
tests are not performed sequentially in the FSM; all are performed 
in parallel by the FSM'’s state transition logic. 


A benefit of this method of specifying transitions is that the 
problem of Figure 8 can be avoided. Such a conflict would be 
impossible as one path cannot diverge to define paths to two 
States. 


When there is no danger of conflicts due to multiple next states 
being defined, this flowchart notation can be compacted by 
allowing more complex decisions. Expressions can be tested, as 
shown in Figure 11a, or multiple branches can extend from a 
decoding box, as in Figure 11b. In the second case it is 
convenient to group the set of binary inputs into a vector, and 
branch on different values of this vector. 


The three methods of state machine representation—state dia- 
grams, state tables, and flowcharts—are all equivalent and 
interchangeable, since they all describe the same hardware 
structure. Each style has its own particular advantages. Although 
most popular, the state transition diagrams are more complex for 
problems where state transitions depend on many inputs, since 
the transition conditions are written directly on the transition 
arrows. Although cumbersome, the state tables allow the de- 
signer tight control over signal logic. Flowcharts are convenient 
for small problems where there are not more than about ten states 
and where up to two or three inputs or input expressions are 
tested in each state. For larger problems, they can become 
ungainly. 


Once a state machine is defined, it must be implemented on a 
device. Software packages are then used to implement the 
design on a device. The task is to convert the state machine 
description into transition and output functions. Software pack- 
ages also account for device-specific architectural variations and 
limitations, to provide a uniform user interface. 
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Figure 10. Flowchart Notation 
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Some software packages accept all of the three different state 
machine representations directly as design inputs. However, the 
most prevalent design methodology is to convert the three state 
machine design representations to a simple textual representa- 
tion. Textual representations are accepted by most software 
packages although the syntax varies. The PALASM 2 software 
package offers one such simple and easy-to-use state machine 
textual representation. The task of converting from a state 
transition diagram and flowchart representation to PALASM 2 
software state machine syntax is demonstrated in a design 
tutorial on page 2-122. 


Since the most common of all state machine representations is 
the state transition diagram representation, we will use it in all 
subsequent discussions. Transition table and flowchart repre- 
sentation implementations will be very similar. 
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State Machine Types: Mealy & Moore 


With the state machine representation clarified, we can now 
return to the generic sequencer model of Figure 1, which has 
been labelled (Figure 12) to show the present state (PS), next 
state (NS) and output (OB, OA). This will illustrate how Mealy and 
Moore machines are implemented with most sequencer devices 
which provide a single combinatorial logic array for both next state 
and output decode functions. There are four ways of using the se- 
quencer, two of which implement Moore machines and two Mealy. 
First, let us look at the Mealy forms. 


The standard Mealy form is shown in Figure 13, where the signals 
are labelled as in Figure 12 toindicate which registers and outputs 
are used. The register outputs PS are fed back into the array and 
define the present state. The combinatorial logic implements the 
transition function, which produces the next state flip-flop inputs 
NS, and the output function, which produces the machine output 
OB. This is the asynchronous Mealy form. 
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Figure 11. Using Fiowcharts 
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Figure 12. Generic Model of an FSM 
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Analternative Mealy form is shown in Figure 14. Here the outputs 
are passed through an extra output register (OA) and thus do not 
respond immediately to input changes. This is the synchronous 
Mealy form. 


The standard Moore form is given in Figure 15. Here the outputs 
OB depend only on the present state PS. This is the asynchro- 
nous Moore form. The synchronous Moore form is shown in 
Figure 16. In this case the combinatorial logic can be assumed 
to be the unity function. The outputs (OB) can be generated 
directly along with the present state (PS). Although these forms 
have been described separately, a single sequencer is able to 
realize a machine which combines them, provided that the 
required paths exist in the device. 
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In the synchronous Moore form, the outputs occur in the state in 
which they are named in the state transition diagram. Similarly in 
the asynchronous Mealy and Moore forms the outputs occur in 
the state in which they are named, although delayed alittle by the 
propagation delay of the output decoder. This is because they are 
combinatorial functions of the state (and inputs in the Mealy 
case), 


However, the synchronous Mealy machine is different. Here an 
output does not appear in the state in which it is named, since it 
goes into another register first. It appears when the machine is 
in the next state, and is thus delayed by oneclock cycle. The state 
diagram in Figure 17 illustrates all of the possibilities on a state 
transition diagram. 
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Figure 14. Synchronous Mealy Form 
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Figure 16. Synchronous Moore Form 
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Figure 17. State Diagram Labelling for 
Different Output Types 


As amatter of notation, Moore outputs are often placed within the 
state bubble and Mealy outputs are placed next to the path or 
arrow which activates them. 


The relationship of Mealy and Moore, synchronous and asyn- 
chronous outputs to the states is shown in Figure 18. 


Device Selection Considerations 


Architecturally, the state machine devices can be divided into 
three categories: 


* Logic-based devices 


* Memory-based devices 
¢ Instruction-based devices 
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Logic-based devices include the PAL and PLS devices. These 
devices use the sum-of-products logic array to implement the 
transition and output functions. The memory-based devices like 
PROSE implement the transition and output functions using a 
PROM or RAM array. Conceptually this is no different from the 
logic-based devices, since the memory can be viewed as special 
logic. The instruction-based sequencers (Am29PL141) offer 
hardwired instructions and fixed logic blocks to provide the 
transition function logic with enhanced capabilities like subrou- 
tines. Functionally all three types of devices work similarly per- 
forming two basic functions: control sequencing and output 
generation. 


There are three major criteria for selecting the correct state 
machine device for a design: 


Number of inputs/outputs 

— VO flexibility 

— Number of output registers 

Speed 

Intelligence/functionality 

— Number of product terms 

— Type of flip-flops 

~— Number of state registers 

— Number of PROM locations (memory-based sequencers) 


Number of I/Os 


The number of inputs, outputs and I/O pins determines the signals 
which can be sampled or generated by a state machine. Figure 
19 lists the devices offered for state machine designs, and shows 
the number of inputs, outputs and I/O pins available on each 
device. 


Timing and Speed 


The timing considerations for sequencer design are similar to 
those for registered logic design (page 2-64). A system clock 
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Figure 18. State Machine Timing Diagram 
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DEVICE TOTAL DEDICATED DEDICATED LOGICAL SPEED GRADES 
NUMBER INPUT REGISTERED PRODUCT FREQUENCY IN 

OF PINS PINS OUTPUT TERMS/ MHz 
PINS OUTPUT f 


MAX 








55.5, 37, 25, 16 
55.5, 37, 25, 16 
55.5, 37, 25, 16 


omnmnmnwmnwoaoaaa 
ohoawwma ® © 


RADWHAAMDS | AOORAD RAP RAOARDO 
oO 
fe eee eee — 


ao 
‘ 

_ 

nr 


20RP10 
20RP8 
20RP6 
20RP4 
20XRP10 
20XRP8 
20XRP6 
20XRP4 


37, 25 
30, 22.2, 14 
30, 22,2, 14 
30, 22.2, 14 
30, 22.2, 14 
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20RS10 
20RS8 
20RS4 





20RA10 
22RX8 
22V10 
32VX10 
32R16 


PLS167 
PLS168 








Figure 19. Devices for State Machine Design 
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DEVICE TOTAL DEDICATED | DEDICATED vo LOGICAL SPEED GRADES 
NUMBER INPUT REGISTERED PINS PRODUCT FREQUENCY IN 
OF PINS PINS OUTPUT TERMS/ MHz 
PINS OUTPUT fic 
C22V10 24 0-10 10-0 8-16 33.3, 20 
C29M16 24 16 8-16 20, 15 
C29MA16 24 16 4-12 20, 15 
ECL 





10H20EV/EG8 24 12 0-8 8-0 8-12 
10020EV/EG8 24 12 0-8 8-0 8-12 


125 
125 





Figure 19. Devices for State Machine Design (Cont'd.) 


cycle forms the basic kernel for evaluating control function 
behavior. For the most part, all input and output functions are 
specified in relationship to the positive edge. Registered outputs 
are available after a period of time t,,,, the clock-to-output 
propagation delay. Asynchronous outputs require an additional 
propagation delay (t,,.) before they are valid. 


For the circuit to operate reliably, all of the flip-flop inputs must be 
stable at the flip-flop no later than the minimum set-up time (t,,) of 
the flip-flops before the next active clock edge. If one of the inputs 
changes after this threshold, then the next state or synchronous 
output could be stored incorrectly; the circuit may even malfunc- 
tion. To avoid this, the clock period (t,) must be greater than the 
sum of the set-up time of the flip-flops and the clock to output time 
(t,, +t,,,). This determines the minimum clock period and hence 
the maximum clock frequency, f,,,,, Of the circuit. Metastability 
and erroneous system operation may occur if these specifica- 
tions are violated. 


The timing relationships are shown in Figure 20. In each cycle 
there are two regions: the stable region, when all signals are 
steady, and the transition region, when the machine is changing 


state and signals are unstable. The active clock edge causes the 
flip-flops to load the value of the new state which has been set up 


INPUT 


407 18 


STATE n 


at their inputs. At atime afterthis, the present state and output flip- 
flop outputs will start to change to their new values. After atime 
has elapsed, the slowest flip-flop output will be stable at its new 
value. Ignoring input changes for the moment, the changes in the 
state register cause the combinatorial logic to start generating 
newvalues forthe asynchronous outputs and the inputs to the flip- 
flops. If the propagation delay of the logic is t,,, then the stable 
period will start at a time equal to the sum of the maximum values 
of t,,,, and t,,. 


Figure 19 also shows the maximum operating frequency of the 
devices. ECL-based PAL sequencers can implement simple 
state machines from 60 MHz to above 100 MHz. Conventional 
TTLPAL sequencers can now reach speeds of 55 MHz. The new 
PLS devices operate at 37 MHz. Finally, PROSE sequencers 
provide 25 MHz operation, and the Am29PL141 can run at 20 
MHz. The design engineer usually selects the simplest device 
that provides the desired level of performance. 


Asynchronous Inputs 
The timing of the inputs to a synchronous state machine are often 


beyond the control of the designer and may be random, such as 
sensor or keyboard inputs, or they may come from another 





Figure 20. Timing Diagram for Maximum Operating Frequency 
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Figure 21. Asynchronous Input Causing Race 


synchronous system that has an unrelated clock. In either case 
no assumptions can be made about the times when inputs can or 
cannotarrive. This fact causes reliability problems that cannot be 
completely eliminated, but only reduced to acceptable levels. 


Figure 21 shows two possible transitions from state "S1” (code 
00) either back to itself, orto state "S2” (code 11). Whichtransition 
is taken depends on input variable "A” which is asynchronous to 
the clock. The transition function logic for both state bits B1 and 
B2 includes this input. The input A can appear in any part of the 
clock cycle. For the flip-flops to function correctly, the logic for B1 
and B2 must stabilize correctly before the clock. The input should 
be stable in a windowt,, (setup time) before the clock andt, (hold 
time) after the clock. If the input changes within this window, both 
the flip-flops may not switch, causing the sequence to jump to 
states 01 or 10, which are both undefined transitions. This type 
of erroneous behavior is called an input race. 


A solution to this problem is to change the state assignment so 
that only one state variable depends on the asynchronous input. 
Thus the 11 code must be changed to 01 or 10. Now, with only 
one unsynchronized flip-flop input, either the input occurs in time 
to cause the transition, or it does not, in which case no transition 
occurs. In the case of a late input, the machine will respond to it 
one cycle later, provided that the input is of sufficient duration. 


There is still the possibility of an input change violating the setup 
time of the internal flip-flop, driving it into a metastable state. This 
can produce system failures which can be minimized, but never 
eliminated (see Metastability, page 3-164). The same problem 
arises when outputs depend on an asynchronous input. 


Very little can be done to handle asynchronous inputs without 
severely constraining the design of the state machine. The only 
way to have complete freedom in the use of inputs is to convert 
them into synchronous inputs. This can be done by allocating a 
flip-flop to each input as shown in Figure 22. These synchronizing 
flip-flops are clocked by the sequencer clock, and may even be 
the sequencer’s own internal flip-flops. This method is not 
foolproof, but significantly reduces the chance of metastability 
occurring. 


Functionality 


The functionality of different devices is difficult to compare since 
different device architectures are available. The number of 
registers in a device determines the number of state combina- 
tions possible. However, all of the possible state combinations 
are not necessarily usable, since other device constraints may be 
reached. The number of registers do give an idea of the 
functionality achievable in adevice. Other functionality measures 
include the number of product terms, type of flip-flop, or the 
number of PROM locations in a memory-based sequencer. One 
device may be stronger than another in one of these measures, 
but overall may be less useful due to other shortcomings. Choos- 
ing the best device involves both skill and experience. 


A designer has a complete spectrum of devices with different 
architectures to choose from for a state machine design. These 
range from the PAL16R8D family of very high speed devices to 
the new PROSE PMS14R21 and instruction-based Am29PL141 
high-functionality devices. The spectrum is completed by mid- 
range PAL devices including the PAL23S8, PAL22RX8A and 
PAL32VX10/A, with architectural features specifically designed 
for state machine designs, and the PLS devices designed exclu- 
sively for state machine implementation. 


In order to give an idea of device functionality, we will consider 
each of the architecture options available to the designer and 
evaluate its functionality. 
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Figure 22. Input Synchronizing Register 
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PAL Devices as Sequencers 


A vast majority of state machine designs are implemented with 
PAL devices. Early versions of software required the user to 
manually write the sum-of-products Boolean equations for using 
PAL devices. Second generation software allows one to specify 
the design in “state machine syntax,” and handles the translation 
to sum-of-products logic automatically. PAL devices implement 
the output and transition functions in sum-of-products form via a 
user-programmable AND array and a fixed OR array. 


PAL devices deliver the fastest speed of any sequencer and are 
ideally suited for simple control applications characterized by faw 
input and output signals interacting within a dedicated controller 
in asequential manner. The number of flip-flops in a PAL device 
range from 8 to 12, which offer potentially more than one thou- 
sand state values. Since some of the flip-flops are used for 
outputs, and the number of product terms is limited, the usable 
number of states is reduced drastically. Generally, up to about 35 
states can be utilized. 


PAL Device Flip-flops 


PAL device based sequencers implement small state machine 
designs, which have a relatively large number of output transi- 
tions. Since the output registers change with most state transi- 
tions, they can be used simultaneously as state registers, once 
the state values are carefully selected. Most PAL devices are 
used for small state machines, and efficiently share the same 
register for output and state functions. High-functionality PAL 
device based sequencers provide dedicated buried state regis- 
ters when sharing is difficult. 


TOTAL DEDICATED 
NUMBER INPUT 
OF PINS PINS 


DEVICE 





owmwWwOWO WO WO OO 


DEDICATED 
REGISTERED 
OUTPUT 
PINS 


As a state machine traverses from one state to another, every 
output either makes a transition (changes logic level) or holds 
(stays at the same logic level). Small state machine designs 
require relatively more transitions and fewer holds. As designs 
get larger, state machines statistically require relatively fewer 
transitions and more holds. 


Most PAL devices provide D-type output registers. D-type flip- 
flops use up product terms only for active transitions from logic 
LOW to HIGH level, and for holds for logic HIGH level only. J-K, 
S-R, and T-type flip-flops use up product terms for both LOW-to- 
HIGH and HIGH-to-LOW transitions, but eliminate hold terms. 
Thus D-type flip-flops are more efficient for small state machine 
designs. Some high-end PAL devices offer the capability of 
configuring the flip-flops as J-K, S-R or T-types, which are more 
efficient for large state machine designs since they require no 
hold terms. 


Many examples of PAL-device-based sequencers can be found 
in system time base functions, special counters, interrupt control- 
lers, and certain types of video display hardware. 


PAL devices are produced in avariety of technologies for multiple 
applications, and provide a broad range of speed-power options. 
PAL devices which can be used for sequencer designs are listed 
in Figure 23. We will consider the following PAL devices in detail. 


PAL10H/10020EV/EG8 
PAL16R8D family 
PAL23S8 

PAL22RX8 
PAL32VX10 


LOGICAL SPEED GRADES 

PRODUCT FREQUENCY IN 
TERMS/ MHz 
OUTPUT f 


MAX 


55.5, 37, 25, 16 
55.5, 37, 25, 16 
55.5, 37, 25, 16 
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37, 25, 16 

37, 25, 16 

37, 25, 16 
22.2 
22.2 
22.2 
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Figure 23. Table of PAL Devices for Sequencer Applications 
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DEVICE DEDICATED DEDICATED LOGICAL SPEED GRADES 








INPUT REGISTERED PRODUCT FREQUENCY IN 
OUTPUT TERMS/ MHz 
PINS OUTPUT fe 
20RP10 10 37, 25 
20RP8 8 37, 25 
20RP6 6 37, 25 
20RP4 4 37, 25 
20XRP10 10 30, 22.2, 14 
20XAP8 8 30, 22.2, 14 
20XRP6 6 30, 22.2, 14 
20XRP4 4 30, 22.2, 14 





20RS10 
20RS8 
20RS4 





20XRP10 30, 22.2, 14 
20XRP8 30, 22.2, 14 
20XRP6 30, 22.2, 14 
20XRP4 30, 22.2, 14 
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owmwon oa) See OmMDDND DAD 
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20RS10 8-16 
20RS8 8-16 
20RS4 8-16 









20RA10 
22RX8 
22V10 
32VX10 
32R16 


33, 20 
28.5 
40, 28.5, 18 



















C22V10 













C29M16 
C29MA16 


10H20EV/EG8 24 12 0-8 8-0 8-12 
10020EV/EG8 24 12 0-8 8-0 8-12 


Figure 23. Table of PAL Devices for Sequencer Applications (Cont'd.) 








2-112 ol Monolithic KL Memorles cl 


State Machine Design 





The ECL PAL10H/10020EV/EG8 


At 125 MHz, the ECL PAL10H/10020EV/EG8 provides 
the highest speed for a state machine design. The 
PAL10H/10020EV/EG8 has eight outputs and 20 inputs. Half of 
the outputs have 8 product terms and half have 12 product terms. 
All of the 20EV8 outputs use D-type flip-flops, while the 20EG8 
outputs are transparent latches. Two architectural fuses per 
output control the polarity of the logic and bypass the flip-flop for 
combinatorial asynchronous outputs. Two global product terms 
are present to Set and Reset all flip-flops. Maximum frequency 
of operation as a state machine is 125 MHz. 


Figure 24 details the macrocell for the PAL10H/10020EV8. 


The PAL16R8D Family 


This is the high-speed end (55 MHz) of the TTL state machine 
design solutions. PAL16R8D family devices are available in three 
different versions (PAL16R8/6/4) with varying numbers of regis- 
tered and combinatorial outputs and I/O pins for convenient 
design fitting. The PAL16R8 family provides 64 product terms 
(eight per output). 


The software design options available for the PAL16R8D family 
are the traditional Boolean equations and the state machine 
syntax provided by PALASM 2 software. 


The PAL23S8 


At 33.3 MHz and in a 20-pin DIP, this device (Figure 25) provides 
the highest performance-to-board-space ratio. Designed for 
high-functionality state machines, this device offers eight output 
and six dedicated buried registers. It also offers four output 
macrocells with register bypass for asynchronous outputs. It has 
nine dedicated device inputs and four I/O pins giving a total of 23 
array inputs. For improved state machine initialization, it also 
offers dedicated synchronous Preset and asynchronous Reset 
product terms. Pin-compatible with the PAL16R8 family, this 
device offers varied distribution of 145 product terms for large 
designs and optimal fit. 


The software design options include Boolean equations and state 
machine syntax. 





630 02 


Figure 24. The PAL10H/10020EV8 Output Macrocell 
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The PAL22RX8A 


The 24-pin PAL22RX8A (Figure 26) provides eight configurable 
input/output macrocells with register bypass for a user-program- 
mable number of registered or combinatorial outputs and I/O 
pins. It also provides dedicated asynchronous reset and preset 
product terms for state machine initialization. The PAL22RX8A 
provides a product-term-controlled XOR gate with its sum-of- 
products logic. This allows users to configure individual macro- 
cells with D, J-K, S-R or T-type flip-flops. 


The requirements of transition and hold terms are dependent 
upon state selection, which varies for different applications. The 
PAL22RX8A provides both D-type and J-K type flip-flops, allow- 
ing the designer to select the one requiring fewer product terms. 


The PAL22V10 


The PAL22V10 provides output macros similar to the 
PAL22RX8A. It provides both programmable polarity and regis- 
ter bypass. In addition, it has varied product term distribution. 


The PAL22V10 has an advantage of a large number of product 
terms. It provides a varied distribution of eight to sixteen product 
terms per output. It also provides ten output macrocells, allowing 
for larger designs. 


The PAL32VX10/A 


The PAL32VX10/A (Figure 27) provides high functionality and 
density for PAL device designs. The PAL32VX10/A has config- 
urable flip-flops and initialization product terms for state machine 
designs. It offers unprecedented design density because ofits 10 
input/output macros and a large number (8-16/output) of product 
terms. 


The PAL32VX10/A also provides dual feedback paths from each 
output macrocell. This allows the macrocell register to be used 
asa buried state register incase the I/O pinis used as adedicated 
input. The PAL32VX10/A is designed for large state machine 
designs which have relatively few output transitions and require 
separate output and buried registers. The PAL32VX10/A pro- 
vides up to 10 buried registers for state machine design without 
any I/O penalty. The PAL32VX10/A also allows trading-off output 
registers for state registers and vice-versa. This optimizes the 
use of device resources. 
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Figure 25. Block Diagram of PAL23S8 
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Figure 26. PAL22RX8A Output Macrocell 
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Figure 27. PAL32VX10/A Output Macrocell 
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Figure 28. Architecture of a PLS105 Device 
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Programmable Logic Sequencers (PLS) 


Another alternative for control logic is the PLS (Programmable 
Logic Sequencer) device (Figure 28). PLS devices have both 
programmable AND and programmable OR arrays. Forty-eight 
product terms are available, which can be shared between 
transition functions and output functions for all of the outputs, as 
dictated by the application needs. PLS devices also offer S-R flip- 
flops with buried registers for control sequencing. The flip-flops 
have acommon clock line. The S-R type flip-flops are similar to 
the J-K type flip-flops discussed earlier, and require no hold 
terms. Consequently S-R type flip-flops are very efficient forlarge 
state machine designs. PLS devices can function as either as 
Moore machines or Mealy machines with a register on the output 
path. 


With the improvements in design tools, it is possible to specify 
designs directly from the state diagram. However, one format 
which has been used often in the past is the PLS table. This table 
is primarily used where the design is entered manually. The table 
is also useful for learning about and experimenting with se- 
quencer designs. Often, itis possible to manipulate sequencer 
programs using the table in ways which are not possible with 
some of the more rigid CAD languages. 


The table has provision-for input and output signal pin allocation 
and naming, and aline for each of the 48 device product terms. 
The entries used in each line are shown in Figure 29. 


AND ARRAY TERMS 
VALUE 


(input, present state) 


true input/state bit 
complement 
don't care 


Set Flip-flop 
Reset Flip-flop 
no change 


Figure 29. Symbols Used in a PLS Table 
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Figure 30. Example State Machine 





The table defines the output function and transition function. Both 
functions depend on the present state and inputs. Each possible 
link path between two states uses a product term and becomes 
a line in the PLS table. The collection of all link path lines is a 
complete definition of the combinatorial logic required. The 
following example illustrates the relationship between a state 
diagram and the PLS table. 


Let us look at an arbitrary function as shown in Figure 30. The 
inputs are A,B,C, and D; the state names are SO, $1, S2 and S3; 
and the outputs are W, X, Y and Z. To construct the PLS table, a 
state assignmentis necessary. Selecting an arbitrary binary code 
for each state: 


SO = 00 
$1 =01 
$2 = 10 
$3 = 11 


With the two state variables P and Q, the PLS table can be 
derived, and is shown in Figure 31. 


PRESENT| NEXT | OUTPUTS 
STATE | STATE 


INPUTS 
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Figure 31. Example PLS Table 
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Figure 32. “HOLD” Transitions 
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This is now a complete description of the combinational logic of 
the state machine. Every line in the table is a product term which 
is active when conditions for a transition to another state are 
satisfied. For this reason, the product terms in a PLS device are 
also known as transition terms. As the circuit moves from state 
to state, new transition terms become active as others become 
inactive. 


Acomment may be appended to each term to aid documentation 
of the design. 


“Hold” Terms 


The flip-flops used in PLS devices are the S-R type. As discussed 
earlier, S-R flip-flops do not require any product terms for holds 
and require product terms only for state transition or output 
generation (Figure 32). 


This may be seen in the two lines of the PLS table corresponding 
to the two link paths shown, in Figure 33. 


ae PRESENT 
STATE 





LHLH 
: LHLH 





Figure 33. Hold Transitions In a PLS Table 


The first path, corresponding to the first line in Figure 33, is a 
transition back to the same state. No change is thus required in 
any of the state variables. ‘-’ entries can thus be used. The first 
line has only ‘-’ entries in the OR Array part of the table; the 
transition term is thus not connected and this line can be elimi- 
nated altogether. This is a logic minimization which can be done 
immediately when converting the flowchart or state diagram into 
aPLS table. The second path is to a different state, “0110”. This 
does require a term since some state variables have to change. 
Also, note that “hold” state transitions are only free if no output 
changes are required, 


Complement Array Term 


An important addition to the PLS device’s programmable AND- 
OR array is the complement array term (Figure 34). This is an 








extra OR line which is first complemented and then fed back as 
an additional input into the AND array. Figure 35 shows the three 
possible configurations of the array at each transition term. The 
symbols shown are used in an extra truth table column for config- 
uring the complement array. 


Taz Tas Ty Tp 


TRANSITION TERMS 





X INDICATES FUSE 
407 26 


Figure 34. Connections of the Complement Array Term 


Proper usage of the complement array can lead to considerable 
savings in product terms. This can be seen in the typical use of 
the complement array in the implementation of default or “ELSE” 
transitions. Figure 36 shows a set of four transitions from state 
SA, three of which are enabled by one of three combinations of 
inputs A, B, and C, causing transitions to states SO, Si and S2. 
All other combinations cause a transition to state SX. 
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Figure 35. Complement Array Configurations 
INPUT COMBINATIONS 
A BC 
A BC 
ABC 
A BC 
A BC 
A BC 
A BC 
A BC 
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Figure 36. Transitions with a Default 
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Three product terms are required for transition to states SO, S1 
and S2 which encode the three different conditions. For the 
default transition to state SX, the remaining five condition combi- 
nations of A, B and C would have to be encoded, requiring five 
more product terms (Figure 37). But these terms can be thought 
of as “not the transitions to SO or S1 or S2”. By using the comple- 
ment array (which provides this NOT signal), only one more 
product term is required for the default transition (Figure 38). 


The last five terms have been compressed into one. If none ofthe 
three defined input combinations is true in state SA, the inputs to 
the complement array will all be zeros, and the complement array 
output will be a one. This activates the fourth transition term, 
causing a transition to SX (Figure 39). 


Thesame complement array term canbe used in other states too, 
since the propagated signal is ANDed with the present state code. 
Therefore all defaults can be realized with one term per state. 
PLS devices can be an effective solution in state- and branch- 
intensive applications, typically found in a variety of protocol con- 
trollers, waveform generators, and sequence detectors. In such 
applications, complex arrangements of multiple branches may be 
present. Multi-step control functions may include many loops of 
dozens of steps to count packets of data or input events as they 
occur. 


PLS devices can be characterized by the number of inputs, 
number of product terms, number of flip-flops and number of 
outputs. The MMI PLS family currently has three members; these 
are listed in Figure 40. 


Software for PLS devices can usually accept input descriptions in 
three formats: truth tables, Boolean equations and state machine 
syntax. The first two require the user to work out the explicit logic 
assignment for all terms and code the detailed fuse states 
programmed or unprogrammed. The new state machine syntax 
allows design in a high-level syntax, and is easier to use. 






ALL THREE PRODUCT 
TERMS ARE LOW ="0" 
WHEN NONE OF THE 
THREE CONDITIONS FOR 
TRANSITION TO S;,S2 OR Sy 
1S VALID 
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Figure 39. 
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Figure 37. Product Terms Required for Transition to 
State SX 
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Figure 38. Default Transition Using Complement 
Array Term 
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DEVICE TOTAL DEDICATED | DEDICATED LOGICAL SPEED GRADES 
NUMBER INPUT REGISTERED PRODUCT FREQUENCY IN 

OF PINS PINS OUTPUT TERMS/ MHz 

OUTPUT fa 





PLS167 24 14 Total 48 33 
PLS168 24 12 Total 48 33 

































Figure 40. Table of PLS Devices 


PROSE Sequencer (PMS14R21) 


The PROSE (PMS14R21 PROgrammable SEquencer) is arevo- 
lutionary architecture optimized for very high functionality state 
machine designs. It combines a PROM and a PAL array ona 
single device, utilizing the efficiencies of both for state machine 
design. 


The PROSE is a high-speed, 14-input, 8-output state machine. It 
consists of a 128x21 PROM array preceded by a 14H2 PAL array 
(see Figure 41). The PAL array is efficient for performing logic 
functions on a large number of input conditions, while the PROM 
array is optimal for implementing a large number of productterms 
(decision branches) and states. The combination allows a very 
efficient implementation of a state machine. 





MODE 
SDI 
DCLK spo 
CLK 
PE 
407 30 


Figure 41. PROSE PMS14R21 Device Architecture 


PROM Operation 


The PROM consists of 128, 21-bit word locations. Each of these 
locations in the PROM can be viewed as a state. Eight bits 
Q(7-0) at each location define the outputs. The other 13 bits are 
used as feedback: six as condition select bits CS(5-0) to the PAL 
array, and seven to the PROM array itself for determining the next 
location to be addressed. Five of these seven constitute the low- 
order address bits A(4—0) for the next state. The remaining 2 bits 
are XF(1—-0), inputs to the exclusive-OR gates which determine 
address bits A6 and AS in conjunction with PAL output signals X1 


and X0. Effectively, the seven-bit address of every PROM 
location is generated by seven bits of the PROM data and 
constitutes the present state. The next PROM location is the one 
whose address is stored as data in the present location. Eight of 
the PROM data bits are used as outputs. 


PAL Array Operation 


The PAL14H2 has 14 complementary inputs and 2 active-HIGH 
outputs (X1 and X0). Each of the 2 outputs is a sum of eight 
product terms. Eight of the inputs, (7-0), are from an external 
source, and are used for encoding conditions for branch selec- 
tion. Six inputs, CS(5-0), are from internal feedback from the 
PROM, and are used for selection of the conditions that deter- 
mine branching when decisions are involved in state transitions. 


Just as PROMs are efficient for direct control sequencing, PAL 
arrays are very efficient for encoding input conditions. The 
function of the PAL14H2 in the PROSE device is to encode all 
eight test input conditions along with the state information. Based 
upon these test conditions being true it inverts the polarity of the 
two most significant PROM address bits. This causes the PROM 
to address a different location (state), which is equivalent to 
conditional branching. The PALASM 2 design software selects 
these branch locations automatically and isolates the designer 
from low-level details. 


With 128 PROM locations, the PROSE device can be used for 
large state machines. It is very effective in single-thread control 
functions characterized by long sequences of events generated 
by one “trigger” event. There may be asingle repetitive decision 
evaluated many times or a long chain of decisions evaluated for 
sequential inputs. The output control stream may be quite wide 
and complex depending on the system. Generally, only narrow 
branches (two or four-way) are needed. Typical applications 
occur incontrollers for error detection, numerical processing, and 
data encryption and decryption. The PROM output structure can 
hold any pattern of eight-bit values to be read out as needed. No 
trade-off has to be made with branching complexity to provide this 
output flexibility. Byte or word-oriented expansion can be made 
by arranging multiple devices in parallel. 


PROSE state machines are fully supported by PALASM 2 soft- 
ware. A complete design example of a PROSE-based state 
machine is included on page 2-122. 
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Fuse Programmable Controller 
(Am29PL141) 


The Am29PL141 is a high-performance, single-chip, fuse-pro- 
grammable controller (FPC). This chip is designed to allow the 
implementation of complex state machines and controllers by 
programming the appropriate sequence of microinstructions in 
the on-chip microprogram memory. 









MICROADDRESS 
CONTROL LOGIC 













T(5:0) 
> 
ZERO 
DETECT 
BRANCH 
CONTROL 
LOGIC 
(GOTO) 
EQ 
cc 
cS 4 


CONDITION CODE 
SELECT LOGIC 














MICRO- 
INSTRUCTION 
DECODE 


PIPELINE REGISTER . 


Large state machines require the capability of nested routines. 
Nested state machines are control sequences that may be 
invoked by one or more control functions, but when invoked, 
suspend the “calling” control sequence. Following completion of 
the low-level routine, the higher-level “calling” sequence resumes 
where it left off. The nested task may have modified part of the 
machine environment or the data processed by the system. The 
Am29PL141 provides stacks which allow nested subroutines. 


DECREMENTER 
(COUNTER — 1) 







MICROPROGRAM 
COUNTER (PC) 


MODE 


Pe et lan 5 
| 32 BIT SSR | SDO 


| 
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MICROINSTRUCTION 16 8 2 6 MEMORY 
DECODE LOGIC <7 ‘7 ‘7 
V Ad y, V VV 
ZERO SDI DCLK P(15:8) -P(7:6) —P(S:0) 
pore OUTPUTS 
Figure 42. Architecture of Am29PL141 
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With its on-chip intelligent microprogram address sequencer 
(Figure 42), high-speed 64 x 32-bit PROM-based microprogram 
memory, on-chip pipeline register, and an on-chip diagnostics 
serial shadow (SSR) register, this chip offers two major advan- 
tages: fast operation, and testability. This device is available with 
a 20-MHz clock rate (50-ns cycle time) in a 28-pin dual-in-line 
package. 


Amicroprogram address sequencer is the heart of the FPC, and 
performs all control sequencing functions. The Am29PL141 has 
29 high-level microinstructions which include jumps, loops, sub- 
routine calls, and multiway branching. These microinstructions 
can be conditionally executed based on the test inputs. The 
output generation function is similar to that of the PROSE device, 
where output data is stored in PROM locations. Ablock diagram 
of the FPC is shown in Figure 42. 


The FPC consists of five main logic blocks: the microaddress 
contro} logic, condition code selection logic, branch control logic, 
microinstruction decode logic, and the microprogram memory. 


The microaddress control logic generates the proper sequences 
of addresses for the microinstructions in the microprogram 
memory. APLA microinstruction decoder is used to decode the 
opcode from the microinstruction. This microinstruction decoder 
also generates all necessary control signals for executing each 
microinstruction. Depending on the microinstruction, the mi- 
croaddress can be generated from anumber of different sources: 
branch control, microprogram counter (PC+1), subroutine regis- 
ter, or the microprogram counter (PC). 


Asix-bit stack is available for looping and subroutine calls. When 
the loop counter is not being used for counting purposes, it canbe 
configured with the stack register to implement a two-deep stack 
for nested loops and nested subroutine calls. The condition code 
selection logic consists of an 8:1 test multiplexer. One of eight test 
conditions can be selected on which to base the conditional 
instructions. The polarity of the selected condition code input is 
controlled by the POL bit in the microword. 


The branch control logic is used for generating multiway branch 
addresses from the external inputs T(5—-0). This logic also per- 
forms the comparison between the inputs and the pipeline data 
field for executing the COMPARE instruction. An EQ signal is 
generated as an output of the COMPARE instruction and can be 
used for a condition code test in the next clock cycle. 


With its 64-location PROM, the Am29PL141 provides up to 64 
states for state machine designs. It offers the most sophisticated 
control sequencing circuitry, which allows multiple branching and 
nested subroutines. This can enhance the functionality of the 
device even beyond 64 states. 


The Am29PL141 is supported by an assembler (ASM14X) which 
allows the user to write instructions in a high-level language 
syntax. The assembler then converts the high-level statements 
into the specific device microinstructions and PROM data. The 
JEDEC device output file is also generated for programming the 
device PROM. 


State Machine Design Tutorial 


The following discussion is a tutorial on state machine design 
methodology. In this tutorial we will use the PROSE PMS14R21 
device and explain the state machine design process. The 
PROSE PMS14R21 is supported by the state machine syntax of 
PALASM 2 design software. This software syntax is identical to 
and can be used for programming other PAL and PLS devices. 


There are four stages in astate machine design. Thefirst requires 
conceptualizing the design and selecting the correct device; 
second converting it to its state diagram representation; third 
converting the state diagram to its state machine syntax text file; 
and fourth assembling the file and programming the device. To 
explain all the stages in the design process we will take a general 
design example and then go through the stages of the design 
process. As an alternative design methodology, we will also 
examine the flowchart state representation in the design 
example. 


Conceptualizing the Design 


This is the first step in a state machine design process. Inthis step 
a complete functional description of the design is formalized 
along with the requisite truth tables and/or timing diagrams. We 
will consider a simple traffic signal controller example for illustra- 
tive purposes. 


To begin with, let us visualize the scene of a traffic intersection 
(Figure 43), simplified to two-one-way streets. 


RED 1 
YEL 1 


GRN 1 





DIRECTION 1———> 


<+———- DIRECTION 2 


SEN 2 


YW 


Wd 


SEN 1 
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Figure 43. A Traffic Intersection 
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The traffic intersection shows two one-way streets: one in direc- 
tion 1 and the other in direction 2. Each direction has a signal 
consisting of red, yellow, and green lamps. These lamps are 
activated with appropriately-named active-HIGH signals (RED,, 
YEL,, GRN,, RED,, YEL,, GRN,). The signals are generated by 
the state machine controller. Also, each direction has a sensor 
which provides an active-HIGH signal (SEN,, SEN.) that indi- 
cates the presence of avehicle. Thecontroller has to manage this 
intersection, with the sensors as inputs and the lamps as outputs. 


The assertion of SEN, or SEN, signals a request for a green light 
for traffic in the corresponding direction. Once SEN, is HIGH and 
SEN, is LOW, indicating traffic in direction 1, the GRN, light 
should be on at all times. Similarly, when SEN, is HIGH and SEN, 
is LOW, the GRN, light should be on at alltimes. When SEN, and 
SEN, are both HIGH, indicating traffic in both directions, the traffic 
controller should cycle, allowing equal periods of green signals 
(GRN, and GRN,) for both directions. This cycling is also done 
when SEN, and SEN, are both LOW, indicating no traffic. 





Allow traffic in direction 1: RED,, GRN, = H 


Allow traffic in direction 2: RED,, GRN, = H 
Cycle with equal durations in both directions 
Cycle with equal durations in both directions 





Figure 44. Table for Traffic Flow Direction 
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Whenever the signals change from direction 1 to direction 2, the 
appropriate yellow light (YEL, or YEL,) is turned on for the 
transition duration. This allows time for the traffic in the intersec- 
tion to pass, before allowing traffic in the other direction. The 
timing diagram is shown in Figure 45. When both sensors 
indicate the presence of traffic (SEN,, SEN, = H), extra time is 
allowed for traffic to pass, before the signals change the direction. 
The extra time is also allowed when there is no traffic in both 
directions (SEN,, SEN, =L). The timing diagram for this is shown 
in Figure 46. Finally, the traffic signal controller shown includes 
the system clock (CLK), and an initialize or reset signal (INIT). 
INIT drives the controller to an initial state. 


Before we proceed to the next step, we mustselect adevice which 
will implement this design. 


In the previous section we found that the basic selection of a state 
machine controller device depends upon the speed and I/O 
considerations. The PROSE PMS14R21 has eight inputs and 
eight outputs in addition to the initialization input signal PRESET, 
which are sufficient for this design. Another selection criterion is 
the intelligence of the PROSE device. The device should be able 
to implement all of the states required by thedesign. The PROSE 
device is the only 24-pin SKINNYDIP device which has the 
capacity for up to 128 states. Later we will see that these are 
sufficient for implementing our design. 





S, Ss S7 Sq So 
DIRECTION 2 ——® DIRECTION 1 


Figure 45. Fast Traffic Flow Transition 
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Figure 46. Slow Traffic Flow Transition 


State Machine Representation 


Once the design problem is defined, the next stage is to convert 
it to its state machine representation. State machines can be 
represented by bubble-and-arrow state diagrams (Figure 4). 
Each bubble represents a state, with each arrow representing a 
transition. A second method of state machine representation is 
with flow charts, details of which are explained on page 2-104. 
State diagrams are by far the most popular representation. 
Flowcharts are included for completeness; you need not worry if 
you are not comfortable with them. 


The transition from the present state to the next state is depend- 
ent upon present state and input conditions. Similarly, outputs 
can be generated based upon the present state (called a Moore 
type state machine), or present state and input conditions (called 
a Mealy type state machine). Ourdesign example is a Moore type 
state machine. The PALASM 2 syntax supports both Moore and 
Mealy type state machines. 


The first step for constructing a state diagram is to define states. 
All of the specific events in a design are assigned astate. These 
include: change of output signals, response to a change in input 
signals, or time delays. For example, in Figure 45, different states 
(S,, S, and S,) have been assigned where the outputs change. 
Often states with the same outputs can be merged into one 
unique state, as shown in the case of state S,. Once the states 
are defined as bubbles, the transition from one state to another 
(called control sequencing) and outputs generated for each state 
(output generation) are defined. The task is to use the timing 
diagram along with the functional description of the design to 


develop control sequencing and output generation functions for 
state representations. The state diagramis an improvement over 
the timing diagram since it also provide additional branch decision 
information. 


Each of the functions performed by a controller can be viewed as 
one of the following operations. We will show how each of these 
operations determine the control sequencing and the output 
generation for the state machine, and their state diagram and 
flowchart representations. 


* Arbitration 

¢ Multiple condition testing 
¢ Event monitoring 

* Contro! signal generation 
* Timing delays 


Arbitration is the decision-making process. It is inadequately 
represented in timing diagrams with arrows (Figures 45 and 46). 
The table in Figure 44 represents the functional requirement for 
arbitration. Based upon the sensor inputs, the state machine 
stays in the same state S,, when SEN|=H and SEN,=L (for 
allowing traffic in direction 1) or transitions either to state S, 
(SEN,=L, SEN,=H), or to state S, (SEN,, SEN, are both HIGH or 
LOW). This is represented in the state diagram and flowchart as 
the transition from S, (Figure 47). 


S, can also be thought of as an event monitoring state where the 
controller waits for the command from the sensors. Based upon 
the two bits SEN, and SEN,, the state machine moves to different 
next states. This is also an example of multiple condition testing. 
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a. State Diagram 
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b. Flowchart 
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Figure 47. Conditional State Transitions 


An example of contro! signal generation (Figure 48) is signal 
RED, which is used to turn on the red light for direction 1. This 
control signal generation example requires assigning output 
signal name(s) (RED,) to the state(s) (S,, S,, S, and S,) where it 
is asserted (see Figure 46). For simplicity, we are only consider- 
ing the outputs which are asserted. Other outputs in their default 
states must be accounted for in the final state machine represen- 
tation. 


The signal GRN, is kept asserted for two extra clock cycles 
(Figure 46) to allow the traffic to pass when changing the traffic 
pattern from direction 2 to direction 1. The state diagram repre- 
sentation (Figure 48) requires assigning unconditional transition 
(control sequencing) from one state to another, for a number of 
states (S,,S, and S,), depending upon the required time delay. It 
also requires assigning the signal name (GRN,) to all three states, 








407 3% 
a. State Diagram 
S 
RED 2, GRNy 
S; 
RED2, YELy 
407 41a b. Flowchart 


Figure 48. Output Assignment to States 


which results in signal assertion for the extra two clock cycles. 


Once the design has been converted into a state diagram (Figure 
49) or flowchart (Figure 50) representing all of its timing and func- 
tional requirements, the next stage involves conversion to state 
machine syntax for its textual representation. 


State Machine Syntax 


The PALASM 2 programming software design file (also called the 
PAL device Design Specification or PDS) allows both conven- 
tional Boolean equations and state machine design constructs. It 
also allows both Mealy and Moore types of designs with extensive 
logic minimization capabilities and easy menu-driven operation. 
Other programming software packages which provide similar 
design capabilities are also available from various vendors. 
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Figure 49. State Diagram—tTraffic Signal Controller 
APALASM 2 state machine design file consists of four sections: 


¢ Declaration section 

* State section 

* Condition section 

* Simulation section (optional) 


Declaration Section 


As illustrated in Figure 51, this section stores the basic informa- 
tion about the device type and pin names. It is also used to store 
some bookkeeping information such as the company name and 
design revision numbers. 


TITLE TRAFFIC CONTROLLER 
PATTERN STATE MACHINE 
REVISION 1 

AUTHOR J. ENGINEER 
COMPANY. MONOLITHIC MEMORIES 
DATE JANUARY 30, 1987 


CHIP S MACHINE PMS14R21 
CLOCK DCLOCK SEN1 SEN2 I2 I3 14 
I5 Ié I7 SDI GND 
RESET SDO RED1 YEL1 GRN1 RED2 
YEL2 GRN2 01 00 MODE VCC 


Figure 51. Declaration Section 


POWER UP 
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GRN,, RED, 









S 
GRN,, RED, 
S3 
YEL, RED, 








S, 
RED .GRN> 
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RED, GRN2 
Ss, 
YEL2, RED, 


Figure 50. Flowchart—Traffic Signal Controller 
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State Section 


The state section contains all the control sequencing and output 
generation information. The state section also stores all of the 
state initialization values and default parameters. As illustrated in 
Figure 52, it initially defines the type of state machine being 
designed with keywords MOORE_MACHINE or 
MEALY_MACHINE. For the PROSE device it also selects the 
use of pin 13 as either MASTER_RESET or OUTPUT_ENABLE. 
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It also defines the default output logic values as HIGH or LOW 
with the keyword DEFAULT_OUTPUT. Once an output is as- 
signed a default logic value, it is explicitly mentioned only when 
asserted; otherwise it is assumed to be in its default logic value. 
This usually simplifies the task of drawing a state diagram (see 
Figure 49). Similarly, the keyword OUTPUT_HOLD defines the 
outputs which retain their previous values unless explicitly men- 
tioned in anew state. The last two keywords also help isolate the 
user from the implementation details of J-K or D-type flip-flops 
when using PAL-device-based state machines. The keyword 
OUTPUT_HOLD is not used in this design. 


STATE 


MOORE _MACHINE 

MASTER_RESET 

DEFAULT_OUTPUT /RED1/YEL1/GRN1/RED2/YEL2/GRN2 
POWER_UP := VCC -> SO 


Figure 52. State Section 


Once the default parameters are defined, all of the state transi- 
tions are represented. Only state names are used, since the state 
values are assigned automatically by the software. Every state 
transition (Figures 53 and 54) can depend upon the present state 
for direct control sequencing (state_x := VCC -> state_y), or the 
present state and input conditions for conditional control se- 
quencing (state_x := Condition_name -> state_y). The syntax 
has a direct relationship to the state diagrams. For conditional 
control sequencing, the syntax also allows a default state when 
none of the conditions of transitions to other states are satisfied. 


The output generation can also be represented easily. Outputs 
are specified from each present state. These outputs can be 
Moore type (state.OUTF := output_name) or Mealy type 
(state.OUTF := Condition_name -> output_name), depending 
upon whether or not conditions are used to generate outputs. 





STATE_B 


STATE_A‘= Vog-> STATE_B 
a. Direct Control Sequencing 


STATE _A 
0; 


STATE_C:= 1; -> STATE_D 
+l, -> STATE_E 


STATE_B 
0, 


STATE_A.OUTF:=0, 
c. Moore Machine 
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b. Conditional Control Sequencing 
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STATE_C:=ly -> STATE_D 
+l, -> STATE_E 


b. Conditional Control! Sequencing 


ord) 


‘hy 


STATE_B rate 8) 
2 


STATE_A.OUTF:= 0, STATE_AOUTF:=1, 0, 
c. Moore Machine d. Mealy Machine 


STATE_A‘= Voc -> STATE_B 


a. Direct Control Sequencing 


one 
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Figure 53. Direct Relationship Between State Diagram 
and State Syntax 


Condition Section 


This is the third section of the PALASM 2 state machine file 
structure. This section assigns names to allofthe input conditions 
used by the state section for both control sequencing and output 
generation. 


Design Example Syntax 


All of the state machine tasks represented by the state diagram 
can be converted easily to textual format. The transition from 
state S, to S, and S, (Figure 55) illustrates how outputs RED, and 
GRN, are generated. These represent control signal generation 
and timing delay tasks of a state machine. Similarly, the arbitra- 
tion function can be converted to textual format, showing the 


STATE _E 


STATE _B 


STATE_A.OUTF:=!1-> 0, 
d. Mealy Machine 





Figure 54. Direct Relationship Between Flowchart and State/Condition Syntax 
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transition from S, to various other states (Figure 56). This is also a Simulation Section 
good example of multiple condition testing and event monitoring. 
The simulation section is similar to that used for conventional 


SO.OUTF := RED2 * GRN1 Boolean equation simulation. Please see page 4-137 for details. 
S1.OUTF := RED2 * GRN1 

S2.OUTF := RED2 * GRN1 Assembly and Programming 

S3.OQUTF := RED2 * YEL1 


Once the state diagram is converted to its textual constructs, the 
file (Figure 57) can be easily assembled by using PALASM 2 
software. The software then generates the device programming 


Figure 55. Output Generation Representation 


oar re pes < se JEDEC format file. This JEDEC format file can be downloaded to 
+ Cl => 53 a device hardware programmer provided by various vendors. For 
een detail please see page 4-169 of the PALASM 2 software section. 


Figure 56. Control Sequencing Representation 


Traffic Signal Controller 


TITLE TRAFFIC CONTROLLER 
PATTERN STATE MACHINE 
REVISION 1 

AUTHOR J. ENGINEER 
COMPANY MONOLITHIC MEMORIES 
DATE JANUARY 30, 1987 


CHIP S_MACHIN PMS14R21 
CLOCK DCLOCK SEN1 SEN2 I2 I3 14 
I5 16 I7 SDI GND 
RESET SDO RED1 YEL1 GRN1 RED2 
YEL2 GRN2 01 00 MODE vcc 


STATE 
MOORE_MACHINE ; Defined as a Moore Machine 
MASTER_RESET ; Initialization 


DEFAULT_OUTPUT /RED1/YEL1/GRN1/RED2/YEL2/GRN2 
POWER_UP:=VCC -> SO ; Power up state defined 


SO := C3 -> Sl Traffic in direction 1 


+ CO -> Sl 7 Slow signal change 

+ Cl -> S3 ; Fast signal change 

+-> SO ; Otherwise stay in state SO 
S1 s= VCC -> S2 ; Time delay 

S2 := VCC -> $3 ; Time delay 

S3 := VCC -> S4 ; Time for yellow 

S4 := C3 -> SS 3; Traffic in direction 2 

+ cO -> SS 3 Slow signal change 

+ C2 -> $7 ; Fast signal change 

+-> S4 ; Otherwise stay in state SO 
SS := VCC -> S6 ; Time delay 

S6 := VCC -> S7 ; Time delay 

S7 := Vcc -> SO ; Time for yellow 


Figure 57. The Design File 
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SO.OUTF := GRN1 * RED2 
S1L.OUTF := GRN1 * RED2 
S2.OUTF := GRN1 * RED2 
S3.OUTF := YEL1 * RED2 
S4.OUTF := RED1 * GRN2 
S5.OUTF := RED1 * GRN2 
S6.OUTF := RED1 * GRN2 
S7.OUTF := RED1 * YEL2 


CONDITIONS 

CO = /SEN1 * /SEN2 
Cl = /SEN1 * SEN2 
C2 = SEN1 * /SEN2 


C3 = SEN1 * SEN2 


SIMULATION 


; Allow direction 1 traffic 


Change from direction 1 to 2 
Allow direction 2 traffic 


se Ne 


; Change from direction 2 to 1 


Condition for no traffic 
Condition for direction 2 
Condition for direction 1 
Condition for traffic in both directions 


Se Se Me Ne 


TRACE_ON CLOCK SEN1 SEN2 RED1 YEL1 GRN1 RED2 YEL2 GRN2 


SETF RESET /CLOCK 
CLOCKF CLOCK 

CLOCKE CLOCK 

CHECK /RED1 /YEL1 GRN1 
SETF /SEN1 /SEN2 
CLOCKF CLOCK 

CLOCKF CLOCK 

CHECK /RED1 /YEL1 GRN1 
CLOCKF CLOCK 

CHECK /RED1 YEL1 /GRN1 
CLOCKF CLOCK 

CHECK RED1 /YEL1 /GRN1 
CLOCKF CLOCK 

CHECK RED1 GRN2 

CLOCKF CLOCK | 

CHECK RED1 YEL2 

CLOCKF CLOCK 

CHECK /RED1 /YEL1 GRN1 
SETF /SEN1 SEN2 
CLOCKF CLOCK 

CHECK /RED1 /YEL1 GRN1 
CLOCKF CLOCK 

CLOCKF CLOCK 

SETF SEN1 /SEN2 
CLOCKF CLOCK 

CLOCKF CLOCK 

CHECK YEL2 RED1 

CLOCKF CLOCK 

CHECK GRN1 RED2 


TRACE_OFF 


;STATE TRANSITION ONLY ON 1ST CLOCK 
;LIGHTS CHANGE ON 2ND CLOCK 
/YEL2 /GRN2 RED2 


RED2 /YEL2 /GRN2 
RED2 /YEL2 /GRN2 


/RED2 /YEL2 GRN2 


RED2 /YEL2 /GRN2 


RED2 /YEL2 /GRN2 


Figure 57. The Design File (Cont’d.) 
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Microprocessor-Based Systems 








Introduction 


The microprocessor has become the standard processing 
element in a wide variety of digital systems. Applications ranging 
from ignition control in automobiles to high performance engi- 
neering workstations utilize microprocessors. Usually a micro- 
processor with its associated VLSI components (the dynamic 
RAM, EPROM and standard microprocessor peripherals) can 
implement most application problems. The microprocessor's 
general-purpose nature, ease of use, and cost-effectiveness 
makes it the device of choice in all but the most performance- 
intensive or specialized applications. 


All the designer needs is the necessary “glue logic” to connect 
these VLSI components together. A typical microprocessor 
based design is illustrated in Figure 1. The heart of the design is 
the microprocessor. Typically, DRAMs provide the data storage 
and EPROMs provide the non-volatile program storage. De- 
pending onthe application such standard microprocessor periph- 
erals as DMA controllers, serial 1/O or disk controllers are usually 
present. Glue logic is needed to connect these devices together 
to create a working system. For example, memories and periph- 
erals need to be selected based on the desired address range, 
afunction called “address decoding”; high-speed address strobe 
signals RAS and CAS must be generated for the DRAMs; wait 
states must be generated for the slower memory and peripheral 
chips; arbitration and prioritization between different bus masters 
(system components which control the bus transactions: DMA 
and CPU) is necessary in multi-master systems. 
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Programmable Logic Devices as 
Standard Product Glue 


The typical microprocessor system shown in Figure 1 highlights 
several of the places where programmable logic is used to 
implement the “glue” in a microprocessor system. These simple 
random logic and control-oriented functions are necessary to 
hold the standard products together to construct a complete 
system; thus the name “glue”. The common glue functions in a 
microprocessor system share several application needs, andare 
best satisfied by programmable logic devices. 


Microprocessor glue functions compete for board space with LSI 
devices in almost allcases. If the glue logic uses too much board 
space, the design will need to sacrifice features (reducing the 
design’s value to the customer) or add extra boards (increasing 
the cost). In either case the cost/value ratio increases, which is 
undesirable. Programmable logic devices can reduce the board 
space required for glue functions because of the density advan- 
tage the devices have over standard SSI/MSI devices. This 
makes more board space available for standard VLSI devices or 
customized peripherals. 


Since the microprocessor glue functions tend to be control- and 
timing-oriented, they are subject to many changes as the board 
design evolves, processor speeds increase, or new standard 
products are incorporated in the design. In complex systems, 
logic functions are commonly “tuned” to optimize bus band-width, 
peripheral access time, or memory timing only after the system is 





WAIT 
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ADDRESS 
DECODER 
CONTROL 

GENERATOR 







STANDARD 
PERIPHERAL 








Figure 1. Typical Microprocessor System 
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running. The optimum balance is usually too difficult to predict 
and can only be found via experimentation. Because of their 
programmable nature, PLDs offer the designer the capability of 
changing the logic at any time. Thus designs can quickly adjust 
to changing requirements, or can be improved without adding 
components or changing the circuit board. 


The address decoding, timing, and simple state machines are 
“shallow” and “wide” logic functions. Shallow logic functions need 
only a few levels of gating at a high speed. The standard AND- 
OR array architecture offers 3 to 5 levels of gating (inverter, AND- 
gate, OR-gate, flip/flop-master and flip/flop-slave) at very high 
speed (10 to 18ns). Wide logic functions require a large number 
of inputs on each gating structure. For example, address decode 
functions may need over 10 inputs to decode a microprocessor 
address and an 8-bit refresh counter would require over 8 inputs 
for the most significant bit. The wide-input structure of PLDs, 
where practically all signals are available (in both polarities) at 
each gating structure, is a good match for this application 
requirement. 


PLDs offer the designer several important charcteristics which 
match well with the needs of glue logic functions. Some common 
application areas will help illustrate the advantages of PLDs in 
microprocessor glue applications. 










PROGRAMMABLE 
LOGIC 
ATTRIBUTES: 









GLUE LOGIC NEEDS: 











¢ Minimum Board Space 

* Easy and Frequent 
Design Changes 

* High Speed Simple 

Logic Functions 


+ High Logic Density 

* Quick and Easy Device 
Modifications 

* Optimized for Fast 

Shallow Logic 











Figure 2. Table for Glue Logic Needs and Programmable 
Logic Attributes 
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Address Decoding 


The single most common application of combinatorial PAL de- 
vices is address decoding (Figure 3). In order for a microproces- 
sorto communicate with a peripheral or memory chip achip select 
signal must be generated by the glue logic. Usually acombination 
of address signals and control signals are required to generate 
the chip select for microprocessor memory and peripheral de- 
vices. For example, in the 68000 microprocessor some of the 
address signals, the address strobe signal, and the read/write 
signal must participate in the address decode function. The logic 
needed to generate the chip select is many times simply aNAND 
of the true or complement of the necessary address and control 
inputs. 


In high-performance microprocessors, the chip select must be 
generated fast enough to allow single cycle access (no wait 
states) to memory. PLDs provide the high-speed wide-gating 
logic functions necessary to generate these chip select signals. 
Additionally, the addresses for peripherals and memory devices 
are often not known at the beginning of the design or may change 
many times during the design. PLDs can be easily changed to 
adjust to such modifications. More on address decoding is on 
page 2-35, 


Wait-State Generation 


Wait-state generation is in many ways related to the address 
decode function (Figure 4). After the addressed chip is selected, 
the CPU must be notified that the data transfer can be completed. 
The signal to the CPU which identifies the end of a data transfer 
may need to be extended if a slow device is being accessed. 
DRAMs, EPROMs, EEPROMs, and microprocessor peripherals 
may all need different delays depending on their access time bus 
bandwidth requirements and frequency of access. Usually there 
is atleast afast access (no wait-states) a slower access (one wait- 
state) and a longer access (several wait-states), The wait state 


TIMING 
CONTROL 


J REALTIME 


CLOCK 


BATTERY 


i EEPROM 
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Figure 3. Address Decoding Application Example 





2-132 


an fAonolithic Rar FAemories al 


Microprocessor-Based Systems 








generation logic must create a variable delay depending on the 
device being presently accessed. Usually the delay only depends 
onthe present access and is independent of previous accesses. 


Aregistered PAL device has the capability of implementing the 
simple state machine needed for wait-state generation. Addition- 
ally, the PLD contains enough logic to provide for a variety of 
memory, peripheral and processor speeds, without requiring 
additional board space. Thus a single design can have an 
extended lifetime, or provide a quick time to market when faster 
or lower-cost processors, peripherals and memories are 
available. 


DRAM Control 


DRAM control logic consists of several distinct functions: address 
multiplexing, RAS/MS/CAS generation, refresh request genera- 
tion, and memory request arbitration. Address multiplexing, as 
the name implies, involves multiplexing the upper and lower 
processor addresses to the DRAM chips. RAS/MS/CAS genera- 
tion involves the generation of the three separate DRAM timing 
signals: Row Address Strobe, Column Address Strobe, and Mux 
Select (on the address multiplexer). Tight timing relationships 
between the address multiplexer and the RAS/MS/CAS logic is 
necessary in high-performance systems. Refresh request gen- 
eration logic is used to time the interval between DRAM refresh 
cycles. The logic is usually a wide counter using a slow clock. 
Memory request arbitration logic decides whether the refresh 
request will be granted or a memory access will be allowed. 


PLDs have a variety of characteristics which make them well 
suited for DRAM control applications. High speed and minimum 
skew specifications are important considerations for address 
multiplexing, RAS/MS/CAS generation and refresh arbitration in 
high performance designs. High-speed PLDs have significantly 
tighter skew specifications than the equivalent multi-device SS1/ 
MSI solutions. This makes PLD-based DRAM control applica- 
tions easier to design and higher performance than SSI/MSI 
based designs. Detail on DRAM designs is provided on 
page 2-179. 
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Microprocessor Peripheral Interface 


Microprocessors are usually connected to a number of VLSI 
peripherals performing various ancilliary functions. In many 
cases a design calls for the use of “incompatible” microprocessor 
and VLSI peripherals. Cost, performance, availability, backwards 
compatibility, or other constraints sometimes dictate that different 
families of devices need to coexist in the same design. Glue logic 
is necessary to allow these incompatible devices to communi- 
cate. Sometimes logic signals need to be translated. The most 
obvious case is probably the different way a processor read or 
write cycle is encoded. One popular processor uses two separate 
read and write signals (/R or /W) with the active state indicating 
the presence of a valid address. Another popular processor uses 
an “address valid” signal to indicate the presence of an address 
and a single read/write signal to indicate the type of access. 
Peripherals hooking onto the other processor's bus will need 
some simple logicto translate the meaning of these signals. Often 
signal timing is also different between families. This requires 
signals to be delayed or qualified by glue logic. 


The two most popular processor families for high performance 
designs are the 8086/80186/80286 family and the 60000/68010/ 
68020 family. Each processor has its own peculiarities which the 
designer must take into account when hooking up a peripheral 
device. Subsequent discussion covers several applications 
examples for each family. Finally, PLDs can also be used to make 
custom microprocessor peripherals. An application example of a 
PLD-based interrupt controller is shown on page 2-172. 


Conclusion 


PLDs offer the designer several important characteristics which 
match well with the needs of glue logic applications. The following 
application examples will illustrate the PLD’s versatility at imple- 
menting microprocessor glue functions. 


TIMING 
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Figure 4. Wait State Generator Application Example 
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Interfacing to the 8086/80186/80286 





Overview 


The 8086 is a general-purpose 16-bit microprocessor CPU. 
The CPU has a 16-bit data bus multiplexed with 16 address 
outputs. There are 4 additional address lines (segment ad- 
dresses which are multiplexed with STATUS) that increase the 
memory range to 1 Mbyte, 8086 addresses are specified as 
bytes. In a 16 bit word, the least significant byte has the lower 
address and the most significant byte has the higher address. 
This is compatible with 8080, 8085, Z80 and PDP 11 address- 
ing schemes but differs from the Z8000 and 68000 
addressing. 


The data bus is “asynchronous”, i.e., the CPU machine cycle 
can be stretched without clock manipulation by inserting Wait 
states between t2 and t3 of a read or write cycle to accom- 
modate slower memory or peripherals. Unlike the 68000, the 
8086 has separate address space for I/O (64 kBytes). 


The 8086 can operate in MIN. or MAX. mode. Maximum mode 
offloads certain bus control functions to a peripheral device 
and allows the CPU to operate efficiently in a co-processor 
environment. A brief discussion on both the MIN. and MAX. 
modes are as follows: 


MIN. mode: \/O addressing is defined by a HIGH or 
the 1O/M output, and activated by the RD 
output for reading from memory or I/O, or 
activated by the WR output for writing to 


memory or I/O. 


OMA: The Bus is requested by activating the HOLD input to 
the 8086. Bus Grant is confirmed by the HLDA output 
from the 8086. 
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MAX. mode: /O operation is controlled by two outputs 
(8086 plus from the 8288. 
8288) 1ORC: active during Read from I/O 





IOWC: active during Write to /O 
MRDC: active during Read from memory 
MWTC: active during Write to memory 








DMA: The Bus is requested and Bus Grant is acknowledged 
on the same pin (RQ/GTO OR RQ/GT1) through a 
pulsed handshake. 


Interrupts in Min. and Max. Modes: 


Interrupt is requested by activating the INTR or NMI inputs to 
the 8086. 


Interrupt is acknowledged by the INTA pin on a MIN. mode 
6086 or by the INTA pin on the 8288 in MAX. mode. 


Note: There is no RD or IORC during the interrupt- 
acknowledge sequence. 
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8086 and Am7990 LANCE 


The LANCE, Am7990, has been designed to be interfaced 
easily with the popular 16-bit microprocessors (8086/80186, 
68000, 28000, LSI-11). Most of the interface logic is embedded 
inside the chip and is program selectable. 


Although the LANCE itself has a multiplexed bus, it can easily 
be interfaced to demultiplexed buses with a minimal amount of 
effort. The following designs assume that the processor and 
the LANCE reside on the same board. Address buffers and 
data transceivers are set up to be shared between the proces- 
sorand the LANCE. All of these designs use PAL devices to 
reduce the parts count. 


The 8086 to LANCE interface requires a different Bus Request 
handshake, depending on whether the 8086 is configured in 





MAX. or MIN. mode. The 8086 has a bidirectional signa! for 
both Bus Request and Bus Grant (RQ/GT). Both Bus Re- 
quest (RQ), and Bus Grant (GT) to/from 8086 are one CPU 
clock wide, and are synchronous to the CPU clock. Figure 1 
shows a PAL device design for the conversions in MAX. 
mode. This PAL device is utilized to include other external 
logic requirements for interfacing the LANCE to 8086. The 
interface diagram is similar to the one for the 80186 to 
LANCE interface except for the changes made in program- 
ming the PAL device. The interface timing diagram is shown 
in Figure 3. The PAL device design file is shown in Figure 2. 


Figure 4 shows a block diagram of the 8086 to Am7990 
interface, in MIN. mode. The interface also employs a PAL 
device to minimize parts count. The PAL device equations 
are given in Figure 5. 


rrr hs 


Figure 1. 8086 RG/GT, Am7990 HOLD/HOLDA Conversion PAL Device 
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8086 and Am7990 LANCE Interface 


TITLE 8086 RO/GT TO LANCE HOLD/HLDA CONVERTOR 
PATTERN PATOO1 

REVISION 01 

AUTHOR RASOUL OSKOUY 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/06/87 


CHIP CONVERT PAL16R4 


CLK NC HOLD /GT NC NC NC NC NC GND 
NC NC NC /HLDA /D2 /R2 /R1 NC NC VCC 
EQUATIONS 
Ri := HOLD 78086 ROQ/GT conversion to LANCE HOLD/HLDA 
D2 := /R1 ;both REQ and GRANT are one clock wide and 
;are synchronous to CPU clock 

R2 := R1*/D2 

+ /R1*D2 
HLDA := GT*/R2 

+ HLDA*/D2 


Figure 2. 8086 RG/GT, Am7990 HOLD/HLDA Conversion PAL Device Design File 


TITLE 


8086 MIN MODE TO LANCE INTERFACE 


PATTERN PAT 

REVISION 01 

AUTHOR KHUYNH NGUYEN 

COMPANY ADVANCED MICRO DEVICES 


DATE 


11/06/87 


CHIP LANC_INT PAL16L8 


ALE /AS DTR NC NC DEN NC /READY HLDA GND 
NC CPURDY READ /R /T /DAS /WR /RD LE VCC 


EQUATIONS 


/LE = 
+ 


/ALE 
/AS 


/CPURDY = /READY ; CPU (86) is a bus master 


DAS = 


RD + WR 


DAS.TRST = /HLDA 


/READ 


/READ. 


= DTR 
TRST = /HLDA 


T = DTR 


T.TRST 


= /HLDA 


R = /DTR * DEN 


R.TRST 


= /HLDA ; LANCE is a bus master 


RD = READ * DAS 
RD.TRST = HLDA 
WR = /READ * DAS 
WR.TRST = HLDA 


Figure 5. Source Listing for Example of Figure 4 
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CLK 
(INPUT) 
HOLD | 
(INPUT) 
Ri (OUTPUT) 
D2 (OUTPUT) 
R2 (OUTPUT) DMA ” 
RQGT (INPUT) nN | CPUB 


DMA i; 
US 
ACKNOWLEDGE C. 
HLDA (OUTPUT) | 
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Figure 3. AmPAL16R4, 8086 (Max. Mode) LANCE Interface Timing Diagram 
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Figure 4. Am7990 to 8086 Interface 








an Monolithic HR Memorles a" 2-137 


2-138 


8086 and Am9516 Universal DMA 


Controller Interface 


Am9516 in MIN. Mode 


Figure 1 illustrates the interface of the Am9516 to the 8086 in 
the MIN. mode configuration. Figure 3 illustrates the interface 
in MAX. mode. The interfaces could be accomplished by 
using rather complex implementation of standard SSI/MSI 
logics. Examples here replace the logic portion with a PAL 
device. The MIN. Mode uses the PAL device 16L8. This is a 
good example of “garbage collection”. It reduces the amount 
of real estate, interconnections, parts, and part types. 





Both interface examples accomplish two major functions. First, 
when the Am9516 is bus master, it converts RD and WR into 
R/W and DS, and vice versa when the Am9516 is not the bus 
master. Secondly the buffer controls, TBEN and REBN, are 
generated from DEN and DT/R . 


The two examples show different types of latches and 
transceivers and there are many more to choose from. The 
designer selects those that best meet system requirements 
while trying to minimize the number of different parts that must 
be stocked. Figure 2 shows the PAL device equations for this 
example. 
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Figure 1. The Am9516 UDC to 8086 CPU Interface (Minimum Mode) 





al Monolithic Ta Memories zt 


8086 and Am9516 Universal DMA Controller Interface 





TITLE Am9516 TO 8086 MIN MODE INTERFACE 
PATTERN 01-3.66 

REVISION 01 

AUTHOR JOE ENGINEER 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/05/87 


CHIP DMA_INTFCE PAL16L8 
;Am9516 to 8086 min mode interface chip 


NC ALED ALEP HLDA BW ADO DT /DEN /SEL GND 
NC /RBEN /RD ALE AO /RW /DS /WR /TBEN VCC 


EQUATIONS 


DS = RD + WR 
DS.TRST = /HLDA 


RW = DT 
RW.TRST = /HLDA 





TBEN = /DT * /SEL * DEN 
TBEN.TRST = /HLDA 


RBEN = DT * /SEL * DEN 
RBEN.TRST /HLDA 


I 


RD = /RW * DS 
RD.TRST = /HLDA 


WR = RW * DS 
WR.TRST = /HLDA 


ALE = /ALEP * /ALED 

AO = /ADO * /BW * HLDA * ALED 
+ ADO * BW * HLDA * ALED 
+ /ADO * /HLDA * ALEP 
+ AO * /ALEP + AO * /ALED 


; DESCRIPTION 


>ITHIS PAL DEVICE CONVERTS THE CONTROL SIGNALS TO INTERFACE THE 4046 
;IN MIN MODE TO THE Am9516 DMA CONTROLLER. ANOTHER EXAMPLE SHOWS 
;HOW THIS IS DONE IN MAX MODE. 


; SIMULATION NOT INCLUDED 


Figure 2. Source Listing for Example of Figure 1 
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Am9516 in MAX. Mode 


This MAX. mode interface between the 8086 and Am9516 
(Figure 3) also uses PAL devices to reduce component 
count. 


The example makes several assumptions which result in a 
slightly more complex design than absolutely necessary. The 
8086 is assumed to be in MAX. Mode, and the design has to be 
compatible with a MULTIBUS or similar interface; the 16R4 
and 8288 could be eliminated if this is not the case. 


The 16R6 (Figure 4) is a PAL device that performs a function 
similar to the 8288, that is, it converts the processor's status 
signals and clock into control signals. In this case, the signals 
are R/W, Data Strobe (DS), Interrupt Acknowledge (INTA) 
and Peripheral Acknowledge (PACK). This PAL device 
basically generates ZBUS signals for Zilog peripherals. 
In the example, it connects to the Am9516 (UDC). The DMA 
Controller is very similar to the AmZ8016 except its bus inter- 
face has been modified to interface to non-multiplexed 
buses. This was changed from the previous design using a 
16R8 to eliminate problems due to skew between OSC 
and CLK. 


The 16R4, Figure 5, has two functions. The first is connecting 
the MIN. Mode protocol! of the Am9516 or similar device to 


AmPAL16R6 








the MAX. Mode protocol for bus exchange. The 74LS03s are 
used to aid in this function. The timing waveform illustrates 
what happens in detail. The basic philosophy is that a rising 
edge on the HOLD input generates a pulse that is one clock 
wide. The CPU samples this pulse and, in response, issues a 
one-clock-wide pulse. The 16R4 uses this response pulse to 
generate HLDA. When the Am9516 has completed the nec- 
essary transfer, HOLD transitions HIGH to LOW. This gener- 
ates another pulse to the CPU signalling that the Am9516 is 
done and that the CPU may continue. 


The second function of the 16R4 is the conversion of R/W, 
DS and M/IO into the MULTIBUS-compatible signals 
/MRDC, /MWTC, /IORC, and /IOWC, when HLDA is HIGH. It is 
possible to collapse the 74LS03s into the PAL device, thus 
reducing the external logic required to only one open collector 
inverter. The disadvantage, however, is that it adds two addi- 
tional clocks for the bus exchange overhead, one during ac- 
quisition of the bus and one on bus release. For block 
transfers, this is not significant but it may be undesirable when 
performing single transfers, or short burst, or when in Demand 
Mode. To eliminate the 74LS03, change the equation for R2 to 


/R1 + /D2 + R1 «+ D2, 


and then drive the /[RQ/GT] line with a 74LS05 from the R2 
output. 


K 
HOLD 


HLDA 
PAL16R4 
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Figure 3. 
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TITLE 8086 TO 85XX PERIPHERAL INTERFACE 
PATTERN 01-3.68 

REVISION 01 

AUTHOR JOE BRCICH 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/06/87 


CHIP DMA_INTRR PAL16R6 


CLOCK RESET CLK /S0 /S1 /S2 /LOCK NC NC GND 





/OE /AS /P1 /RW /DS /PO /IACK /RDY CLKD vcc 
EQUATIONS 
PO := /RESET * SO * /PO * /P1 
+ /RESET * S1 * /PO * /P1 
+ /RESET * S2 * /PO * /P1 
+ /RESET * SO * Pl 
+ /RESET * S1 * PL 
+ /RESET * S2 * Pl 
Pl := /RESET * /PO * /P1 
+ /RESET * Pl * SO 
+ /RESET * Pl * S1 
+ /RESET * Pl * S2 
DS := /IACK * /PO * Pl * S00 * /S1 * S2 


+ /IACK * /PO * Pl * /SO * Sl * S2 

+ IACK * SO * S1 * S2 * PO * /P1 * LOCK 
+ DS * SO * Sil * S2 

+ DS * SO * /S1 * S2 

+ DS * /SO * Sl * S2 


RW := SO * /S1 * S2 


IACK := /RESET * SO * Sl * S2 * /PO * /P1 * /LOCK 
+ /RESET * IACK * SO * S1 * S2 * PO * /P1 * /LOCK 
* 
* 


+ /RESET IACK * LOCK * /DS 
+ /RESET IACK * /LOCK * DS * /PO * Pl 
RDY := /RESET * ‘SO * /S1 * S2 * PO * Pl 
+ /RESET * /SO * S1 * S2 * PO * Pl 
+ /RESET * RDY * SO * /S1 * S2 
+ /RESET * RDY * /SO * S1 * S2 
+ /RESET * IACK * SO * S1 * S2 * DS 
+ /RESET * RDY * SO * S1 * S2 


/CLKD = CLK 
AS = /CLKD * PO * /P1 * /IACK * CLK 
7 DESCRIPTION 


*;THIS PAL DEVICE TRANSLATES 8086 BUS SIGNALS INTO COMPATIBLE SIGNALS FOR 
?;THE 9516. IT IS ALSO APPLICABLE TO 85XX PERIPHERALS BY ALTERING /RW AND 
7/DS TO /RD AND /WR. ONE FLIP-FLOP IS AVAILABLE TO GIVE THE NECESSARY 
;DELAY TO THE FALLING EDGE OF /WR. THE DATA STROBE TIMING FOR A WRITE 
7;CYCLE IS DELAYED UNTIL THE FALLING EDGE OF T2 TO MEET THE REQUIREMENTS 
;OF THE 85XX PARTS. THIS DESIGN ASSERTS RDY TO DEMAND ONE WAIT STATE 
7FROM THE 8086. THIS WAIT STATE IS NOT LONG ENOUGH FOR DESIGNS WHICH USE 
7;AN 8 MHz 8086. THEREFORE, WITH AN 8 MHz CPU, 85XX PERIPHERALS SHOULD BE 
;USED. AS AN ALTERNATIVE, THREE WAIT STATES CAN BE USED BY ALTERING THE 
;RDY EQUATION. THIS PAL DEVICE ALSO TRANSFORMS THE 8086 TWO-CYCLE 

7 INTERRUPT ACKNOWLEDGE INTO A SINGLE CYCLE OF THE TYPE NECESSARY FOR 85Xx 
*;PARTS. THIS IS MADE POSSIBLE BY SAMPLING THE LOCK STATUS, PO, Pi, AND 
7;IACK SIGNALS. 


Figure 4. Source Listing for Example of Figure 3 
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TITLE Am9516 TO 8086 INTERFACE 
PATTERN 01-3.67 

REVISION 01 

AUTHOR JOE ENGINEER 

COMPANY ADVANCED MICRO DEVICES 
DATE 11/06/87 


CHIP DMA_INTFCE2 PAL16R4 
;8086 to Am9516 interface 


CLK /RQGT HOLD NC NC NC /RW /DS MIO GND 
/OE /MWTC /MRDC HLDA /D2 /R2 /R1l /IOWC /IORC VCC 


EQUATIONS 


IORC = /MIO * DS * /RW 
IORC.TRST = HLDA 


IOWC = /MIO * DS * RW 
IOWC.TRST = HLDA 


MRDC = MIO * DS * /RW 
MRDC.TRST = HLDA 


MWTC = MIO * DS * RW 
MWTC.TRST = HLDA 


Rl := HOLD 

R2 := /R1l 

D2 := Rl 

/HLDA := /R1l + /D2 * /HLDA + 
/RQGT * /HLDA 

; DESCRIPTION 


;THIS DEVICE CONVERTS THE MIN MODE SIGNALS HOLD AND HLDA TO THE 
>MAX MODE RQGT PROTOCOL. ADDITIONALLY IT GENERATES THE 82884 
;EQUIVALENT CONTROL OUTPUTS MRDC, MWTC, IORC, AND IOWC. THIS 

;PAL DEVICE WAS USED TO CONNECT THE Am9516 TO THE 8086 IN MAX MODE. 


; SIMULATION NOT INCLUDED 


Figure 5. Source Listing for Example of Figure 3 
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80286 to Am9568 Data Ciphering 


Processor Interface 


80286 to Am9568 Data Ciphering 
Processor Interface 


This interface is designed for an 8-MHz CPU where the Data 
Ciphering Processor (DCP) is synchronously operating at the 
maximum clock rate of 4 MHz. A block diagram for the inter- 
face is shown in Figure 1. The Am9568 requires a narrower 
width of address strobe than the Am9518. This works comfor- 
tably with the 60-ns address strobe width of an 8-MHz CPU. 


The MULTIBUS Mode Select input of the Bus Controller 82288 
is tied LOW to optimize the command and control signals for 
short bus cycles. The Command Delay (CMDLY) becomes 
active-HIGH for one 16-MHz clock cycle whenever the DCP is 
selected to delay the Read and Write strobes by 125 ns. This 
satisfies the timing requirement of the minimum delay between 
ALE inactive and Read or Write strobe active of the DCP. An 
open-collector gate must be added to allow other peripherals to 
drive this input. 


The ALE, lORC and |OWC outputs of the 82288 are wired 
directly to the DCP. ALE strobes a D-Flip-Flop to store the state 
of Chip Select for the entire cycle. 


Q3; and the latched Chip Select CSL are ANDed externally to 
generate the Synchronous Ready for the 82284. The 82284 
samples the line at the falling edge of the clock. The registered 
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output Q3 is clocked with the rising edge of the same clock, 
thus satisfying the setup and hold time requirements of the 
82284. Two Wait States are inserted. 


Half of the PAL device operates as a bidirectional Ad- 
dress/Data Multiplexer. During the Address Latch Enable ac- 
tive phase, the state of Aj and Ao is transferred to the AD; and 
ADo pins of the PAL device. The DCP latches this two bit- 
address with the falling edge of ALE. 


When IORC and CSL are active, the states of AD; and ADs 
are passed to D, and Do respectively. The DCP Register can 
be read. If IOWC and CSL are active, the data path is turned 
around: Dy and Do are inputs, AD; and ADo are outputs. 
The Address Hold Time of the PAL device is sufficient because 
the address information is passed to AD; and AD2 whenever 
IORC*CSL or IOWC*CSL are not true, i.e. whenever data is 
not transferred between the CPU and the DCP. 

The Read Data Hold Time requirement of 5 ns of the Am9568 
is satisfied by the propagation delay of the PAL device. 

The Read Data Hold Time requirement of 5 ns of the iAPX286 
is also satisfied by the PAL device. 

The Master Port Chip Select (MCS) input of the DCP is con- 
nected to the unlatched address decoder output. 


Figure 3 shows the source listing and the pin descriptions. 
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80286 to Am9568 Data Ciphering Processor Interface 








02188A-43 


Figure 1. 80286 to Am9568 Interface 


The DCP Clock 


The PAL device synchronizes the DCP clock to the Data 
Strobes IORC and IOWC (Figure 2). It also divides the 
16 MHz system clock (8 MHz CPU clock) down to the maxi- 
mum DCP clock rate of 4 MHz. At this clock rate, the Data 
Strobe Delay to the DCP clock must be 0-30 ns. The Bus 
Controller is specified to generate a Data Strobe timing of 
3-15 ns to the falling edge of CLK (16 MHz). Because of the 
higher propagation delay of a standard PAL device, the regis- 
tered outputs are toggled at the rising edge of CLK before the 
Data Strobes become inactive. This gives an additional 
32.5 ns for the DCP clock signal path. 


Q, to Q3 are three outputs of the PAL device state machine. 
The registered outputs are clocked with the rising edge of the 
16-MHz 82284 clock. Whenever ALE and CS are active, Q; to 
Q3 are set to the initial state. Qy to Qg are outputs of a 3-bit 
down counter, with Q3 as the most significant bit. 


Q3 is used to generate the SRDY signal for the 82284 as 
mentioned above. 








Q, is the DCP clock. This design must guarantee that the 
minimum DCP clock HIGH or LOW time is at least 115 ns or 
two 16-MHz clock cycles. This is done by toggling Q2 only 


during phase 2 cycles of the CPU. The CPU design guarantees 
that there is always a phase 1 cycle between two phase 2 
cycles. 


Assuming a typical PAL device propagation delay of 25 ns, 
timing parameter tCDS (Time Clock Data Strobe) is 10.5 to 
22.5ns(3 + 32.5 — 25nsto 15 + 32.5 — 25ns). This satisfies 
the 0 to 30 ns requirement. 


The AmPAL16R4 has active-LOW outputs. But one output, Qo, 
should be active-HIGH. The equation for Q2 was derived to be 


Q, =ALE* CS +Q,*Q,+0,*Q, 
To compensate for the inversion in the PAL device, the 
PALASM software minimizer (or 16RP4) can be used to 


convert it to the form shown in the PAL device Design Speci- 
fication. 


Improvements 


The DCP needs two Wait States only when the Control Reg- 
isters are read. Data Register read or writes and Control 
Register reads can be executed with only one Wait State, 
which improves the Data Ciphering speed of this interface. 
The more sophisticated Wait control logic and the two exter 
nal TTL gates can be integrated into one PAL22V10 device. 
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80286 to Am9568 Data Ciphering Processor Interface 
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Figure 2. Timing Diagram 
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TITLE 
PATTERN 
REVISION 
AUTHOR 
COMPANY 


DATE 


80286 to Am9568 Data Ciphering Processor Interface 


80286 TO Am9568 (DCP) INTERFACE 
DCP043 

01 

JUERGEN STELBRINK 

ADVANCED MICRO DEVICES 

11/06/87 


CHIP DCP _INTFCE PAL16R4 


CLK /CS CSL ALE /IORC /IOWC Al A2 NC GND 
JOE Dl D2 /Q1 Q2 /Q3 CMDLY AD1 AD2 vcc 
EQUATIONS 
Ql ALE*CS 

+ /Ql 
/Q2 Q1*/Q2*/ALE 

+ Q1*/Q2*/CS 

+ Q1*Q2*/ALE 

+ Q1*Q2*/CS 
Q3 ALE*CS 

+ Q1*Q2*Q3 


+ /Q1*Q2*Q3 
+ Q1*/Q2*Q3 
+ /Q1*/Q2*/Q3 


/CMDLY:= /ALE 


+/CS 

/D1 = /ADI 

D1.TRST = CSL * IORC 

/D2 = /AD2 

D2.TRST = CSL * IORC 

/AD1 = /Al1*ALE + /D1*/ALE 
AD1.TRST = CSL * IORC 
/AD2 = /A2*ALE + /D2*/ALE 


AD2.TRST 


= CSL * IORC 


; SIMULATION NOT INCLUDED 
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Figure 3. Design File for the Example of Figure 1 
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80286 to Am8530 Interface 





80286 to Am8530 Interface 


The Am8530 is a high-speed, dual-channel serial communi- 
cations controller that supports a variety of advanced 
communication protocols. It supports data rates up to 
1.5M bps. 


This design note shows an 80286 to an Am8530 CPU inter 
face when no interrupts are used (Figures 1 and 2). The 
design is for a CPU running at 5 MHz with a 100 ns system 
clock cycle time. This clock is used to generate the clock for 
the 6 MHz Am8530 SCC. 


The chip select generated by the external decoder in an 
80286 system that is enabled by the M/IO line will go inactive 
with the falling edge of Tci«-phase 1. However, the chip 
enable for the SCC should stay valid for the entire read or 



















write cycle. The PAL device generates the chip enable signal 
for the SCC using a structure equivalent to two cross-coupled 
NOR gates. 


The Am8530 requires a minimum pulse width of 250 ns for RD 
and WR control signals. For a 100-ns cycle time 80286 with no 
wait state, these controls will be active for 200 ns only. SRDY 
control is generated for the 82284 bus controller to guarantee 
minimum RD and WR pulse widths for the Am8530. 


CMDLY is not used in this design because the address to read 
or write control setup time required is only 80 ns for the 
Am8530; with a 100 ns system cycle time, the time between 
ALE valid and /IORC or /IOWC valid is 100 ns. Even after 
accounting for the address latch delay, 80 ns timing will still 
be met. 
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Figure 1. 80286—Am8530 Interface ~ 
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" THIS PLPL FILE IS FOR A 16R4 THAT IMPLEMENTS THE LOGIC ™ COUNT DOWN SPECIFICATION FOR THE COUNTER. " 
NECESSARY TO INTERFACE AN Am8530 (SCC) TO AN 80286 SYSTEM. " " Q2 OF THE COUNTER IS THE CLOCK INPUT OF THE Am8530. " 
DEVICE Am8530 to 80286 (PAL16R4) CASE ( Q[3:1] ) 
BEGIN 
PIN 0) Q(3:1] :=7; 
CLK = 1 VCC = 20 1) Q[3:1] :=0; 
/cs = 2 /CE = 19 2) Q(3:1] :=1; 
ALE =3 /INTSO = 18 3.) Q(3:1] :=2; 
/iorc =4 /QC1] = 17 4) Q(3:1] :=3; 
/lowe  ==5 /Q{2] = 16 5 ) Q(3:1] :=4; 
NC1 =6 /Q{3] = 15 6) Q(3:1] :=5; 
NC2 =7 NC5 = 14 7 ) Q(3:1]) := 6; 
NC3 = 8 /SRDY = 13 END ; 
NC4 =9 NC6 = 12 
GND = 10 NC7 = 11; " SRDY IS GENERATED FOR WAIT STATE GENERATION, IT IS SENT 


TO THE 82284 CLOCK GENERATOR. " 


BEGIN 
SRDY = /(Q[3] * CE) ; 
" CHIP ENABLE FOR THE Am8530 IS DERIVED FROM ALE AND 
THE EXTERNAL DECODER CHIP SELECT OUTPUT. " END. 


CE = CS * ALE + INTSO ; 
INTSO = /€ IORC * IOWC) + CE ; 


™ SYNCHRONIZE THE COUNTER WITH ALE. " 


IF ( CS * ALE ) THEN Q[3:1] = 7 ; 


Figure 2. Source Listing for AmPAL16R4 for the Example of Figure 1 
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interfacing to the G8000/68020 


The 68000 has an asynchronous, 16-bit, bidirectional data 
bus. Data types supported by the 68000 are: bit data, integer 
data of 8, 16, or 32 bits, 32-bit addresses and binary-coded 
decimal data. It can transfer and accept data in either words 
or bytes. The DTACK input indicates the completion of a data 
transfer. When the processor recognizes DTACK during a 
read cycle, the data is latched and the bus cycle terminates. 
When DTACK is recognized during a write cycle, the bus 
cycle also terminates. An active transition of DTACK indi- 
cates the termination of a data transfer on the bus. All contro! 











and data lines are sampled during the 68000's clock HIGH ' 


time. The clock is internally buffered, which results in some 
slight differences in the sampling and recognition of various 
signals. The 68000 mask sets prior to CC1 and allows 
DTACK to be recognized as early as S2, and all devices 
allow BERR or DTACK to be recognized in S4, S6, etc., 
which terminates the cycle. If the required setup time is met 
during S4, DTACK will be recognized during S5 and S6, and 
data will be captured during S6. DTACK signal is internally 
synchronized to allow for valid operation in an asynchronous 
system. If an asynchronous control signal does not meet the 
required setup time, it is possible that it may not be 
recognized during that cycle. Because of this, synchronous 
systems must not allow DTACK to precede data by more 
than 40 to 240 nanoseconds, depending on the speed of the 
particular processor. I/O is memory-mapped, i.e., there are 
no special !/O control signals, any peripheral is treated as a 
memory location. 
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DMA: This Bus is requested by activating the BR input of the 
68000. Bus Arbitration is started by the BG output going active. 
The Bus is available when AS becomes inactive. The request- 
ing device must acknowledge bus mastership by activating the 
BGACK input to the CPU. 


The 23-bit address (Aq...A23) is on a unidirectional, three-state 
bus, and can address 8 M words (16 M bytes) of memory or 
I/O. It provides the address for bus operation during all cycles, 
except the interrupt cycles. During interrupt cycles, address 
lines A1, A2 and A3 provide information about the level of 
interrupt being serviced. Instead of Ag and BYTE/WORD, 
there are two separate data strobe lines for the two bytes ina 
word. A note of caution here, the 68000 treats the MSB of the 
lower byte as an even byte, or word address. The same goes 
with processors such as the Z8000. Processors such as the 
8086 treats the lower byte as the odd byte. 


Interrupt is requested by activating any combination of the in- 
terrupt inputs to the 68000 (IPLO...2), indicating the encoded 
priority level of the interrupt requester (inputs at or below the 
current processor priority are ignored). The 68000 automati- 
cally saves the status register, switches to supervisor mode, 
fetches a vector number from the interrupting device, and dis- 
plays the interrupt level! on the address bus. For interfacing 
with old 68000 peripherals, the 68000 issues an Enable signal 
at one-tenth of the processor clock frequency. There are a 
number of AMD proprietary third generation peripherals that 
can be interfaced to the 68000 CPU, to improve system perfor- 
mance. This section deals mainly with the interfacing of the 
68000 and some of the AMD proprietary peripherals. 
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The 68000 to Am8530 Interface 
with Interrupts 





This example shows how a PAL device simplifies the task of The timing during register programming is not shown. The PAL 
interrupt generation compared to the MSI implementation. device allows selection of one or two Wait States by making Wo 
The block diagram for the interface via a PAL device is shown HIGH or LOW, respectively. The table below shows the appro- 
in Figure 1. The timing diagram (Figure 2) illustrates the Inter- priate number of Wait States as a function of CPU speed. 
rupt Acknowledge cycle. As in the other designs, RD is 

generated during Interrupt Acknowledge to place the vector CPU Speed 

on the bus. 





PAL device equations are shown in Figure 3. 
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Figure 1. 68000 to Am8530 Connection Using a PAL Device 02188A-68 
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The 68000 to Am8530 Interface with Interrupts 
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Figure 2. Timing Diagram for 68000 to Am8530 interface 
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The 68000 to Am8530 Interface with Interrupts 





TITLE 68000 TO 8500 OR 9500 PERIPHERALS 
PATTERN PATOO2 

REVISION 01 

AUTHOR JOE BRCICH 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/06/87 


CHIP PERI_INTFC PAL16R4 


CLOCK /CS RW /LDS /WO /AS FCO FCl FC2 GND 
/OE /INTA /ACK /C /B /A /DLDS /RD /WR VCC 


EQUATIONS 
A := A*/B 
+ B *C 
+ /AS 
B := A*/C 
+ /A*C 
+ /AS 
C := /A*/B*AS 
+ B*C*AS 
DLDS := LDS 
RD = LDS*DLDS*RW*/INTA 
+ A*C*INTA*AS 
+ A*/BRINTA*AS 


WR = LDS * /RW 

INTA = FCO*FC1*FC2*AS 
ACK = /INTA*/A*/B*/C*/WO 
/INTA* /A* /B*C*WO 
INTA* /B*A 

ACK * LDS 


++en 


; DESCRIPTION 


;THIS PAL DEVICE INTERFACES 85XX TYPE PERIPHERALS TO THE 68000 
;MICRO PROCESSOR. IT INSERTS 1 OR 2 WAIT STATES AS SELECTED BY 
;/WO=0 IS ONE AND /W0=1 IS TWO WAIT STATES. FOUR WAIT STATES ARE 
; INSERTED DURING INTERRUPT ACKNOWLEDGE CYCLES. ALSO THE RD OUTPUT 
;GENERATED DURING INTA IS A FUNCTION OF THE INTERNAL STATE 
;MACHINE AND NOT A FUNCTION OF LDS. OE CAN BE LEFT OPEN SINCE 
;THE FLIP FLOP OUTPUTS ARE NOT USED DIRECTLY. THE FALLING EDGE 
;OF RD IS DELAYED IN ORDER TO GUARANTEE THE CS TO RD SETUP TIME 

; REQUIREMENTS. 


; SIMULATION NOT INCLUDED 


Figure 3. Source Listing for the Example of Figure 1 
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68000 and Am7990 LANCE interface 





The design on the LANCE has made it easier for the user to faster 68000 (Figure 1). The two flip-flops are needed to adapt 
interface the device with demultiplexed buses. The example the LANCE bus request handshake to the 68000. 
shown here is an interface to be compatible with an 8 MHz or 


ADDRESS BUS DATA BUS 





DTACK FROM 
SYSTEM 
A 
BR 


QD 
mB Aa 
P 
LANCE 
el | o ‘AS /CS/RB/TB Am7990 


— ICLR2 IHOLD HOLD 
ae IBGACK 
IDAS DAS 


AmPAL 16L8 
68K9U.PAL 


02188A-70 





Figure 1. Am7990 to 68000 Interface 
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68000 and Am7990 LANCE Interface 





Autovectoring is used since the Am7990 does not return a 
vector during interrupt acknowledge cycles. The BYTE and 
DAS signals of LANCE are used to generate the UDS and LDS 
when LANCE is in Bus Master mode; the UDS and LDS is used 
to generate the DAS when LANCE is in Bus Slave mode. It 


TITLE 68000 TO LANCE INTERFACE 
PATTERN PATOO2 

REVISION 01 

AUTHOR JOE BRCICH 

COMPANY ADVANCED MICRO DEVICES 
DATE 11/06/87 


CHIP LANCE IN PAL16L8 


/AS 
/CS 


RW BYTE /HOLD NC /BG AO 
/TB /UDS /DAS /CLR1 


EQUATIONS 


RB = CS*RW*UDS 
+ CS*RW*LDS 
RB.TRST = /BGACK 


TB = CS*/RW 
TB.TRST = /BGACK 


UDS = DAS*/A0*BYTE 
+ BYTE*DAS 
UDS.TRST = BGACK 


LDS = DAS*AO*BYTE 
+ /BYTE*DAS 
LDS.TRST = BGACK 


DAS = UDS*LDS 
DAS.TRST = /BGACK 


CLR1 


/AS*BG 


CLR2 


/BR = 


BGACK ; DELAY 
/HOLD 


; DESCRIPTION 


takes two latches to demultiplex the LANCE address/data 
lines to adapt to the 68000 address bus. The flip-flops can be 
replaced by a PAL22V10 to minimize parts count. Equations 
for the PAL16L8 are shown in Figure 2. 


NC /BGACK GND 
BR /CLR2 


/LDS /RB VCC 


;THE GOAL OF THIS INTERFACE WAS TO BE COMPATIBLE WITH 8 MHZ 


;AND FASTER 68000'S WHILE MINIMIZING PARTS COUNT. 


THE 


;Am22V10 COULD BE USED TO ELIMINATE THE TWO FLIP-FLOPS 


; SHOWN. 


AUTOVECTORFING IS USED SINCE THE 7990 DOES NOT 


;RETURN A VECTOR DURING INTERRUPT ACKNOWLEDGE CYCLES. 
;NOTE PROGRAM BSWP, BCON TO 1, AND ACON TO 0 IN CSR3 REG. 


; SIMULATION NOT INCLUDED 


Figure 2. Source Listing for the Example of Figure 1 
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68000 to Am Z8068 Data Ciphering 


Processor Interface 





Figures 1 and 2 show the 68000-DCP interface and the inter 
face timing. This interface provides a two-chip solution to add 
high-speed data ciphering to a 68000-based system. About 
500 kbyte/sec are possible in a CPU-controlled transfer. The 
ciphering rate can be increased with a sophisticated DMA 
controller, or with several DCPs operating in parallel. The 
CPU operates at 8 MHz and the DCP operates synchro- 
nously at 4 MHz. The Interface Controller, a PAL device, gen- 
erates the Address and Data Strobes for the DCP and the 
Data Acknowledge for the CPU. It also divides the CPU clock 
by two and synchronizes it to the Data Strobes. 


The main features of this interface are: 
— Multiplexed Control Mode 


— Demultiplexed address and data bus 
— Two-Cycle Operation 


PROM, 
COMPARATOR, 
OR OTHER 
DECODER 


68000 





Figure 1. AmZ8068 to 68000 Interface 
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— Clock Synchronization with two Low Cycles after the Data 
Strobes 


— About 500 kbyte/sec ciphering speed 


Data transfers between the CPU and the DCP are accom- 
plished by a two-cycle operation. First the address of an inter- 
nal register is latched in, then the data is transferred. This 
causes a small overhead in the initialization phase, but im- 
proves the ciphering rate in a high-speed data ciphering ses- 
sion. The rate of 500 kbyte/sec can be reached only if a 
high-speed peripheral device is connected to the Slave Port 
and the DCP is programmed for dual port configuration. 


The PAL device is programmed to allow only DCP transfers to 
the DCP. The PAL device equations are shown in Figure 5. 
Ao must be odd to make the CPU transfer the data on the Low 
byte of the data bus. A “0” on A, indicates an Address Latch 
Cycle, whereas a'“1" on A, indicates a Data Transfer Cycle: Ag 
must be “1" in both cycles. 









PAL16R4 AmZ8068 








MA AS 


02188A-73 
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68000 to Am Z8068 Data Ciphering Processor Interface 





So $1 82 $3 S4 
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Figure 2. 68000-AmZ8068 Address Latch Cycle (A; =Low) 


———_--O 


An address decoder generates the Chip Select for the DCP. 
The Address Strobe indicates a valid address. The PAL device 
is only activated if the Lower Data Strobe becomes active while 
the Upper Data Strobe stays inactive. This means that data is 
transferred in MOVE.B instructions with an odd peripheral 
address. 


The PAL device provides two Data Acknowledge outputs. 
DTACK, is an active Low TTL output. DTACK. has the same 
timing as DTACK, , but is an Open Collector output. (The 
Open Collector output is realized by a three-state output which 
has only two states, Low or Floating.) 











Address Latch Cycle 


In this cycle only a Master Port Address Strobe (MAS) is is 
generated. Master Port Chip Select (MCS) is tied to Low. LDS 
is sent to the MAS output. The minimum pulse width of LDS is 
115 ns; 80 ns are required for the AmZ8068. 


DTACK is activated with the falling edge of the CPU clock after 
cycle Sg. The CPU inserts no Wait states. DTACK is deac- 
__ tivated with the first edge of CLK after AS becomes inactive. 





Data Read Cycle: (Figure 3) 


The generation of MDS in a Data Read Cycle is similar to the 
Data Write Cycle. Because the CPU activates LDS one cycle 
earlier, there is no need for a Wait State. The minimum pulse 
width of LDS is 240 ns; the DCP requires 200 ns for a Status 
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Register read. DTACK is activated using the same logical con- 
dition as in the Data Write cycle. Because of the earlier activa- 
tion of LDS, DTACK becomes active earlier and the CPU 
inserts no Wait states. 





Data Write Cycle: (Figure 4) 


A Data Write Cycle is performed with Ag is HIGH, AS, CS and 


LOS are LOW. The minimum pulse width of LDS is not suffi- 


cient for the DCP which requires at least 125 ns. One Wait 
state or a slower system clock will satisfy this parameter. In this 
interface, one Wait state is inserted by activating DTACK at the 
end of Sq. 


The DCP clock is synchronized in Data Read or Write Cycles 
by forcing it Low when DTACK becomes active. This guaran- 
tees that the DCP clock has a falling edge just before LDS 
(MDS) rises. The delay of the DCP clock to CLK is typically 
8 ns for a PAL device. The delay of LDS to MDS is typically 
12 ns. The delay of LDS to the system clock is 0-70 ns for the 
8 MHz version. This results in a delay of 4-74 ns of MDS to 
the DCP clock. The DCP requires O—50 ns when operating at 
the maximum clock rate. 





This problem is solved by stretching the clock for one cycle. 
The DCP clock stays LOW for two cycles in the end of a 
transfer cycle. This is done automatically by the PAL device 
(see Figure 4). 
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68000 to AmZ8068 Data Ciphering Processor Interface 
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02188A-75 
Figure 3. 68000-AmZ8068 Data Read Cycle (A; =HIGH) 
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Figure 4. 68000-AmZ8068 Data Write Cycle (A; = HIGH) 
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68000 to AmZ8068 Data Ciphering Processor Interface 





TITLE 68000 - AmZ8068 (DCP) INTERFACE DEVICE 
PATTERN DCP044 

REVISION 01 

AUTHOR JUERGEN STELBRINK 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/06/87 


CHIP AmZ_ INTFCE PAL16R4 


CLK2 /CS  /AS  /UDS /LDS RW Al CLK NC GND 

/OE  /MAS /MDS DCPCLK NC NC /DTACKl CLK1 /DTACK2 VCC 

EQUATIONS 

/CLKl = CLK ; INVERT CLOCK TO TRIGGER THE REGISTERED 


;OQUTPUTS WITH THE FALLING EDGE OF CLK 


MAS = AS*LDS*/UDS*/RW*/A1*CS 
MDS = AS*LDS*/UDS*A1*CS 
/DCPCLK := DCPCLK ; DIVIDE BY TWO 
+ /DTACK1*CS*AS*LDS* /UDS 
+ DTACK1*/AS* /LDS* /UDS ; TWO CLOCKS LOW IN 
+ THE END OF A DATA CYCLE 
DTACK1 := AS*LDS*/UDS*A1*CS DATA TRANSFER CYCLE 


me Ne 


+ AS*/RW*/A1L*CS ADDRESS LATCH CYCLE 
DTACK2 = DTACK1 
DTACK2.TRST = DTACK1*AS*CS 


; SIMULATION NOT INCLUDED 


Figure 5. Source Listing for Example of Figure 1 
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68000 and Dual Am9516 DMA 


Controllers Interface 


There has been interest shown in connecting two Am9516 
DMA Controllers to obtain four channels. The example here 
shows that such a system can be built by incorporating one 
PAL device, AMD’s 22V10 (Figure 1). Address and data 
buses are not shown as they are straightforward and require 
no explanation. The PAL device, designated 68K16D2, con- 
verts the two DREQs into the 68000 three-wire handshake, 
prioritizes the request, and converts the control signals 


PAL22V10 
68K16D2 


ALLOWS 
OTHER BUS 
MASTERS 





appropriately. Equations for the PAL device are shown in 
Figure 2. 


The key parameters are: 1) data hold with respect to the rising 
edge of DS during a write, and 2) DTACK setup time. Control 
for a data bus transceiver is shown because it will be required 
in most systems. The PAL device provides these signals when 
the CPU is bus master; the Am9516 generates these control 
signals directly when it is bus master. 





Am9516 
AW 


RBEN 
TBEN 





BACK: BACK: 
BACK2 





BACK2 BGT 


02188A-80 


Figure 1. Dual Am9516 UDCs to 68000 CPU Interface 
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68000 and Dua! Am9516 DMA Controllers Interface 


The Am9516s are shown with independent clocks. The clocks 
may be divided from the CPU clock or may be generated inde- 
pendently of the CPU. Because WAIT must meet the set-up 
and hold times, DTACK may need synchronization by the use 
of a flip-flop, as shown. This flip-flop may be eliminated in 
synchronous systems. 


The clock is inverted to the PAL device to meet DTACK set-up 
time in systems of 10 MHz or faster. This inverter may be 
deleted in slower systems, if appropriate changes to the PAL 
device equations are made. This PAL device implements fixed 
priority between the Am9516s. BREQ, is the highest priority. 
Rotating priority can be implemented by adding another PAL 
device. 








EOPs are pulled up separately, but could be tied together 
since they affect a channel only if it is active. The bus-error 
function can be supported by connecting BERR and EOP. 





If a bus error occurs, EOP will stop the current transfer and 
interrupt the CPU. The Interrupt Service routine can read the 
status to determine if EOP caused the interrupt or if termina- 
tion was normal. If EOP caused the interrupt, the Address 
Register can be read to determine where the bus error oc- 
curred. After the problem is corrected, the CPU can program 
the Am9516 to complete the transfer or do an alternate trans- 
fer, as appropriate. 





When operating the DMA in interleave mode, an external EOP 
should be gated with DACK to prevent affecting the wrong 
channel. This is unnecessary if interleave is not used, since the 
UDC releases the bus. 





This arbiter design supports both serial and parallel-expansion 
techniques and is therefore compatible with VME bus protocol. 
Bus grant out was implemented with an external gate due to a 
shortage of pins. The VME/BCLR function was not imple- 
mented because the Am9516 does not support pre-emption. 
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68000 and Dual Am95i6 DMA Controllers Interface 





TITLE 68000 TO DUAL 9516 INTERFACE 
PATTERN PATOO1] 

REVISION 01 

AUTHOR JOE BRCICH 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/06/87 


CHIP DUAL _9516 PAL22V10 


CLK RW AU BREQl BREQ2 /BG NC ALEl /BW ALE2 /BGACK GND 
/CS /LDS /UDS /DS /C /AS /BR BACK2 BACKl /TB /RB VCC 


EQUATIONS 


BR = BREQ1*/BGACK 
+ BREQ1*BG 
+ BREQ2* /BGACK 
+ BREQ2*BG 





BACK1 = BREQ1*BG*/AS 
+ /BG*BGACK 


BACK2 = BREQ2*/BREQ1*BG*/AS 
+ /BG*BGACK 


RB = CS*RW*UDS 
+ CS*RW*LDS 
RB.TRST = /BGACK 


TB = CS*/RW 
TB.TRST = /BGACK 


DS = AS*/C*/RW 
+ AS*RW 
DS.TRST = /BGACK*AS 


AS = ALEl 
+ ALE2 
AS.TRST = BGACK 


UDS = DS*/A0*/BW 
+ BW*DS 
UDS.TRST = BGACK 


LDS = DS*A0*/BW 
+ BW*DS 
LDS.TRST = BGACK 


C := UDS*BGACK 


+ LDS*BGACK 
C.TRST = AS 


Figure 2. Source Listing for the Example of Figure 1 
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Am8530 to 68020 Interface 





This design shows the logic to interface a 6 MHz Am8530 
Serial Communications Controller to the 68020 CPU (Fig- 
ure 1) running at 10 MHz with a system clock cycle time of 
100 ns. The Am8530 is a high-performance, dual-channel 
SCC that supports data rates up to 1.5M bps and a variety of 
communication protocols. 


The PAL device generates the /RD and /WR control signals for 
the Am8530 from the 68020 /DS and R/W- control signals. It 
also generates the clock for the SCC by dividing the 68020 
system clock. This meets the 165-ns minimum cycle time for 
the 6-MHz SCC clock.. 


D[24:31] 


ey. i 


RIW- 


DSACK[1] 
DSACK{0] 





Also, a state machine is implemented to perform the hand- 
shake necessary for byte transfers on the 68020 bus. The 
state machine transition diagram is shown in Figure 2. Nor- 
mally, DSACK[1:0] is inactive. When this device is selected, a 
wait state is inserted and then data transfer is acknowledged 
by asserting DSACK[1:0] for byte transfer. When the CPU 
deasserts the address strobe, DSACK[1:0] is negated. 
DSACK[1:0] will be driven by other devices on the 68020 
system bus. The PAL device enables DSACK{[1:0} only when 
the CPU performs a read or write cycle for the SCC; at other 
times DSACK[1:0] lines are three-stated. 


Since the Am8530 is a byte-wide peripheral, it is connected to 
the upper byte of the 32-bit wide 68020 data bus. 


ICS 
IAS 
10S 
RW 


PAL16R4 


CLK 


08749A-11 


Figure 1. Am8530 to 68020 Interface 
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Am8530 to 68020 Interface 





ICS 








DSACK[1:0]=11 


DSACK[1:0]=11 2. 


INIT 


DSACK[1:0]=10 


AS 


08479A-12 


Figure 2. 68020 Byte Port DSACK Handshake State Diagram 
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Am8530 to 68020 Interface 





" THIS PLPL FILE IS FOR A 16R4 THAT IMPLEMENTS THE LOGIC " THE FOLLOWING CODE IMPLEMENTS THE STATE MACHINE TO 
NECESSARY TO INTERFACE AN Am8530 (SCC) TO A 68020 SYSTEM. " PERFORM DSACK OPERATION. IT INSERTS WAIT STATES, 
ASSERTS DSACK FOR BYTE TRANSFER AND REMOVES DSACK 

DEVICE Am8530_TO 68020 (PAL16R4) WHEN AS IS NEGATED. " 
PIN IF ¢( /CS ) ENABLE ( DSACK[1:0] ) 

CLK = 1 vcc = 20 

/AS = 2 DSACK(1] = 19 IF ¢ ST(1:0] = #B10 ) 

/DS =3 DSACK[0] = 18 THEN DSACK[1:0) = #B10; 

RW =4 STi) = 17 ELSE DSACK[1:0) = #811; 

/CS =5 sT(0] = 16 

NC1 =6 Q = 15 CASE ( ST[1:0) ) 

NC2 =7 NC5 = 14 BEGIN 

NC3 = 8 WR = 13 #800) BEGIN 

NC4 =9 RD = 12 IF ¢ /CS ) 

GND = 10 NC6 =11; THEN ST(1:0}] := #B00 ; 

ELSE ST(1:0]) := #B01; 
BEGIN END ; 
#801) ST(1:0] := #810; 
" READ AND WRITE CONTROL SIGNALS ARE DERIVED FROM #310) BEGIN 
R/W- AND DS-. " IF ¢ /AS ) 
: THEN ST[1:0] := #800 ; 
WR = /(DS * /RW) ; ELSE ST(1:0} := #810; 
RD = /(DS * RW) ; END ; 
END ; 

" THE INCOMING CLOCK IS DIVIDED BY 2 TO GENERATE THE END. 


CLOCK FOR THE Am8530. " 


CASE ( Q ) 
BEGIN 
0) Q := 1; 
1) Q := 0; 
END; 


Figure 3. PLPL Specification for the 8530 to a 68020 Interface 
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Interfacing to the 8088 





Overview 


The 8088 CPU is an 8-bit processor designed around the 8086 
internal structure. Most functions of the 8088 are identical to 
the equivalent 8086. The 8088 fetches and writes 16-bit words 
in two consecutive bus cycles. Both the 8086 and the 8088 
handle the external bus the same way but the 8088 handles 
only 8-bits at a time; and both appear identical to the software 
engineer, with the exception of execution time. 


The hardware interface of the 8088 contains the major differ- 
ences between the two CPUs. The pin assignments are nearly 
identical, however, with the following functional changes: 


Ag-A15 These pins are only address outputs on the 
8088. They are latched internally and remain 
valid throughout a bus cycle in a manner 
similar to the 8085 upper address lines. 


Sso Provides the SO status information in the mini- 
mum mode. This output occurs on pin 34 in 
minimum mode only. 


DT/R,1O/M, and SSO provide the complete bus status in 
minimum mode. 


1O/M- has been inverted to be compatible with the 
MCS-85 bus structure. 


ALE is delayed by one clock cycle in the minimum mode when 
entering HALT, to allow the status to be latched with ALE. 
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The 8088 and AMD Proprietary 
Peripherals 


The evolution of chip design has taken the 8-bit environment 
into the 16-bit environment. While the new generation of pe- 
ripheral devices are often 16 bits wide, the older, established 
8-bit orientation of CPUs and peripherals are still significant. 
Interfacing a 16-bit peripheral with an 8-bit CPU often encoun- 
ters data path incompatibility and involves bus control 
manipulation. This type of integration mainly involves separat- 
ing the control and data paths from the new peripheral and the 
system. 


The ability to mix different data path widths can improve sys- 
tem functionality, performance, and cost. It is less expensive 
to use an 8-bit bus in a new design because the memory 
requirements are generally cheaper. A designer can use this 
data path mixing to upgrade the existing system until a new 
system design is warranted, or the designer can simply im- 
prove onthe existing design as new peripherals become avail- 
able. 
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8088 to Am9516 UDC Interface 





Figure 1 shows the Data Bus and Control Interface between 
an 8088 microprocessor and an Am9516. This interface is 
accomplished by using an AmMPAL22V10, a 74LS161 
counter, an Am2952 bidirectional |/O port, and an Am2949 
bidirectional transceiver. Figures 2 and 3 show the PAL 
device timing for both the Bus Master and the Bus Slave 
cases. 


In this design, certain simplifying constraints have been made. 
Word operations are only allowed during command chaining 
operations when the Am9516 is Bus Master. During Slave 
Write operations, the first byte output to the Am9516 must have 
an odd address, and the following second byte an even ad- 
dress. Conversely, during a Slave Read cycle, the first byte 
read from the Am9515 must be at an even address and the 
second at the next higher odd address. Furthermore, for both 


RESET 


Figure 1. The Am9516 UDC to 8088 CPU Data Bus and Control Interface 
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slave and master operations, the system must use the 
latched Ao (LAo) from the PAL22V10 as its sole Ao. That is, 
this LAg must circumvent any address latching to memory 
and so is to be used directly. 


As can be seen from the figures, the AmPAL22V10 manip- 
ulates the control signals to the transceivers and latches so 
that bytes can be funneled into words during Am9516 Com- 
mand Chaining operations and Slave Writes, and words can be 
funneled into bytes during Am9516 Slave Reads. Also, LAO is 
alternately toggled in order to provide the dual address (16- 
bits) during chaining operations. 


Figure 4 shows the PLPL equations and resulting sum-of- 
products equations for the design (PLPL is a higher level 
“C-like” language for PAL devices). For those not familiar 
with “C” or Pascal, the intermediate sum-of-products form 
shows the equations in Boolean form. 
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8088 to Am9516 UDC Interface 
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02188A-61 
Figure 2. The Am9516 UDC to 8088 CPU Bus Slave Timing 
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02188A-62 
Figure 3. The Am9516 UDC to 8088 CPU Bus Master Timing 
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8088 to Am9516 UDC interface 


DEVICE 8088 9516 (pal22v10) 


"This is a PLPL file to implement an interface between the 8-bit 
8088 and the 16-bit Am9516. This design assumes that the Am9516 
is programmed for byte operations and so only accesses 16-bit 


words during chaining. -James Williamson" 
PIN 
CK = 7 /RD = 23 
SC0:2] = 2:4 /WR = 22 
AO = 5 LAO = 21 
/SEL = 6 /DS = 20 
ALE S49 /RW = 19 
HLDA = 8 /WAIT = 18 
/BW = 9 /A = 17 
READY = 10 /B = 16 
RESET = 1] /c = 15 
/D = 14; 


BEGIN 


IF (RESET) THEN ARESET(); 


IF (HLDA) THEN ENABLE(); 
IF (/S(2] * HLDA) THEN BEGIN 
IF (/s(C1) * /S{0]) THEN 
LAO = /CK * BW + /BW * AO * ALE + 
/BW * LAO * /ALE 
ELSE 
LAO = BW + /BW * AO * ALE + 
/BW * LAO * /ALE 


END; 
IF (HLDA) THEN 
CASE (S[2:0]) 


BEGIN 
1) BEGIN 
RD = /RW * DS 
A = /BW * /RW * /CK 
WR = /BW * RW * DS 
c = /BW * RW 
WAIT = 1 
END; 
2) BEGIN 
RD = /RW * DS 
B = BW 
A = /BW * /RW 
WR = /BW * RW * DS 
Cc = /BW * RW 
WAIT = BW 
END; 


Figure 4. PLPL Specification for 8088 to Am9516 Interface 
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3) BEGIN 
RD = /RW * DS * B : 
B = BW * CK : 
A = /BW * RD ; 
WR = /BW * RW * DS ; 
c = /BW * RW : 
WAIT = BW ; 
END; 
5) BEGIN 
RD = /RW * DS ; 
A = /BW * /CK : 
WAIT = BW : 
END; 
6) BEGIN 
RD = /RW * DS H 
A = BW 7 
END; 
7) BEGIN 
RD = /RW * DS 7 
A = /RD ;: 
END; 


BEGIN 
LAO = AO * ALE * SEL + 
LAO * /ALE * SEL ; 
B = LAO * WR * SEL ; 
A = /LAO * WR * SEL ; 
DS =At 
/LAO * RD * SEL ; 
C = /LAO * RD * SEL : 
D = LAO * RD * SEL : 
END; 


END. 


Figure 4, PLPL Specification for 8088 to Am9516 Interface (Continued) 
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80186 to Am9516 Universal DMA 
Controller Interface 











The addition of the Am9516 to an 80186 design is a natural This interface accomplishes two major control transformations. 
choice in systems requiring four channels of DMA. Figure 1 First, it converts RD and WR into R/W and DS when the 80186 
shows the interface between the 80186 and the Am9516 with is Bus Master, and vice versa when the Am9516 is Bus Master. 
a PAL device. PCSs is programmed to provide a latched A, Secondly, the transceiver control signals, TBEN and RBEN, 
signal. are generated from DEN and DT/R. This example shows only 


one possible configuration. Other configurations can be made 
as dictated by system requirements. A PAL device is used here 
to reduce board space. 


16 MHz 









pone (ae 
ca Meh ee tT a 
. St T Pn 
ee Pn TRANSCEIVER 
ARDY RO WR MIIO Ao 


02188A-31 


Figure 1. Am9516 to 80186 Interface 
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80186 to Am9516 Universal DMA Controller Interface 





TITLE Am9516 TO 80186 INTERFACE 
PATTERN PATOO1L 

REVISION 01 

AUTHOR JOE ENGINEER 

COMPANY ADVANCED MICRO DEVICES 
DATE 11/06/87 


CHIP DMA_INT PAL16L8 


NC ALED ALEP HLDA BW ADO DT /DEN /SEL GND 
NC /RBEN /RD ALE AO /RW /DS /WR /TBEN VCC 


EQUATIONS 


DS = RD + WR 
DS.TRST = /HLDA 


RW = DT 
RW.TRST 


/HLDA 





TBEN = /DT * /SEL * DEN 
TBEN.TRST = /HLDA 


RBEN = DT * /SEL * DEN 
RBEN.TRST = /HLDA 


RD = /RW * DS 
RD.TRST = HLDA 


WR = RW * DS 
WR.TRST = HLDA 


ALE = /ALEP * /ALED 


AO = /ADO * /BW * HLDA * ALED 
+ ADO * BW * HLDA * ALED 
+ /ADO * /HLDA * ALEP 
+ AO * /ALEP 
+ AO * /ALED 

; DESCRIPTION 


; THIS PAL DEVICE CONVERTS THE CONTROL SIGNALS TO INTERFACE THE 80186 
; TO THE Am9516 DMA CONTROLLER. 


; SIMULATION NOT INCLUDED 


Figure 2. Source Listing for Example of Figure 1 
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68000 Interrupt Controller 


AN-119 





introduction 


Commercial and industrial microprocessor-based systems typi- 
cally consist of a processor interfaced with many peripherals 
which randomly require service. To fully utilize the processor's 
computing potential, sources of hardware interrupts must be 
used to free the processor from software routines. The 68000 
16-bit microprocessor has 256 different exception-processing 
vectors which point to predetermined locations in the program 
memory space. There are 192 external user interrupt vectors and 
seven autovector interrupts. Table 1 shows the exception vector 
assignments. The interrupt structure of the 68000 will be dis- 
cussed in more detail along with two methods of designing an 


Number(s) ) | Dee | Hex | Space| Assignment 
o | o | ooo] sp | Reset: Initial SSP 
a 004 Reset: Initial PC 
2 | 8 | oo8 | sp | Bus Error 
3 Address Error 
4 | 16 | 010] SD | legal Instruction 
5 Zero Divide 
6 018 CHK Instruction 
7 01C TRAPV Instruction 
8 Privilege Violation 


Line 1010 Emulator 
Line 1111 Emulator 


10 cafe [oo 
1 02C 


(Unassigned, reserved) 


13* (Unassigned, reserved) 
14* (Unassigned, reserved) 
15 ee Uninitialized Interrupt Vector 


(Unassigned, reserved) 


mas [oer [J = 

24 ee ee Spurious Interrupt 

25 Level 1 Interrupt Autovector 
26 Level 2 Interrupt Autovector 
27 O6C Level 3 Interrupt Autovector 
28 Level 4 Interrupt Autovector 
29 Level 5 Interrupt Autovector 
30 Level 6 Interrupt Autovector 


31 Level 7 Interrupt Autovector 
32-47 SD | TRAP Instruction Vectors 
[191 | oBF | = 
48-63* (Unassigned, reserved) 
cea ee ae = 


64-255 256 100 User Interrupt Vectors 
1023; 3FF 








* Vector numbers 12,13, 14, 16 through 23 and 48 through 63 are reserved for 
future enhancements. No user peripheral devices should be assigned these 
numbers. 


Table 1. Exception Vector Assignment 


interrupt controller using PAL (Programmable Array Logic) 
devices. 


68000 Exception Processing and 
Pin Description 


The interrupt structure of the 68000 can grant up to 256 types of 
interrupt requests. These interrupts, or exceptions, are generated 
either by external or internal causes and are serviced by directing 
the flow of program to an exception processing routine. Table 1 
lists these exceptions and their respective address in memory. 
Exception vectors are locations in memory where the processor 
fetches the address of a routine or subprogram which will service 
that exception. The exception vector is either internally or exter- 
nally generated depending on the cause of the interrupt. The 
externally generated exceptions ar are interrupts whit which are placed 
on the interrupt control pins (IPL2, IPL1 and jPLO), bus errors, 

and reset requests. The interrupts placed on control pins iPL2- 
IPLO are requests from peripheral devices. The internally-gener- 
ated exceptions come from instructions, address errors, or 
tracing. These different types of exceptions and their priorities 
are shown in Table 2. We will focus on group 1 interrupt 
exceptions. 










Processing 


Reset 
ie) Bus Error 
Address Error 


Exception processing begins 
within two clock cycles. 


Trace 
1 Interrupt 
Illegal Privilege 


Exception processing begins 
before the next instruction. 





TRAP, TRAPV, 
CHK, 
Zero Divide 






Exception processing is started 
by normal instruction execution. 


Table 2. Exception Grouping and Priority 


The pinout of the 68000 is shown in Figure 1. The three 
interrupt control pins (IPL2, !PL1, IPLO) are asynchronous 
active low inputs which indicate the encoded priority level of 


VCC(2) 
—— 
GND(2) 
CLK 


ADDRESS 


DATA BUS »DO-D15 





A1-A23 











68000 
MICROPROCESSOR 


FCO ASYNCHRONOUS 
PROCESSOR BUS CONTROL 
STATUS 
M6800 BUS 
PERIPHERAL ARBITRATION 
CONTROL CONTROL 
SYSTEM INTERRUPT 
CONTROL CONTROL 


Figure 1. 
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68000 Interrupt Controller 








the device requesting an interrupt. The least significant bit is 
IPLO and the most significant bit is IPL2. Level seven, (IPL2, 
IPL1, IPLO) = (000), has the highest priority, while level zero, 
(111), indicates that no interrupts are requested. All lower or 
equal priority interrupts are masked during the interrupt 
service routine of the present interrupting device. There are 
two ways by which a peripheral: device can interrupt the 
normal flow of program: autovectoring or external vector 
number generation. The function code pins (FC2, FC1, and 
FCO) all become high during an interrupt acknowledge cycle. 
The interrupt acknowledge cycle always follows an interrupt 
request only after the present instruction cycle is completed. 
Thus interrupt acknowlege cycles come in between the 
instruction cycles and no information is lost. 


The signals which are used during an external interrupt 
request are listed below with a description of their behavior 
and function. 


ADDRESS BUS: the 24-bit address bus holds the address of the 
data to be accessed. During an interrupt acknowledge cycle the 
lower three bits (A3, A2, A1) hold the encoded level of the inter- 
rupt being serviced. If a level 5 interrupt is being serviced then 
(A3, A2, A1) will be (101) respectively. 

DATA BUS: The lower eight bits of this 16-bit data bus must 
contain the vector number during a user interrupt acknowledge 
cycle. If an autovector routine is in process then the data bus is 
ignored. 


AS: The processor asserts address strobe anytime there is valid 
data on the address bus. It remains asserted for as long as the 
address is valid. 


R/W: This signal defines the data bus transfer as a read ora write 
cycle. During an interrupt acknowledge cycle the processor is in 
a read mode. 


UDS, LDS: The upper and lower data strobes are asserted when 
the processor is ina read or write instruction cycle. Upper strobe 
enables the most significant byte of data while the lower strobe 


INTERRUPT ACKNOWLEDGE SEQUENCE 
FLOW CHART 


PROCESSOR INTERRUPTING DEVICE 


REQUEST INTERRUPT 


GRANT _ INTERRUPT 


1) COMPARE INTERRUPT LEVEL IN STATUS REGISTER 
AND WAIT FOR CURRENT INSTRUCTION TO COMPLETE 

2) PLACE INTERRUPT LEVEL ON Aj, A2, A3 

3) SET R/W TO READ 

4) SET FUNCTION CODE TO INTERRUPT ACKNOWLEDGE 

5) ASSERT ADDRESS STROBE (AS) ___ 

6) ASSERT LOWER DATA STROBE (LDS) 


enables the least significant. 


DTACK: Data transfer acknowledge is an externally-generated 
signal which tells the processor that valid data is present on the 
data bus. If DTACK is not asserted before the falling edge of $4 
(S4 is the fourth CPU clock state in a seven-state instruction cycle) 
then wait states are introduced. 


IPL2-IPLO: The three interrupt contro! pins are inputs which 
contain the encoded priority level of the external interrupting 
device. Note that these are active low pins where (000) is level 
seven encoded. 


FC2-FCO: Function code output pins indicate the processor's 
status. When they are all high the processor is in an interrupt 
acknowledge cycle. 


E, VMA, VPA: Enable, valid memory address, and valid peripheral 
address are the standard 6800 family signals. VPA is also used 
when autovectoring. 


The following two design examples use PAL units as a simple 
and cost-effective interrupt controller interface to the 68000. The 
first introduces external vector generation while the second 
shows how autovectoring can be done on the 68000. 


Prioritized Individually Vectored 
Interrupt (Method 1) 

As shown in the interrupt acknowledge sequence flow chart and 
timing diagram (Figure 2), a peripheral device must interrupt the 
processor by placing the encoded level of priority on the 
interrupt control pins (IPL2-IPLO). The processor then grants 
the interrupt after the completion of the current instruction cycle. 
The encoded priority level of the interrupt is placed on address 
bus bits A3-A1 during the interrupt acknowledge cycle. The 
status code lines (FC2-FC0) become high, indicating an interrupt 
acknowledge cycle. Once the lower data strobe (LDS) is asserted 
the external device must place an 8-bit vector on the least signif- 
icant byte of the data bus. This data is latched in state S7. 


INTERRUPT ACKNOWLEDGE SEQUENCE 
TIMING DIAGRAM 


PROVIDE VECTOR NUMBER er Na sO ON ee NN 


1) PLACE VECTOR NUMBER ON DO-D7 
2) ASSERT DATA TRANSFER ACKNOWLEDGE (DTACK) 





ACQUIRE VECTOR NUMBER 


1) LATCH VECTOR NUMBER 
2) NEGATE LDS 
3) NEGATE AS 


cs, ah 


RELEASE 
1) NEGATE DTACK 








{fe oe 


LAST BUS CYCLE STACK IACK CYCLE STACK AND 
OF INSTRUCTION PCL (VECTOR NUMBER VECTOR 
(READ OR WRITE) (SSP) ACQUISITION) FETCH 


Figure 2. 


START INTERRUPT PROCESSING 
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68000 Interrupt Controller 





As shown on Table 1 vectors 64-255 are assigned to the user 
interrupt vector numbers. These vectors must be generated 
externally and placed on the data bus during an interrupt 
acknowledge cycle. In our interrupt controller we arbitrarily 
choose vectors 249-255 as the vectors assigned to the inter- 
rupting peripheral devices. Table 3a shows this assignment 
and how the vector can be decoded from the address 


bits A1-A3. 
Vector 
—— Address 


D7 D6 D5 D4 D3 D2 D1 DO|A3A2A1 










Priority 
level 


Device # 














1 (low priority) 0 1 
2 10 
3 11 
4 00 
5 0 1 
6 10 
7 (high priority) 11 





Table 3a. 


The two PAL devices used on this design example generate the 
interrupt vectors and all the necessary control signals. The var- 
ious signals and their implementation on the PAL devices are 
explained below. 


INT7-INTO: Any of the seven peripheral devices can 
request an interrupt by asserting one of these inputs. The 
interrupt must remain asserted until acknowledged by the 
CPU. 

FC2-FCO and AS: Function code or processor status code 
become logical high during an interrupt acknowledge cycle. 
Address strobe is asserted anytime valid address is on the 
bus. DTACK and Data output control are decoded from these 
four outputs of the 68000. 

A1-A3: The three least significant bits of the address bus 
contain the encoded level of the interrupting device. These 
signals are used in generating the vector number which is 
to be put on the data bus. 

RESET: Reset is an input which clears all outputs, and is 
used for system initialization. 

IPL2-IPLO: These PAL device outputs are active low synchro- 
nous signals which interface directly to the CPU. They represent 
the encoded level of the highest priority interrupt that is 
requested. Table 3b shows the truth table of our priority encoder 
implemented on a PAL device. 





Encoded int. ievel 
IPL2 IPL1 IPLO 


Interrupt request input from peripheral! 
INT7 INT6 INTS INT4 INT3 INT2 INT1 











Table 3b. 


DTACK: Data transfer acknowledge must be asserted by outside 
circuitry during a data transfer operation. The logic diagram 
shown below illustrates how DTACK is derived from address 
strobe and processor status signals. 





+5V 
AS 





D7-D0: The three least significant bits of the data output can be 
decoded straight from the address bits A3, A2 and A1. That is 
D2=A3, D1=A2 and DO=A1. The other five bits of data can be held 
high with pull-up resistors. Outputs of the three data bits become 
enabled by using the same scheme which enables the DTACK 
output. 


INTACK7-INTACK1: Only one of these signals will be 
asserted during the interrupt acknowledge cycle. This signal 
feeds back into the interrupting device to tell that device 
that its interrupt has been acknowledged. We can use the 
3-bit addresses to decode these signals as shown in Table 3c. 












INTACK 

7 6 5 4 3 2 1 
0) 0) 0 1 1 1 1 1 1 
¢) 0 1 1 1 1 1 1 0) 
0 1 0) 1 1 1 1 0 1 
) 1 1 1 1 1 0 1 1 
1 0) 0 1 1 1 1 1 1 
1 0 1 1 1 0 1 1 1 
1 1 0 1 0 1 1 1 1 
1 1 1 0 1 1 1 1 1 








Table 3c. 


The logic diagram of the controller is shown in Figure 3. The 
controller can operate without any wait states at the fastest CPU 
clock rate of 12.5 MHz as shown in the timing diagram of Figure 4. 


Auto-vectored Interrupts (Method 2) 


The seven autovector interrupts are assigned vector numbers 25 
through 31. Interrupts are requested by placing the encoded 
level of the request on the interrupt control pins (IPL2- IPLO). The 
processor responds to this request by placing the requested level 
in the processor's status register and by inhibiting all requests of 
lower priority. When the current instruction cycle is completed 
an interrupt acknowledge cycle takes place. If Valid Peripheral 
Address (VPA), is asserted before the falling edge of S4 then an 
autovector routine takes place. The data or the vector number is 
generated internally depending on the priority level of the inter- 
rupt request; the vector assigned is shown on the table at the 
beginning of this report. The autovector timing diagram and a 
very simple and practical interrupt controller implemented ona 
PAL device is shown in Figure 5. The PAL design specifications 
are included in the appendix. Note that VPA is generated_by 
enabling the PAL output when FC2-FC1 are high and AS is 
asserted. 
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Interrupt Controller Logic Diagram +5V 
External Vector Generation 
D7 
D6 
PAL20L10 +5V DS 
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DTACK 
ss e113 ee | 14 —Pr| 19 |«<— 
IPLO-2 
WRITE MACHINE CYCLE STACK PCL INTERRUPT ACKNOWLEDGE CYCLE STACK 
1 CLOCK PERIOD (80 ns) 8 CLOCK HIGH TO R/W HIGH (60 ns) 15 DATA VALID TO DS LOW (15 ns) 
2 CLOCK LOW TO ADDRESS (55 ns) 9 CLOCK HIGH TO R/W LOW (60 ns) 16 CLOCK HIGH TO AS, DS LOW (55 ns) 
3 CLOCK HIGH TO ADDRESS Hi-Z (60 ns) 10 CLOCK HIGH TO FC VALID (55 ns) 17 PAL INPUT TO Hi-Z (25 ns) 
4 CLOCK HIGH TO AS LOW (55 ns) 11 CLOCK HIGH TO DATA Hi-Z (60 ns) 18 PAL INPUT TO Hi-Z (25 ns) 
5 CLOCK LOW TO AS HIGH (50 ns) 12 AS HIGH TO DTACK HIGH (70 ns) 19 PAL INPUT TO Hi-Z (25 ns) 
6 CLOCK HIGH TO DS LOW (55 ns) 13 PAL CLOCK TO OUT (25 ns) 
7 CLOCK LOW TO DS HIGH (50 ns) 14 MINIMUM SETUP TIME (20 ns) 
Figure 4. 
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AUTOVECTOR OPERATION TIMING DIAGRAM 


So $2 S4S6 SO S2S4wwwwwwiw w w w S6 S0S2 


a, 
INPUT TO OUTPUT ENABLE (25 ns) 


Figure 5. 


Conclusions 


We have just seen how to implement an interrupt controller 
circuit using one or two PAL devices. This circuit can operate at 
the maximum operating frequency of the 68000 which is 12.5 
MHz. Since the most critical timing of the circuit is the P&L 
device unit’s input to Hi-Z, we see that we can operate the circuit 
with a wide spectrum of frequencies and with slower PAL devi- 
ces. To guarantee operation, timing spec. No. 16 and No. 19 on 
Figure 4 must add up to assert DTACK 20 ns prior to falling edge 
of S4. Thus 55 ns + (PAL device input delay to Hi-Z) should be 
less than, or equal to, 100 ns for a 12.5-MHz clock. We see that 


this qualifies fast, regular, half-power and quarter-power PAL 
devices for this circuit. 


PAL16R4 
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TITLE INTERRUPT CONTROLLER1 
PATTERN INTA.DAT 

REVISION O01 

AUTHOR DANESH TAVANA 

COMPANY MONOLITHIC -MEMORIES, INC. 
DATE 11/06/87 


CHIP INTA PAL16R4 


CLK /INT7 /INT6 /INTS /INT4 /INT3 /INT2 /INT1 /AS GND 





/OC FCO FCl NC /IPLO /IPL1 /IPL2 /DTACK FC2 VCC 
EQUATIONS 
DTACK = VCC _ ;ASSERT IF OUTPUT ENABLE 
DTACK.TRST = FC2*FC1*FCO*AS 
IPL2 := INT7 ;PRIORITY ENCODED BIT 
+ /INT7* INT6 
+ /INT7*/INT6* INTS 
+ /INT7*/INT6*/INT5S* INT4 
IPL1 := INT7 ;PRIORITY ENCODED BIT 
+ /INT7* INT6 
+ /INT7* /INT6*/INT5*/INT4* INT3 
+ /INT7*/INT6*/INTS*/INT4*/INT3* INT2 
IPLO := INT7 ;PRIORITY ENCODED BIT 
+ /INT7* INT6 
+ /INT7*/INT6* INTS*/INT4* INT3 
+ /INT7* /INT6* /INT5S*/INT4*/INT3*/INT2* INTL 


; SIMULATION NOT INCLUDED 
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TITLE INTERRUPT CONTROLLER2 
PATTERN INTC.DAT 

REVISION 01 

AUTHOR DANESH TAVANA 

COMPANY MONOLITHIC MEMORIES, INC. 
DATE 11/06/87 


CHIP INTC PAL20L10 

/AS A3 A2 Al FC2 FCl FCO NC NC NC NC GND 
NC /INTACK1 /INTACK2 /INTACK3 /INTACK4 /INTACKS /INTACK6 /INTACK7 DO 
D1 D2 VCC 

EQUATIONS 


/D2 = /A3 
/D2.TRST = FC2*FC1*FCO*AS 


/D1l = /A2 
/D1.TRST = FC2*FC1*FCO*AS 


/D0 = /Al 
/DO.TRST = FC2*FC1*FCO*AS 


INTACK7 = A3* A2* Al 
INTACK7.TRST = FC2*FC1*FCO*AS 


INTACK6 = A3* A2*/Al 
INTACK6.TRST = FC2*FC1*FCO*AS 


INTACKS = A3*/A2* Al 
INTACKS.TRST = FC2*FC1*FCO*AS 


INTACK4 = A3*/A2*/Al 
INTACK4.TRST = FC2*FC1*FCO*AS 


INTACK3 =/A3* A2* Al 
INTACK3.TRST = FC2*FC1*FCO*AS 


INTACK2 =/A3* A2*/A1 
INTACK2.TRST = FC2*FC1*FCO*AS 


INTACK1 =/A3*/A2* Al 
INTACK1.TRST = FC2*FC1*FCO*AS 


; SIMULATION NOT INCLUDED 
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Memories are an integral component of any computer system. 
PROMs (Programmable Read Only Memory), SRAMs (Static 
Random Access Memory), and DRAMs (Dynamic Random 
Access Memory) are used most often to store programs and data. 
PROMs are used to store programs which can only be read by the 
computer. The SRAMs and DRAMs are used for reading and 
writing data and programs. Figure 1 shows a system processor 
interfaced to PROMs, SRAMs and DRAMs. 


DRAMs are cheaper bulk memory, and are used instead of 
SRAMs in many systems. DRAMs are available in speeds 
ranging from 70 ns to 250 ns access times, enough to keep up 
with the new generation processor speeds. This performance is 
also sufficient to match the usually high speed SRAMs in many 
existing and newdesigns. DRAM densities have also progressed 
from 64K-bit in 1983 to 1M-bit in 1986, and 4M-bit and 16M-bit 
DRAMs are being developed. DRAMs are also available in space 
efficient SIP, SIMM and ZIP (Zig-zag Inline Package) packages 
with modules that can carry up to eight or nine DRAM devices. In 
this discussion we will focus on the design of PROM, SRAM and 
DRAM control circuitry using PLDs. 


CONTROL BUS 


i ADDRESS 
BUS 


PROCESSOR 





413 01 


ADDRESS 
DECODER 


DRAM 
CONTROLLER 


Large memory requirements result in the use of many DRAMs 
which cannot be conveniently located on the same board with the 
CPU. DRAM devices are placed on separate boards and con- 
nected to the processor board via a bus. Off-board, DRAM 
devices mandate the use of specialized outputs to drive the 
address and control traces of the DRAM array. In systems with 
many off-board DRAMs the soft error rate is high and sometimes 
calls for the use of error detection and correction circuits. A 
detailed discussion of EDC (Error Detection and Correction) 
circuit design with PLDs is also included here. 


The proximity of the DRAMs to the processing units depends on 
the hierarchy of the DRAM memoiy inthe system. This hierarchy 
depends heavily on the speed of the DRAM system versus the 
processor speed, and on the required addressing space. When 
the processor speed can be accommodated by the DRAM 
devices, it is efficient to place a DRAM subsystem as close to the 
processor as possible and to use it as the immediate storage 
space. When DRAMs are not fast enough for the processor, they 
must be used as remote, main-storage, while static-RAM-based 
caches are used in the processor’s immediate proximity. A 
review of such cache systems design using PLDs is also included 
here. 








Figure 1. Processor interface to PROMs, SRAMs and DRAMs 
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PROM and SRAM Control 


The control of static RAM or PROM devices is very simple. Both 
of these devices provide achip select input signal which activates 
the devices to read or write data. The chip select controlof these 
devices is enabled by the decoded processor address signals. 
The address bits selected depend upon the range of addresses 
covered by each SRAM or PROM. More detail on address 
decoders can be found on page 2-35. ; 


To optimize the number of productterms used, the address range 
is generally selected so that the boundary addresses are power- 
of-two numbers. From page 2-53, for an address decoder we 
have the following equations: 

For an n-bit address X in the range from B+1 to A-1 (A>X>B) 


A= 0010000-00 (a power-of-two number) 


np 1 
B= 0000011-11 (a power-of-two number minus one) 
nq 1 
CS = /Xn * /Xn-1 * - - — * /Xp * 
(Xp-1 + Xp-2 + - - - + Xq) 


Typical SRAMs also include a read/write input signal which 
selects between a read and.a write operation. This signal is 
usually directly connected to the processor read/write signal. 
PROMs are “read only”, and thus do not require this signal. 


PROM chip select signals should only be asserted for read 
cycles. Attempting a write toa PROM can cause bus contention 
problems where the memory and processor data signals attempt 
to drive the same bus lines simultaneously. This can be easily 
avoided by ensuring that the chip selects are not asserted fora 
processor write cycle. The PROM chip select signals are gated 
with the read signal (R) to insure achip select only when R is low. 


/CS_PROM = (X) * /R ;(X is a 
combination of address signals) 


Most of the “second generation” SRAMs have output enable (OE) 
and write enable (WE) pins in addition tothe chip select (CS) pins. 
A separate and independent OE line eliminates the data bus 
contention problem entirely. The OE line is disabled before 
writing to the SRAM, allowing the data pins to be driven by the 
external bus. The OE line is enabled only during the read cycles. 
Most microprocessors provide a data signal which can be directly 
connected to the OE lines, providing the proper timing to avoid 
bus contention. 


Ww 






zl 


PAL 
DEVICE 
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Figure 2. Controlling an SRAM Without OE 


Some memories so not provide a separate OE pin (Figure 2). 
Generating CS for these memories can cause serious bus 
contention problems. Often this problem is solved by gating the 
CS with read (R) or write (W) signals as follows: 


/CS = (X) * /R 
+ (X) * /W 


/WE = /W 


This approach allows usually sufficient time for other buffers on 
the data bus to be disabled. This works well for the read cycles 
in which the memory is expected to be the only bus driver. 
However, for the write cycles where the processor is usually the 
data bus driver, this does not quite solve the problem completely. 


In the above equation, the CS is generated either from the read 
or the write signal. Ifthe CS line is generated before the WE is 
asserted low, the memory will drive the data lines, which are 
already being driven by the processor, causing bus contention. 
There is still no way to ensure that CS is generated only after the 
WE signal to the memory is asserted. Some delay canbe added 
by controlling the CS line from the WE signal, by gating the W 
signal with the WE signal (for CS) (Figure 3) as follows: 


WE 


cs 
eae aa 413 03 


Figure 3. Waveform for SRAM Contro! Without OE 


/CS = (X) * /R + (X) * /W * /WE 


This ensures that the WE signal of the memory is low before the 
CS is asserted. This solves only part of the problem. Onthe other 
edge, WE should be removed only after CS is removed. This can 
be accomplished by controlling WE from the CS by gating CS 
along with the WE signal: 


/WE = /W * /CS 


This ensures the assertion of WE as long as the CS is asserted 
and W is low. As soon as the CS is unasserted, the WE of the 
memory will be unasserted. These signals CS and WE used for 
controlling each other are simply the feedback from the appropri- 
ate I/O pins. The feedback path provides the extra delay. 
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Access Time Considerations 


PROMs and SRAMs with data access time which match the 
processor speeds are commercially available. Sometimes, 
however, there is a speed mismatch.. Systems with different 
types of memories encounter this problem the most. For such 
systems, different memories generate their own data available 
(DAV) and write complete signals and inform the processor of 
cycle completion. This allows processors to prolong their cycles 
and add wait states, to match the memory speeds. These signals 
are generated by using PAL devices which can generate timing 
delays. Registered PAL devices are ideal candidates for such 
timers. The application note on page 2-184 illustrates one such 
memory design where a PAL device generates the memory 
control timing signals. This design also implements the memory- 
to-processor interface handshake logic. 


DRAM Systems 


In any system that uses dynamic RAMs, special circuitry in the 
address and control lines must be introduced to drive the memory 
chips and link them to the CPU (see Figure 4). Acircuit known as 
a DRAM controller performs several functions: 


+ Multiplexes the row and column addresses 
* Provides access timing 

* Provides refresh timing and addresses 

* Arbitrates between access and refresh 

* Drives the DRAMs’ capacitive load inputs 


DYNAMIC 
MEMORY 


ADDRESS 


DYNAMIC RAM 
INTERFACE 
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Figure 4. Functions of a DRAM Controller 


Multiplexing processor addresses into row and column ad- 
dresses is required because the DRAM architecture divides the 
addresses into rows and columns. This reduces the number of 
pins required for addressing the DRAMs and consequently the 
package size. The external DRAM controller multiplexes the 
addresses and also generates the RAS (Row Address Strobe) 
and CAS (Column Address Strobe) signals to the DRAMs to 
distinguish between these two addresses. Every DRAM access, 
for reading or writing data, requires a predetermined timing 
sequence of RAS, CAS and WE (Write Enable) signals. ADRAM 





controller’s main task is to generate these read and write cycle 
timings. In addition to the regular read and write cycles, many 
DRAMs are available which provide higher data bandwidth read 
and write cycle. Such nibble mode, ripple mode or page mode 
cycles require different sequences of RAS and CAS signals. 
Depending uponthe data requirements, these schemes are used 
for high speed data throughput in various systems. 


DRAMs also require intermittent refresh cycles to charge the 
capacitors for every row of data. This refresh is provided by a 
specific sequence of RAS and CAS signals. There are three 
different refresh cycles, one of which is selected according to the 
system requirements. These are: 


¢ Regular RAS only refresh cycle 
* CAS before RAS refresh cycle 
¢ Hidden refresh cycle. 


The regular RAS-only refresh ‘cycle requires an external refresh 
address generator which sequences through all the row ad- 
dresses of aDRAM. The CAS before RAS refresh cycle is only 
provided by certain DRAMs. These DRAMs are usually costlier, 
since they require internal circuitry for row address generation. 
The hidden refresh cycle DRAMs retain the last read/write cycle 
data at the outputs during the refresh cycles, which is helpful in 
certain designs. 


In addition to the type of refresh cycle, the frequency of refresh 
cycles also varies in different systems. Some systems provide 
intermittent refresh when refresh cycles are executed at fixed 
time intervals. The DRAM accesses are sometimes delayed 
pending an intermittent refresh cycle. With burst mode refresh, 
a number of refresh cycles are completed simultaneously, allow- 
ing the processor an uninterrupted access for read and write 
cycles later. Time critical systems usually find burst mode more 
useful. 


DRAM Controllers and PLDs 


Many DRAM systems make use of a VLSI DRAM controller. 
Most VLSI DRAM controllers perform the DRAM control func- 
tions which do not change greatly form one system to another. 
Excellent representatives of this approach are the 2964, 2968 
and the 673103 Dynamic RAM Controller/Drivers (Figure 5). 


Most VLSI DRAM controllers do not support all of the various 
types of read/write cycles, refresh cycles and refresh timings. 
Custom DRAM controllers are required for the following reasons: 


* Higher throughput bandwidth 

¢ Different memory access times 

* Different refresh cycles and mechanisms 
* Processor related requirements. 


Depending upon the specific system requirements, DRAM con- 
trollers can be made from PLDs which support the specific cycles 
required by a design. 


Fast registered PLDs are used as DRAM controllers in many 
designs. Functions of a DRAM controller are usually split into a 
number of PLDs. The application note on page 2-187 shows a 





at Monolithic ratty Memorles a | 





2-181 





Memory Control 








ROW 
Ro-R9 > ADDRESS 
INPUT LATCH 


ADS 


co-c9 


RC 
AUTO 


RFSH 


RASIN 


B 


ao 


B 


are 


CASINo 


CASIN 


WEIN 


OE 
413 05 





2-182 














ia COLUMN 
ADDRESS 


INPUT LATCH 








AUTO ACCESS 
TIMING & R/C 
GENERATOR 


CAS 
DRIVER Ss 


Figure 5. A VLSI DRAM Controller 
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DRAM controller for a 68020 microprocessor which implements 
a custom data sizing and alignment function required by the 
microprocessor. 


VLSIDRAM controllers use PAL devices for a variety of functions. 
These functions depend upon the capabilities of individual VLSI 
controllers. Three detailed application notes are included in the 
subsequent sections which illustrate the interface of these VLSI 
DRAM controllers to the various microprocessors, The applica- 
tion note on page 2-202 shows an AMD 8088 microprocessor 
interfaced to a 2968 DRAM controller using a PAL16R4. The ap- 
plication note on page 2-210 shows a 68000 microprocessor 
interface tothe 2968. The interface between the DRAMcontroller 
and the CPU accommodates the specific access protocols and 
speeds of the CPU as well as the access and refresh timing of the 
dynamic RAMs. This interface between the controller and the 
rast of the system is best implemented using fast programmable 
logic devices such as the PAL16R8D family devices. 


Error Detection and Correction 


Compared to PROMs and SRAMs, the physical dimensions, 
signallevels, and the stored charge of the DRAM cells are greatly 
reduced to allow higher density. As the stored charge in the 
DRAM cells decreases, the device is more susceptible to soft 
errors caused by alpha particles as well as by environmental 
noise. 


Soft errors are temporary, random, and they can be corrected by 
rewriting into the erroneous cell. Therefore, if the system has the 
ability to locate the bit in error, the soft error canbe corrected. One 
method to locate the bit in error is to append a fixed number of 
check-bits to the data word and store these check-bits along with 
data. This increases the data width of the DRAM system. Upon 
reading, both the data and check-bits are read into the EDAC. 
The check-bits are regenerated from the read data, and these 
newly generated check-bits are compared with the read check- 
bits. The comparison produces the syndrome bits. The syn- 
drome bits constitute abinary number that points to the bit in error. 


This encoding scheme of generating the check-bits and the 
syndrome bits is called a Hamming code, after its inventor. The 
modified Hamming code has one more check-bit than the original 
Hamming code, but it can detect all double-bit errors as well as 
detect and correct all single-bit errors. In addition it can detect a 
substantial number of multiple-bit errors. The modified Hamming 


code has proved particularly useful for its relatively low overhead 
in today’s wide-data-word systems and its capability to detect and 
correct single-bit errors and detect all double-bit errors. 


A number of XOR logic functions are required for generating 
Hamming codes. PAL devices which provide XOR gates are ideal 
candidates for such designs. Details of Hamming codes and a 
sample eight-bit error detection and correction application note 
which uses the PAL16X4 is included on page 2-229. 


Cache Memory Systems 


Fast access time SRAM caches are used for very high-speed 
processors. These allow high system performance while retain- 
ing the low cost of DRAM bulk memory. Based upon how the 
SRAM cache addresses map to the DRAM addresses, three 
major cache schemes are used. 


For fully associative caches, data in a DRAM location may be in 
any cache location. For direct mapped caches, data may be in 
one of the many locations determined by higher order address 
bits. For set associative caches, data may be in one particular 
location of every set. Depending upon the performance required 
and other such system constraints such as cost, board space etc. 
one ofthe three schemes is used. Details of cache system design 
are included in the application note on page 2-239. 


A major task in cache memory systems is replacement, which 
requires transferring data between the cache and the DRAMs. 
This is done in an effort to keep the cache filled with only those 
blocks of memory which are in use by a program. The replace- 
ment task requires a controller with intelligence, which can read 
cache data and write to the DRAMs and vice versa. This requires 
astate-machine-based controller. Since most cache designs are 
custom designs, due to various system constraints, limited VLSI 
solutions are available. State-machine-based controllers such 
as the Am29PL141 and the PMS14R21 can perform these 
functions very effectively. 


Other uses of PLDs in cache memory systems include general 
glue logic which is required for controlling SRAM caches or tag 
buffers. Such PLDs are usually very high-speed combinatorial 
PLDs. Monolithic Memories also provides a number of ECL 
PLDs which can be used in ECLcache memory systems. These 
ECL PAL devices allow 6 ns access times. 
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A typical contro! logic problem is the memory-to-processor 
handshake on memory transfer used in many computer ar 
chitectures. The processor makes a transfer request by activat- 
ing a request line (REQ) and specifies a read or write operation 
on a Read/Write line (R/W). 


During a read operation, the processor waits for a Data Avail- 
able signal at which time the data bus is sampled and the re- 
quest line lowered, thus completing the cycle. During a write 
operation, the processor places data on the bus and waits for a 
Write Complete signal after the write cycle is finished. Upon 
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Figure 1. State Diagram—Memory Handshake Logic 





write complete, the request line is lowered, hence completing 
the cycle. Figure 2 shows the state assignments and the appro- 
priate outputs. The state diagram is shown in Figure 1. Also the 
handshaking operation is illustrated in the timing diagram of 
Figure 3. 


The memory-board logic to implement this function may be de- 
signed with gates and edge-triggered flip-flops as shown in Fig- 
ure 4. This particular design would require about five SSI/MSI 
packages, but the same design can be implemented by a single 
PALI6RP6. The PAL device design specification using state 
equations is shown on page 2-186. 
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Figure 4. Memory Handshake Logic 
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PAL Device Design Specification 


Title Memory Handshake Logic 
Pattern Memoryl .pds 
Revision B 


Author Kelvin Chow/Nick Schmitz 
Company AMD/MMI 
Date 2/28/85 


CHIP Memory PAL16RP6 


CLK ADR1 ADR2 ADR3 ADR4 REQ RW INIT NC GND 
/OE Nc WC cl co WE DA DOUT NC vcc 


STATE MEALY_MACHINE 
DEFAULT_OUTPUT /DOUT /DA 


WAIT = /WE * /We * /C1 * /CO 
READL = /WE * /WC * /C1 * /CO 

READ2 = /WE * /WC * /Cl * /CO 

READ3 = /WE * /WC * /C1 * /CO 

WRITEL = WE * /WC * /Cl * /CO 

COUNT1 = WE * /we * /Cl * CO 

COUNT2 = WE * /WC * Cl * /CO 

COUNT3 = WE * /WC * Cl * CO . 

WRITE2 = WE * We * /Cl * /CO Logic Symbol 
WRITE3 = /WE * WC * /Cl * /CO 


WAIT := C_2 -> WRITE1 + C_l -> READ1 +-> WAIT 
READL := C_| "5 -> WAIT +-> READ2 

READ2 i= C 5 -> WAIT +-> READ3 

READS) :3= C_ 4 <> WAIT + C_5 -> WAIT +-> READ3 
WRITE] s= C_ 5 => WAIT +-> COUNT1 

COUNT1 := C_ "5 -> WAIT +-> COUNT2 

COUNT2 := C | 5 -> WAIT +-> COUNT3 

COUNT3 3= C_| "5 => WAIT +-> WRITE2 

WRITE2 := Ci 5 -> WAIT +-> WRITE3 

WRITES := c: 1 -> WRITE3 + C_4 ~> WAIT + C_5 -> WAIT 
READlL.outf£ := DOUT * /DA -2 

READ2,outf i= DOUT * DA 33 


CONDITIONS 


C_1 = REQ * RW * ADR1 * ADR2 * ADR3 * ADR4 * /INIT 

= REQ * /RW * ADR1 * ADR2 * ADR3 * ADR4 * /INIT 

(/REQ + /ADR1 + /ADR2 + /ADR3 + /ADR4) * /INIT 

(/REQ + /ADR1 + /ADR2 + /ADR3 + /ADR4 + /RW) * /INIT 
= INIT 


aaa,a 


2 
3 
4 
5 


SIMULATION 
TRACE_ON CLK REQ RW DOUT DA WE WC Cl CO 





SETF INIT /REQ OE RW ADR1 ADR2 ADR3 ADR4 


CLOCKF CLK CLOCKF CLK 
CLOCKF CLK CHECK wait /DOUT 
SETF REQ /INIT 

CLOCKF CLK CHECK readl 
CLOCKF CLK CHECK read2 
CLOCKF CLK CHECK read3 
CLOCKF CLK CHECK read3 

SETF INIT 

CLOCKF CLK CHECK wait 
CLOCKF CLK CHECK wait 


CHECK /DOUT /DA 


SETF /INIT REQ /RW 


CLOCKF CLK CHECK writel 
CLOCKF CLK CHECK countl 
CLOCKF CLK CHECK count2 
CLOCKF CLK CHECK count3 
CLOCKF CLK CHECK write2 
CLOCKF CLK CHECK write3 
TRACE_OFF 
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Ever increasingly dense DRAMs and their interface to the emerg- 
ing sophisticated 32-bit microprocessors provide an increasing 
number of system options to the designer. In order to utilize these 
advanced DRAM features and to satisfy the individual system de- 
sign requirements, custom DRAM controllers are required. PAL 
device sequencers, such as the PAL23S8, provide effective con- 
trol sequencing power, flexibility, customizability and ease of use. 
They offer a very desirable alternative to the dedicated function- 
ality of VLSI DRAM controllers. 


The following megabit DRAM controller design based on the 
PAL23S8 sequencer is illustrative of a design which is easily 
customized to a system requirement. !t supports features such as 
data bus sizing and data alignment which are required for the 
emerging 32-bit microprocessors, but which are notyet easily and 
adequately supported by the VLSI controllers. 


A DRAM controller needs to support at least the following func- 
tions: 


¢ Flexible Refresh Generation 

¢ Flexible Arbitration 

« Processor Handshake 

* Processor Cycle Timing and Control 
¢ Refresh Cycle Timing and Control 


Flexible Refresh Generation 
Mechanisms 


For refresh generation, different system designs may require 
either burst mode, intermittent mode or both kinds of refresh. In 
burst mode, refresh requests occur once very 4 milliseconds to 
meet the dynamic RAM’s speed. For a megabit RAM, which 
needs 512 cycles for a complete refresh, all 512 cycles will be 
performed consecutively. The major disadvantage of the Burst 
refresh approach is that memory is “out of service” for along time. 
The intermittent refresh scheme minimizes the out of service time 
by stealing single cycles between processor cycles. This cycle 
stealing is performed by a pre-defined arbitration mechanism for 
prioritization of different cycle requests. This design incorporates 
the arbitration in a manner which allows both intermittent and 
burst mode refreshes. 


Flexible Arbitration Scheme 


The arbitration in most designs involves prioritizing memory 
accesses for: read cycles, write cycles, read-modify-write cycles, 
and refresh cycles. 


The DRAM controller must allow both asynchronous memory and 
refresh cycle requests. It should decide whether a memory cycle 
is anaccess cycle or arefresh cycle. When refresh and processor 
accesses are requested simultaneously, the refresh is given 
priority. If a refresh cycle is in progress and a memory cycle 


request is made, then the memory access is inhibited until the 
refresh is complete. The memory request in this case is kept 
pending by asserting a processor wait signal. 


For a typical intermittent refresh scheme, the cycle requests are 
derived from the oscillator, which operates asynchronously with 
respect to the system clock. The arbiter grants a refresh request 
when no other memory cycle is either occurring or pending. Ifa 
memory cycle is in progress, the arbiter inhibits the refresh cycle 
until the current cycle is completed. However, it needs to 
remember that a refresh request was made and grant that request 
on completion of the processor cycle. The PAL23S8's buried 
registers are very useful in performing such arbitration functions, 
even between asynchronous events. These registers can also be 
used for remembering refresh cycle requests and initiating ac- 
tions later for execution of the refresh cycles. 


However, some designs may require additional functions, such 
as prioritizing different processor requests. Some may require a 
specific kind of handshake mechanism between the processor 
and the DRAM controller. All such functions can be performed in 
the PAL sequencer by using more of chip resources such as I/O 
pins, buried registers and productterms. Onthe otherhandsome 
systems may not require functions such as read-modify-write 
cycles, in which case the PAL sequencer resources can be saved 
and used for other system functions. 


Flexible Handshake Protocol 


The PAL23S8 sequencer can provide a customizable interface to 
the system processor without requiring additional parts. The logic 
required for interface protocols for the APX family or the 68000 
family microprocessors can be very easily built using the internal 
resources of the PAL23S8. The design example below is for 
68020 processor interface signals. 


Processor Cycle Execution 


Different DRAMs can have different processor cycles. Some just 
support the basic read/write cycles, whereas others support 
extended functions such as page mode, ripple mode or static 
column mode cycles. Some DRAMs also support additional 
functions for extended applications such as the transfer cycle in 
VRAMs (DRAMs optimized for video applications). PAL devices 
allow user-definable functionality to be implemented in the de- 
sign, which optimizes resource utilization and improves system 
performance. Another advantage offered by such designs is the 
customizability of the design to the DRAM timing requirements, 
which can vary between -12, -15, and -20 parts significantly. All 
these timing functions can be easily implemented and modified 
using buried state registers. 


Our design implements the basic read, write, and read-modify- 
write cycles required in most designs. The timing required has 
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been calculated for a -12 memory part. The processor cycle 


execution involves the following functions: 


¢ Address multiplexing 
* Timing and control signal generation 


Am29841 ten-bit buffers are used for address multiplexing. The 
multiplexing of the row or column address is under the control of 
the DRAM controller. 


Generation of the timing and control signals RAS, CAS, WE and 
DACK is also the core function of the controller. The controller 
multiplexes the address and produces RAS, CAS, WE in a 
sequence that is understandable by the DRAMs. It also gener- 
ates a handshake signal “DACK’ for the CPU, to indicate whether 
or not the memory is ready to be accessed. 


Refresh Cycle Execution 


DRAMs support different types of refresh cycles. Some DRAMs 
support CAS before RAS refresh. These DRAMs have the 
refresh address counter embedded along with the memory, thus 
eliminating the need for an external counter. Some designs may 
require “hidden refresh”, or the standard RAS only refresh. 
Depending upon the system requirements, the designer can 
optimize the design and resource utilization. For this example, 
the PAL device has been designed to execute a RAS only refresh. 
However, it can be easily modified to support CAS-before-RAS 
refresh, or hidden refresh also, if needed. 


Megabit DRAM Controller with Data 
Alignment and Dynamic Bus Sizing 


In many cases the currently available DRAM controllers do not 
support the esoteric requirements of different system designs. 
The designer is left to make a custom DRAM controller. The 
PAL23S8, with its embedded extra buried state registers, pro- 
vides an excellent tool for designing such a flexible, fast DRAM 
controller. It also offers the right speed for interfacing to the 
emerging 8 and 16 MHz processors, allowing superior synchro- 
nous designs with few or no wait states. 


Some of the emerging 32-bit microprocessors, such as the 68020 
and the 80386, support the concept of dynamic bus sizing and 
mis-aligned data transfers. Presently, there is no DRAM control- 
ler which utilizes the advantages offered by these two features 
effectively. 


Dynamic Data Bus Sizing 


Dynamic bus sizing allows the processor to handle variable width 
data buses (8, 16 or 32 bits) on acycle-by-cycle basis. Dynamic 
bus sizing makes the size of the R/W port or channel transparent 
to the software designers. 


The 68020 allows operand transfers to or from 8, 16, and 32-bit 
ports by dynamically determining the port size during each bus 
cycle. Similarly, 80386 allows operand transfers to or from 16 and 
32-bit ports by determining the port size during each bus cycle. 
During an operand transfer, the slave device signals its port size 
and transfer status (complete or not) to the processor using the 


DSACK inputs for the 68020, and the Ready & BS16 signals for 
the 80386. 


DSACK1 | DSACKO 








insert Wait States 
Complete Cycle 

Data bus port size 8 bits 
Complete Cycle 

Data bus port size 16 bits 
Complete Cycle 

Data bus port size 32 bits 








The design implemented shows a 68020 interface, but can be 
easily modified to accommodate the 80386. The 68020 always 
attempts to transfer the maximum possible amount of data (32 
bits) ina single cycle. Ifthe port responds with a size smaller than 
32 bits, the processor repeats the cycles with the remaining bytes 
ofdata. For eachcycle, the 68020 informs the slave of the number 
of bytes it is attempting to transfer by two signals SIZO, SIZ1, as 
follows: 





Long Word 





Due to various considerations, memories with different data width 
can exist on the same system. For example, itis often preferable 
to have smaller external bus sizes, while retaining a 32-bit internal 
memory size on a board. Such instances are common when 
using DMA in microprocessor based systems. Varied memory 
widths are also required in graphics and image processing 
systems. PAL devices allow customizable dynamic memory data 
bus sizing based on various system requirements—such as 
address range (address signals) or jumper selectable static 
control for increasing memory width. 


Mis-Aligned Transfers 


Dynamic sizing is used in conjunction with address bitsAO andA1 
to also support mis-aligned data transfers; for example, a word 
write to along word data bus size with an address offset of 1 byte 
(A0/A1=01). The 68020 and the 80386 contain internal bus 
multiplexers to route the data to the correct data lines. Both of 
these also generate extra cycles necessary to complete a mis- 
aligned transfer. However, external circuitry is still needed to 
generate the correct write enables and chip selects to support 
both these functions. PAL devices, however, can be customized 
to support both these functions easily. 





OFFSET 
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By decoding AO and Aj, the two transfer data size signals SIZEO, The DRAM Controller 
SIZE1, and the size of the port being addressed, PAL devices can 


easily support dynamic bus sizing and generate the appropriate The DRAM controller consists of two PAL23S8s: the first is called 
chip select signals for the four banks of memory. the Timing and Arbitration PAL device, and the second device is 


er ; called the Data Sizing and Alignment PAL device (Figure 2). 
The memory organization selected (Figure 1) allows access to 


individual data bytes under the controlofthe CAS signal. The four 
banks of memory are addressed by the four RAS signals. 


CASO CAS1 —_-CAS2 CAS3. EACH BANK OF 
1 MX32 BITS 





20 BIT ROW AND 
COL ADDRESS TO 
ALL FOUR BANKS 





D31-D24 D23-D16 D15-D8 D7-D0 





441 02 
Figure 1. Memory Organization 
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Figure 2. A Megabit DRAM Controller 
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The Timing and Arbitration Controller 


The timing and arbitration PAL device performs the following 
functions: 


* Arbitration between current cycle, Refresh cycle & CPU cycle 
* Read and Write Cycle Execution 

¢ Read-Modify-Write Cycle Execution 

¢ Refresh Cycle Execution 

* Interface Signaling 


The timing and arbitration PAL device arbitrates between the 
processor request MREQ signal and the refresh request RFCLK, 
giving the highest priority first to the current cycle, next to the 
refresh request and finally to the processor cycle. This allows 
support for both intermittent refresh and burst mode refresh. it 
also uses a mechanism to store the refresh requests when the 
processor cycle is in progress for servicing later. 


This device also generates the appropriate ROWADD and 
COLADD signals to multiplex the row and column addresses onto 
the memory address bus respectively. During refresh cycles it 
asserts the RFSH signal enabling the refresh row address onto 
the DRAMs. 


This PAL device executes the read and write cycles as well as 
read-modify-write cycles. The selection between these two is 
under the control of the processor signal RMC. The timing of the 
different cycles is under the control of internal registers. 


The timing requirements of the different cycles are shown in 
Figure 3. The operation of the state machine to perform all these 
functions is also shown in Figure 4. This state diagram has been 
derived directly from the timing diagrams and the arbitration 
requirements of the design. This state machine is implemented 
by five of the six available buried state registers on the device, and 
is used mainly for timing and arbitration functions. The sixth 
buried state register implements a flag function and is repre- 
sented as a small independent state machine, as described 
below. The state description (operation) has been written in 
CUPL software which generates the JEDEC file directly for 
programming the device. 


Arbitration Between Current Cycles, Refresh Cycle 
& CPU Cycle 


The request for a refresh is received by signal line RFCLK. The 
CPU requests a cycle by asserting MREQ line. If a cycle is in 
progress as indicated by states other than state 0, no arbitration 
for afresh cycle is performed. In such acase, the PAL device just 
continues to execute the current cycle. This effectively implies 
that the current cycle is given the most priority and no other cycle 
is initiated until its completion. 


When the current cycle is complete (indicated by state 0) the PAL 
device arbitrates between the MREQ signal and the buried 
register B5 flag. This buried register flag is used to latch the 
request for refresh (RFCLK) which is usually only one clock cycle 
duration. The operation of this flag is controlled by asecond small 
state machine (two states only) inthe same device. When RFCLK 
is high this state machine toggles to state 1 (B5=1) and stays in 
that state till the refresh cycle is executed (indicated by RFSH 
signal low), and then it toggles back to state 0 (B5=0) preparing 
itself for the next refresh request RFCLK. 


For the first state machine, the arbitration begins on state 0. If BS 
is high, a refresh cycle is given priority and the state machine 
jumps to state 16 to execute a refresh cycle. If onthe other hand 
there is no refresh request pending (indicated by the B5 flag being 
equal to 0) and MREQ is asserted, a processor cycle is started by 
jumping to state 1. If none of these conditions exists the state 
machine stays in state 0, polling for any of these events to occur. 
Such ascheme can support both burst mode and the intermittent 
refresh. 


Processor Read Write Cycle Execution 


When the processor cycle wins the arbitration, indicated by state 
1, the timing and control signals required for the processor cycle 
are initiated. 


There are four memory banks in the design, each with a capacity 
of 4 Mbytes of memory organized as 1M X 32 bits (Figure 1). The 
Timing & Arbitration PAL device generates the 4 RASO-RAS3 
signals, one for each memory bank. The selection of one of these 
four RAS signals depends upon the two input signals ADSELO 
and ADSEL1. In most systems these would constitute the high 
bits of the system processor address bus. 


After assertion of the appropriate RAS signal the state machine 
jumps to state 2 where it manipulates its address handling 
signals. In state 2 it removes the ROWADD signal and asserts the 
COLADD signal to allow assertion of the column address to the 
memory. At the same time it asserts another signal, SIG, which 
is used to inform the Data Sizing and Alignment PAL device of a 
processor cycle request, and allows the Data Sizing and 
Alignment PAL device to generate the appropriate CAS signal to 
the memory. 


The state machine then sequences to state 4, allowing for the 
data access time. In state 4 it decides between a read write cycle 
or aread-modify-write cycle, based on the processor signal RMC; 
it jumps either to state 5 for a read or write cycle or to state 8 (for 
read-modify-write cycle. For read or write cycle it then removes 
the RAS signals and waits for another clock period for the RAS 
precharge time before completing the cycle. Simultaneously it 
also removes COLADD signal and asserts ROWADD signal 
preparing the memory for the next processor cycle. The SIG 
signal to the data sizing and alignment PAL device is also 
removed. The DSACK signal (acknowledge to the processor) 
and the WE signals (for a write cycle) are generated if and when 
required by the Data Sizing and Alignment PAL device. 


Processor Read-Modify-Write Cycle Execution 


When the processor requests a read-modify-write cycle by as- 
serting the RMC signal, the Timing and Arbitration PAL device 
automatically increases the length of the cycle. Itkeeps the RAS 
asserted for the appropriate duration of a read-modify-write 
cycle. Atthe sametime the Data Sizing and Alignment PAL device 
generates the appropriate WE and DSACK signals. After this 
extended RAS assertion the Timing and Arbitration PAL device 
completes the cycle, similar to a read or write cycle. 





Refresh Cycle Execution 


When the arbitration is won by the refresh cycle, the timing and 
arbitration control PAL device executes a RAS-only refresh cycle. 
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Figure 3. The Timing Diagram 
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Figure 4. State Diagram for Timing and Arbitration PAL Device 
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On state 16 it removes the ROWADD signal and asserts the 
RFSH signal which applies the refresh row address to the 
memory address bus. In state 17 it generates all four 
RASO-RASS signals, refreshing all the banks of the memory. It 
then counts for a few states (depending upon the refresh timing 
requirements of the memory) before removing the RAS and 
RFSH signals. On removal of the RFSH signal the external 10- 
bit refresh row address counter (PAL22V10) is also incremented 
for the next row. 


Interface Signaling 


One additional function performed by this PAL device is to inform 
the Data Sizing and Alignment PAL device of the execution timing 
reference of a processor cycle. This it does by asserting /SIG 
signal low for all processor cycles. This signal is basically used 
as a synchronizing signal between these two PAL devices doing 
two independent functions. 


Data Sizing and Alignment PAL Device 


The Data Sizing and Alignment PAL device performs the following 
functions: 


1. CAS generation for dynamic sizing & alignment. 
2. DSACK generation for dynamic sizing & alignment. 
3. WE generation for write and read-modify-write cycles. 


READ-MODIFY-WRITE 
CYCLE 


Its state diagram is shown in Figure 5. This state machine is 
synchronized to the operation of the state machine of the Timing 
and Arbitration PAL device by using signal SIG. This PAL device 
has no function for arefreshcycle. Foraprocessorcycle it asserts 
the appropriate CASO-CAS3 and WE signals and generates the 
cycle complete signals DSACKO-DSACK1. The state machine 
functions are implemented in the four buried state registers on 
this device. 


CAS Generation 


The state machine starts at the initial polling state 0, and remains 
there untilit receives the SIG signal from the timing and arbitration 
PAL device indicating the start of a processor cycle. The maxi- 
mum allowable data width for the 68020 processor is 32 bits. The 
data bus is split into four byte wide segments. The CAS signals 
for these four byte segments are independently controlled, allow- 
ing independent byte-wide read and write capability. These four 
CAS signals allow read and write for low-low, low-middle, high- 
middle and high-high bytes of a 32-bit data port. For a 16-bit data 
port size, CAS 2-3 provide the low and high byte strobes, 
respectively. For an 8-bit port, only CASS is used. 


In state 1 the CAS signals are asserted. The CAS signals 
asserted depend upon the PORT size available, the size of data 
being transferred. andthe alignment of the transfer. As explained 
before, the attempted transfer data size is encoded on two SIZO, 


COUNT = 1 


READ AND 
WRITE CYCLES 
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SIZ1 bits by the processor, and the alignment is indicated by the 
least significant address bits AO, A1 of the processor. 


The port size is very application dependent. Different systems 
might require either static or dynamic selection of this port size. 
For example, the designer can define different address ranges 
where the port size may be different. Alternatively, jumpers can 
also define the port size. A general-purpose means for this has 
been selected, in the form of two input pins PRTO & PRT1, which 
define the size of the port. 


Port Size 


The size of the data bus is decided based on the input pins PRT1, 


PRTO. 
pare | a [see | 


8 bits 

16 bits 
16 bits 
32 bits 








For 32-bit ports, all four CAS signals need to be asserted. Fora 
16-bit port only two CAS signals need to be asserted. For an 
eight-bit port only CAS3 is asserted. As mentioned before, the 
processor always attempts to transfer the maximum possible 
amount of data. Its SIZO-1 signals encode the amount of data 
being transferred. The table in Figure 6 illustrates the relationship 
of the data size, port size and alignment which is decided by 
address bits A1, AO. 


TRANSFER 
SIZE 











The data from this table is directly used in the state description 
syntax to generate the CAS signals for state 1. The software 
compiles the Boolean equations for this logic directly. 


Once asserted, the CAS signals are held by the state machine 
until the appropriate read, write, or read-modify-write cycles are 
complete. 


DSACK Acknowledge to CPU 


The amount of data receivable by any device is dependent upon 
its port size. Based on the port size (PRTO-1) the encodings for 
DSACK0-1 are generated to acknowledge the CPU and inform it 
ofthe port size. The timing of acknowledge is synchronized by the 
state machine. For read and write cycles, the DSACK signals are 
generated in state 1 and removed on cycle completion in state 4. 


For a read-modify-write cycle, a complication exists. The read- 
modify-write cycle requires two DSACK signals; one for read and 
one for a write. The first DSACK is generated in state 1 and 
removed in either state 4 or state 5. The second is generated in 
state 8 and removed in state 11. 


WE Generation 
In state 1 the WE is generated for normal write cycles. For read- 


modify-write cycles, a delayed WE signal is generated based 
upon the internal four-bit state counter in state 8 (Figure 3). 


DATA BUS ACTIVE SECTIONS 
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Figure 6. Data Bus Activity for Byte, Word and Long Word Ports 


ot Monolithic ra Memorles Pm | 





2-193 





PG6h-2 


9 SO}JOWOIN TTY) OFDHOUOWN "a 


NAME MEGABIT DRAM TIMING & ARBITRATION CONTROLLER; 
PARTNO AmPAL 23S8 ; 

DATE 03/24/86 ; 

REV 01; 


DESIGNER KAPIL SHANKAR ; 
COMPANY ADVANCED MICRO DEVICES ; 
ASSEMBLY NONE ; 

LOCATION SUNNYVALE, CA ; 

DEVICE P2388 ; 


[BERERERERRERERARRREREREREREERRRRERRERREREREEREEREREREREREREREREREE | 


/* */ 
/* */ 
/* */ 
[PRRRRERIRERERRE RAEI KIRA R ERIKS RIAD IIA IAAI IAI I III TIA IAIAII IIIS | 
/* Allowable Target Device Types: THE ONLY ONE */ 


[RERERRRERERRREIEREIARIRRERIIE IRE IRIE IIIA IAI IAI I III III IIIA III IIIA 


/** Inputs **/ 


PIN 2 = MREQ ; 
PIN 3 = ADSELO ; 
PIN 4 = ADSEL1 ; 
PIN 5 = RMC; 
PIN 6 = RFCLK ; 
PIN 7 = INIT; 


/** Outputs **/ 


PIN 19 = ROWADD ; 
PIN 18 = IRASO ; 
PIN 17 = ICOLADD ; 
PIN 16 = §RAS1 ; 
PIN 15 = §RAS2 ; 
PIN 14 = IRFSH ; 
PIN 13 = IRAS3 ; 
PIN 12 = ISIG; 


NOOE [80,81,82,B3,B4,B5] 


/** Declarations and Intermediate Variable Definitions **/ 


FIELD STMA = 
FIELD STMB = 
FIELD BANK = 


BANKO 
BANK1 
BANK2 
BANK3 


SDEFINE 
SDEFINE 
SDEFINE 
SDEFINE 
SOEFINE 
SDEFINE 
SOEFINE 
SDEFINE 
SDEFINE 
SOEFINE 
SOEFINE 
SDEFINE 
SOEFINE 
SDEFINE 
SDEFINE 
SOEFINE 
SDEFINE 
SDEFINE 
SOEFINE 
SOEFINE 
SDEFINE 
SOEFINE 
SDEFINE 


BANK 
BANK 
BANK 
BANK 


SAO 
SAI 
SA2 
SA3 
SA4 
SA5 
SA6 
SAT 
SA8 
SA9 
SA10 
SA11 
SA12 
SA13 
SA14 
SA15 
SA16 
SA17 
SA18 
SA19 
SA20 
SA21 
SA22 


[80,81,82,B3,B4) 


[B5] ; 


[ADSEL1, ADSELO) 


WN = oO 
we Se me te 


'b'00000 
'b'00001 
'b'00010 
"b'00011 
'b'00100 
'b'00101 
'b'00110 
'b'00111 
'b'01000 
'b'01001 
'b'01010 
'b'01011 
'b'01100 
'b'01101 
'b'01110 
'b'01111 
'b' 10000 
'b' 10001 
'b' 10010 
'b'10011 
'b' 10100 
'b'10101 
'b'10110 


Figure 7. Source Code for Timing and Controller PAL Device 
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/** STATE MACHINE A **/ 
/** STATE MACHINE B **/ 
/** MEMORY BANK ADDRESSED **/ 


/** STATE 
/** STATE 


[** 


STATE 


/** STATE 


/** 
/** 


/** 


STATE 
STATE 
STATE 


/** STATE 


[** 

/** 
/** 
/** 
/** 
[** 
[** 
[** 
{** 
/** 
[** 
/** 
/** 
[** 


[** 


STATE 
STATE 


STATE 
STATE 
STATE 
STATE 
STATE 
STATE 
STATE 
STATE 
STATE 
STATE 
STATE 
STATE 
STATE 


0 wey 

1 wey] 

2 baad A 

3 wR, 

4 ak] 

5 ae] 

6 wey] 

7 ae] 

8 ney 

9 ney 
10 **/ 
11 **/ 
12 **/ 
13 **/ 
14 **/ 
15 baled A 
16 **/ 
17 **/ 
18 baked J 
19 Label A 
20 **/ 
21 ae] 
22 aR] 
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PRESENT SA2 IF BANKO NEXT SA3 OUT RASO 


’ 

SDEFINE $SBO 'b'0O /** STATE 0 **/ IF BANK1 NEXT SA3 OUT RAS1 ; 
SDEFINE SB1 ‘b*'1 /** STATE 1 **/ IF BANK2 NEXT SA3 OUT RAS2 ; 
IF BANK3 NEXT SA3 OUT RAS3 ; 

/** Logic Equations **/ NEXT SA3 OUT ROWADD 
OUT COLADD 

/** THE FIRST STATE MACHINE **/ OUT SIG ; 


/** MEMORY ACCESS TIME DELAY **/ 
SEQUENCE STMA { 


PRESENT SA3_ IF BANKO NEXT SA4 OUT RASO ; 
/** POLLING FOR PROCESSOR OR REFRESH CYCLES **/ IF BANK1 NEXT SA4 OUT RAS! ; 
IF BANK2 NEXT SA4 OUT RAS2 ; 
PRESENT SAO IF !B5 & !MREQ & BANKO NEXT SA1_ OUT RASO ; IF BANK3 NEXT SA4 OUT RASS ; 
IF 'B5 & !MREQ & BANK1 NEXT SA1 OUT RAS1 ; NEXT SA4 OUT ROWADD 
IF !B5 & !MREQ & BANK2 NEXT SA1 OUT RAS2 ; OUT COLADD 
IF !B5 & !MREQ & BANK3 NEXT SA1 OUT RAS3 ; OUT SIG ; 
/** PROCESSOR CYCLE **/ 
/** JUMP TO STATE SA1 **/ 
/** ASSERT THE RAS FOR THE MEMORY BANK ADDRESSED **/ ‘PRESENT SA4_ IF BANKO & RMC NEXT SAS OUT !RASO ; 
IF BANK1 & RMC NEXT SAS OUT !RAS1 ; 
IF B5 NEXT SA16 OUT ROWADD ; IF BANK2 & RMC NEXT SAS OUT !RAS2 ; 
OUT RFSH ; IF BANKS & RMC NEXT SA5 OUT IRAS3 ; 
/** REFRESH CYCLE **/ IF RMC NEXT SA5 OUT !ROWADD 
/** JUMP TO STATE SA16 **/ OUT !COLADD 
/** REMOVE ROW ADDRESS AND SET UP REFRESH ADDRESS bated J OUT SIG ; 


/** DISTINGUISH BETWEEN READ WRITE OR READ MODIFY WRITE CYCLES **/ 


DEFAULT NEXT SAO ; /** IF RMC HIGH EXECUTE AS READ WRITE CYCLE **/ 
/** ELSE POLL AGAIN **/ /** JUMP TO STATE SAS **/ 
/** STAY-IN STATE SAO **/ /** ALSO REMOVE COLUMN ADDRESS AND APPLY ROW ADDRESS **/ 
/** ALSO REMOVE THE RAS SIGNAL **/ 
/** THIS IS THE PROCESSOR CYCLE **/ IF BANKO & !RMC NEXT SA8 OUT RASO ; 
IF BANK? & !RMC NEXT SA& OUT RAS1 ; 
PRESENT SA1_ IF. BANKO NEXT SA2 OUT RASO ; IF BANK2 & !RMC NEXT SA8 OUT RAS2 ; 
IF BANK1 NEXT SA2 OUT RAS1 ; IF BANKS & !RMC NEXT SAS OUT RAS3 ; 
IF BANK2 NEXT SA2 OUT RAS2 ; IF !RMC NEXT SA8 OUT !ROWADD 
IF BANK3 NEXT SA2 OUT RAS3 ; OUT ICOLADD 
NEXT SA2 OUT ROWADD OUT SIG ; 
OUT COLADD /** EF RMC LOW EXECUTE READ MODIFY WRITE CYCLE **/ 
OUT SIG ; /** JUMP TO STATE SA8 **/ 
/** REMOVE ROW ADDRESS AND SET UP COLUMN ADDRESS **/ /** ALSO REMOVE COLUMN ADDRESS AND ASSERT ROW ADDRESS **/ 


/** ASSERT SIG TO INFORM DATA SIZING & ALIGNMENT PAL OF PROCESSOR CYCLE **/ 


Figure 7. Source Code for Timing and Controller PAL Device (Cont’d.) 
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PRESENT SAS 
/** RAS PRECHARGE TIME DELAY **/ 


PRESENT SA6 


PRESENT SA7 


NEXT SA6 


NEXT SA7 


NEXT SAO 


/** PROCESSOR READ WRITE CYCLE COMPLETE **/ 


/** THIS IS THE READ MODIFY WRITE CYCLE **/ 


PRESENT SA8 IF BANKO 
IF BANK1 
IF BANK2 
IF BANKS 


NEXT SA9 
NEXT SA9 
NEXT SA9 
NEXT SA9 
NEXT SA9 


/** ACCESS TIME DELAY FOR BOTH READ AND WRITE **/ 


PRESENT SA9 IF BANKO 
IF BANK1 
IF BANK2 
IF BANKS 


PRESENT SA10 IF BANKO 
IF BANK1 
IF BANK2 
IF BANK3 


PRESENT SA11 IF BANKO 
IF BANK1 
IF BANK2 
IF BANK3 


PRESENT SA12 IF BANKO 
TF BANK1 
IF BANK2 
IF BANK3 


NEXT SA10 
NEXT SA10 
NEXT SA10 
NEXT SA10 
NEXT SA10 


NEXT SA11 
NEXT SA11 
NEXT SA11 
NEXT SA11 
NEXT SA11 


NEXT SA12 
NEXT SA12 
NEXT SA12 
NEXT SA12 
NEXT SA12 


NEXT SA13 
NEXT SA13 


OUT !SIG ; 


me 


OUT 
OUT 
OUT 
OUT 
OUT 


OUT 
OUT 
OUT 
OUT 
OUT 


OUT 
OUT 
OUT 
OUT 
OUT 


OUT 
OUT 
OUT 
OUT 
OUT 


OUT 
OUT 


NEXT SA13 OUT 


NEXT SA13 
NEXT SA13 


OuT 
QUT 


RASO 
RAS1 
RAS2 
RAS3 
SIG 


RASO 
RAS1 
RAS2 
RAS3 
SIG 


RASO 
RAS1 
RAS2 
RAS3 
SIG 


RASO 
RAS1 
RAS2 
RAS3 
SIG 


RASO 
RAS1 
RAS2 
RAS3 
SIG 


me Me 8s te 


es Se Me me te me he me Me te me Me Me te me 
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/** THIS IS THE READ WRITE CYCLE **/ 


PRESENT SA13 IF BANKO NEXT SA14 OUT 
IF BANK1 NEXT SA14 OUT 
IF BANK2 NEXT SA14 OUT 
IF BANKS NEXT SA14 OUT 


NEXT SA14 OUT 
/** REMOVE THE RAS SIGNAL **/ 


PRESENT SA14 NEXT SA15 OUT 
/** RAS PRECHARGE TIME DELAY **/ 


PRESENT SA15 NEXT SAO ; 


/** PROCESSOR READ MODIFY WRITE CYCLE COMPLETE **/ 


/** THIS IS THE MEMORY REFRESH CYCLE **/ 


PRESENT SA16 NEXT SA17 OUT 
OUT 
OUT 
OUT 
OUT 
OUT 
/** ASSERT ALL FOUR RAS SIGNALS **/ 


PRESENT SA17 NEXT SA18 OUT 
OUT 
OUT 
OUT 
OUT 
OUT 
/** REFRESH TIME DELAY **/ 


PRESENT SA18 NEXT SA19 OUT 
OUT 
OUT 
OUT 
OUT 
OUT 


Figure 7. Source Code for Timing and Controller PAL Device (Cont’d.) 


!RASO 
!RAS1 
!RAS2 
!RAS3 
SIG 


ISIG ; 


RASO 
RAS1 
RAS2 
RAS3 
ROWADD 
RFSH ; 


RASO 
RAS1 
RAS2 
RAS3 
ROWADD 
RFSH ; 


RASO 
RAS1 
RAS2 
RAS3 
ROWADD 
QFSH > 
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PRESENT SA19 NEXT SA20 OUT 
OUT 
OUT 
OUT 
OUT 
OUT 


/** REMOVE ALL RAS SIGNALS **/ 
/** ALSO REMOVE REFRESH ADDRESS AND ASSERT ROW ADDRESS **/ 


PRESENT SA20 NEXT SA21 ; 
/** RAS PRECHARGE TIME DELAY **/ 


PRESENT SA21 NEXT SA22 ; 
PRESENT SA22 NEXT SAO ; 


/** TERMINATE MEMORY REFRESH CYCLE **/ 
/** GO BACK TO POLLING STATE **/ 

) 

/** THE FIRST STATE MACHINE COMPLETE **/ 


/** THE SECOND STATE MACHINE **/ 
SEQUENCE STMB € 


PRESENT SBO IF RFCLK NEXT SB1 ; 

DEFAULT NEXT SBO ; 
/** JUMP TO STATE SB1 **/ . 
/** REMEMBER REFRESH REQUEST **/ 


PRESENT SB1_ IF RFSH NEXT S80 ; 
DEFAULT NEXT SB1 ; 

/** JUMP TO STATE SBO **/ 

/** REFRESH CYCLE EXECUTED THUS RETURN **/ 


w 


/** THE SECOND STATE MACHINE COMPLETE **/ 


TRASO 
IRAS1 
IRAS2 
IRAS3 
1 ROWADD 
IRFSH ; 


/** EQUATIONS FOR SYSTEM INITIALIZATION AFTER RESET **/ 


(B0..B5].ar = 


RASO.AR = 
RAS1.AR = 
ROWADD.AR = 
COLADD.AR = 
RAS2.AR = 
RAS3.AR = 
RFSH.AR = 
SIG.AR = 


/* THESE ARE 


RASO.OE = 
RAS1.0E = 
ROWADD.OE = 
COLADD.OE = 
RAS2.0E = 
RAS3.0E = 
RFSH.OE = 
SIG.OE = 


INIT 5 


HINIT 
tINIT 
{INIT 
!INIT 
HINIT 
VINIT 
tINIT 
LINIT 


THE OUTPUT ENABLES */ 


'B'l 
‘Bd 
"Bil 
'B'l 
"Bit 
'B'1 
‘Bt 
"Bt 


me he Me Me Me MR Me Oe 


Ce ee Te Te ee eT ee) 


Figure 7. Source Code for Timing and Controller PAL Device (Cont'’d.) 
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NAME MEGABYT DRAM DATA SIZING & ALIGNMENT CONTROLLER; 
PARTNO = AMPAL 2388 ; 

DATE 03/24/86 ; 

REV o1; 


DESIGNER KAPIL SHANKAR ; 

COMPANY ADVANCED MICRO DEVICES ; 
ASSEMBLY NONE ; 

LOCATION SUNNYVALE, CA ; 

DEVICE P2388 ; 


[RRRRRERERRRERREREEKERERREREEEERERERERERERERREERERRERERERREREEREEEEE J 


/* */ 
/* */ 
i bss */ 
[RRRRRERRRIREKRIER ER EEREREERIEIRERERERK AREER EERE EEEKEEIR ERE EERIE REAEE 
/* Allowable Target Device Types: THE ONLY ONE */ 


[RARRRARRERREREREREIERIERIERREREREREREIER EKER EEE ARER EERIE REEEEEIERE | 


/** Inputs **/ 


PIN 2 = SIG; 
PIN 3 = AO; 

PIN 4 = Al; 

PIN 5 = RW; 

PIN 6 = RMC; 
PIN 7 = $120; 
PIN 8 = siz; 
PIN 9 = PRTO; 
PIN 11 = PRI1; 
PIN 12 = INIT; 


/** Outputs **/ 


PIN 19 = IWE ; 

PIN 18 = !CAS2 ; 
PIN 17 = 1CASO ; 
PIN 16 = !CAS3 ; 
PIN 15 = !CAS1 ; 
PIN 14 = !DSACK1 ; 
PIN 13 = IDSACKO ; 


NODE [80,81,82,B3] ; 


/** Declarations and Intermediate Variable Definitions **/ 


FIELD STM = (80,81,B2,B3] 


FIELD PORTSIZE = [PRT1, PRTO] 


PORTS 
PORT16 
PORT32 


/** DEFINE THE STATES **/ 


SDEFINE 
SDEFINE 
SDEFINE 
SOEFINE 
SOEFINE 
SOEFINE 
SOEFINE 
SOEFINE 
SOEFINE 
SOEFINE 
SOEFINE 
SOEFINE 


/** Logic Equations 


/** THE STATE MACHINE **/ 


SEQUENCE STM € 


PORTSIZE : 0 
PORTSIZE : 2 
PORTSIZE : 3 


SAO 
SA1 
SA2 
SA3 
SA4 
SAS 
SA6 
SA7 
SA8& 
SA9 
SA10 
SA11 


"b'0000 
'b'0001 
*b'0010 
'b'0011 
'b'0100 
'b'0101 
'b'0110 
'b'0111 
'b'1000 
'b'1001 
'b'1010 
'b'1011 


=e te Me 


we] 


[** 


/** 
[** 
/** 
/** 


/** 


/** 
/** 
[** 
/** 
/** 
/** 
/** 
/** 
/** 
/** 
/** 


[** 


THE STATE MACHINE **/ 


SIZE OF THE PORT **/ 
INTERFACED TO PROCESSOR **/ 
8 BIT WIDE PORT **/ 
16 BIT WIDE PORT **/ 
32 BIT WIDE PORT **/ 


STATE O **/ 
STATE 1 **/ 
STATE 2 **/ 
STATE 3 **/ 
STATE 4 **/ 
STATE 5 **/ 
STATE 6 **/ 
STATE 7 **/ 
STATE 8 **/ 
STATE 9 **/ 
STATE 10 **/ 
STATE 11 **/ 


/** POLLING FOR SIG INDICATING A PROCESSOR CYCLE **/ 


Figure 8. Source Code for Data Sizing and Alignment PAL Device 
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Teas PORT DATA OFFSET STATE OUTPUT **/ 
PRESENT SAO 
/** FOR PORTSIZE 8 **/ 

IF !SIG & PORTS NEXT SA1 OUT CAS3 ; 


/** GENERATE ACKNOWLEDGE **/ 
IF ISIG & PORTS NEXT SA1 OUT DSACKO ; 


/** FOR PORTSIZE 16 **/ 
IF !SIG & PORT16 & !A0 NEXT SA1 OUT CAS3 ; 
IF !SIG & PORT16 & !S1Z0 & S1Z1 & AQ NEXT SA1 OUT CAS2 ; 


/** GENERATE ACKNOWLEDGE **/ 


IF 'SIG & PORT16 NEXT SA1 OUT DSACK1 ; 


/** FOR PORTSIZE 32 **/ 
/** BYTE TRANSFER SIZE **/ 


IF §SIG & PORT32 & IA1 & !A0 NEXT SA1 OUT CAS3 
IF ¥SIG & PORT32 & 1A1 & AO NEXT SA1 OUT CAS2 
IF !SIG & PORT32 & !A1 & !S1Z0 NEXT SA1 OUT CAS2 
IF !SIG & PORT3S2 & !A1 & SIZ1 NEXT SAT OUT CAS2 
IF ISIG & PORT32 & Ai & !A0 NEXT SA1 OUT CAS1 


IF !SIG & PORT32 & 1A1 & $S1Z1 & !S1Z0 NEXT SA1 OUT CAS1 
IF §SIG & PORT32 & !A1 & S1Z1 & SIZ0 NEXT SA1 OUT CAS1 
IF !SIG & PORT32 & !A1 & AO & !S12 NEXT SA1 OUT CAS1 
IF !SIG & PORT32 & Al & AO NEXT SA1 OUT CAS3 
IF ISIG & PORT32 & $8121 & !S1Z0 NEXT SA1 OUT CAS2 
IF #SIG & PORT32 & AO & S1Z1 & $1Z0 NEXT SA1 OUT CASL 


Ce ee ee ee i ee Pe Te Ty 


/** GENERATE ACKNOWLEDGE **/ 
IF !SIG & PORT32 NEXT SA1 OUT DSACKO 
OUT DSACK1 ; 


/** JUMP TO STATE SA1 **/ 
/** ASSERT THE CAS FOR THE RIGHT PORTSIZE DATASIZE AND ADDRESS 


IF !SIG & !RW & RNC NEXT SA1 OUT WE ; 
/** GENERATE WE FOR READ WRITE CYCLE ONLY **/ 


IF SIG NEXT SAO ; 


OFFSET **/ 


/** ELSE WAIT FOR THE PROCESSOR CYCLE **/ 
/** POLL AGAIN **/ 

/** STAY IN STATE SAO **/ 

/** THIS IS THE PROCESSOR CYCLE START **/ 


PRESENT SA1 


IF CASO NEXT SA2 OUT CASO ; 
IF CAS1 NEXT SA2 OUT CAS1 ; 
IF CAS2 NEXT SA2 OUT CAS2 ; 
IF CAS3 NEXT SA2 OUT CAS3 ; 
IF IRW & RMC NEXT SA2 OUT WE ; 
IF PORTS NEXT SA2 OUT DSACKO 
IF PORT16 NEXT SA2 OUT DSACK1 
IF PORT32 NEXT SA2 OUT DSACKO 
OUT DSACK1 
NEXT SA2 ; 


/** HOLD CAS GENERATED IN THE LAST STATE **/ 


/** IF WRITE CYCLE AND NOT READ MODIFY WRITE CYCLE ASSERT WE **/ 


/** JUMP TO STATE SA2 **/ 


PRESENT SA2 


IF CASO NEXT SA3 OUT CASO ; 
IF CASi NEXT SA3 OUT CAS1 ; 
IF CAS2 NEXT SA3 OUT CAS2 ; 
IF CAS3 NEXT SA3_ OUT CAS3 ; 
IF RW & RMC NEXT SA3_ OUT WE ; 
IF PORT8 NEXT SA3 OUT DSACKO 
IF PORT16 NEXT SA3 OUT DSACK1 
IF PORT32 NEXT SA3 OUT DSACKO 
OUT DSACK1 
NEXT SA3 ; 


/** HOLD CAS GENERATED IN THE LAST STATE **/ 
/** HOLD WE FOR ACCESS DURATION **/ 
/** JUMP TO STATE SA3 **/ 


PRESENT SA3 
IF RMC NEXT SA4 OUT !CASO 
OUT !CAS1 
OUT !CAS2 


Figure 8. Source Code for Data Sizing and Alignment PAL Device (Cont’d.) 





=e we 


=e 


=. me 


G Wd poeouerapy Buisyp 4911043005 WywHG & eZzIWwosng 


S9IIAZ 


002-2 


OUT !CAS3 /** CONTINUE THE READ MODIFY WRITE CYCLE **/ 
OUT !WE 
QUT !DSACKO PRESENT SA6 
OUT !DSACK1 ; IF CASO NEXT SA7 OUT CASO ; 
IF CAS1 NEXT SA7 OUT CAS1 ; 
/** FOR READ WRITE CYCLE REMOVE CAS AND WE FOR CYCLE COMPLETION **/ IF CAS2 NEXT SA7 OUT CAS2 ; 
/** ACKNOWLEDGE CYCLE COMPLETION BY ACKNOT **/ IF CAS3 NEXT SA7 OUT CAS3 ; 
/** JUMP TO STATE SA **/ NEXT SA7 ; 
IF CASO & IRMC NEXT SA5 OUT CASO ; /** CONTINUE THE READ MODIFY WRITE CYCLE **/ 
IF CAS1 & !RMC NEXT SA5 OUT CAS1 ; 
IF CAS2 & !RMC NEXT SAS OUT CAS2 ; PRESENT SA7 
IF CAS3 & !RMC NEXT SA5 OUT CAS3 ; IF CASO NEXT SAB OUT CASO ; 
IF !RMC NEXT “SAS OUT !DSACKO IF CAS1 NEXT SA8 OUT CASI ; 
OUT !DSACK1 ; IF CAS2 NEXT SA8 OUT CAS2 ; 
IF CAS3 NEXT SAS OUT CAS3 ; 
/** FOR READ MODIFY WRITE CYCLE CONTINUE TO ASSERT CAS SIGNALS **/ IF PORT8 NEXT SAB OUT DSACKO ; 
/** ACKNOWLEDGE COMPLETION OF READ PORTION OF THE CYCLE BY ACKNOT **/ IF PORT16 NEXT SA8 OUT DSACK1 ; 
/** JUMP TO STATE SAS **/ IF PORT32 NEXT SA8 OUT DSACKO 
OUT DSACK1 ; 


NEXT SAS OUT WE ; 
/** REMAINING PORTION OF THE READ WRITE CYCLE **/ 
/** THIS IS THE WRITE PORTION OF THE READ MODIFY WRITE CYCLE **/ 
PRESENT SA4 NEXT SAO ; /** GENERATE THE WE SIGNAL FOR ENABLING WRITE **/ 
/** GO BACK TO POLLING STATE SAO **/ /** GENERATE THE CORRECT ACKNOWLEDGE FOR THE WRITE CYCLE PORTION **/ 
/** CONTINUE TO ASSERT CAS **/ 
/** COMPLETION OF READ WRITE CYCLE **/ 


Mg Sojs0luoyy REI ayyjouoy 


PRESENT SA8 


IF CASO NEXT SA9 OUT CASO ; 
IF CAS1 NEXT SA9 OUT CASI ; 
/** REMAINING PORTION OF THE READ MODIFY WRITE CYCLE **/ IF CAS2 NEXT SA9 OUT CAS2 ; 
IF CAS3 NEXT SA9 OUT CAS3 ; 
PRESENT SA5 IF PORTS NEXT SA9 OUT DSACKO ; 
IF CASO NEXT SA6 OUT CASO ; IF PORT16 NEXT SA9 OUT DSACK1 ; 
IF CAS1 NEXT SA6 OUT CAS1 ; IF PORT32 NEXT SA9 OUT DSACKO 
IF CAS2 NEXT SA6 OUT CAS2 ; OUT DSACK1 ; 
IF CAS3 HEXT SA6 OUT CAS3 ; NEXT SA9 OUT WE ; 
NEXT SA6 ; 


/** THIS IS THE WRITE PORTION OF THE READ MODIFY WRITE CYCLE **/ 
Figure 8. Source Code for Data Sizing and Alignment PAL Device (Cont’d.) 
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PRESENT SA9 /** EQUATIONS FOR SYSTEM INITIALIZATION AFTER RESET **/ 


IF CASO NEXT SA10 OUT CASO ; 
IF CAS1 NEXT SA10 OUT CAS1 ; (80..B3].ar = HINIT ; 
IF CAS2 NEXT SA10 OUT CAS2 ; 
IF CAS3 NEXT SA10 OUT CAS3 ; CASO.AR = FINIT; 
IF PORTS NEXT SA10 OUT DSACKO ; CAS1.AR = FINIT ; 
IF PORT16 NEXT SA10 OUT DSACK1 ; DSACKO.AR = !INIT ; 
IF PORT32 NEXT SA10 OUT DSACKO DSACK1.AR = !INIT ; 
OUT DSACK1 ; CAS2.AR = FINIT ; 
NEXT SA10 OUT WE ; CAS3.AR = SINIT ; 
WEAR = INIT; 
e /** THIS IS THE WRITE PORTION OF THE READ MODIFY WRITE CYCLE **/ 
5 PRESENT SA10 /* THESE ARE THE OUTPUT ENABLES */ 
3 NEXT SA11 OUT !CASO 
= OUT !CAS? CASO.OE = ‘BIT ; 
= OUT !CAS2 CAS1.0E = 'BY ; 
® OUT !CAS3 DSACKO.OE = 'B'1 ; 
in OUT !DSACKO DSACK1.0E = 'B'l ; 
OUT !DSACK1 CAS2.0E = 'B' ; 
= OUT IWE ; CAS3.06 = 'B't ; 
3 WE .OE = 'B'd ; 
8 /** COMPLETION OF THE WRITE PORTION OF THE: READ MODIFY WRITE CYCLE **/ 
S /** REMOVE ALL CAS WE AND ACKNOWLEDGE COMPLETION **/ 
® 
C4 /** REMAINING PORTION OF THE READ HOOIFY WRITE CYCLE **/ 


PRESENT SA11 NEXT SAO ; 
/** GO BACK TO POLLING STATE SAO **/ 

/** COMPLETION OF THE READ MODIFY WRITE CYCLE **/ 
} 


/** THE STATE MACHINE COMPLETE **/ 


Figure 8. Source Code for Data Sizing and Alignment PAL Device (Cont’d.) 
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Introduction 


This application note describes the implementation of a timing 
generator which interfaces between the 8088 and the Am2968 
Dynamic Memory Controller (DMC) using programmable logic 
and a delay line. The implementation does not account for any 
error detection and correction (EDC) circuitry. 


The timing generator, like the Am2970, is needed in memory 
systems utilizing a dynamic memory controller. It serves as an 
interface between the processor and the controller and gener 
ates the necessary control signals for the controller. 


A dynamic memory controller, in brief, interfaces between a pro- 
cessor and the dynamic memory array. It steers the appropriate 
address inputs and Row and Column Address Strobes (RAS 
and CAS) required in the selected memory operations (i.e. 
refreshing, read/write). 


The following sections will introduce and illustrate the implemen- 
tation of the timing generator using 20-pin PAL devices, and also 
show its interface to AMD’s Am2968 (DMC) and 8088 proces- 
sor. Figure 1 shows the block diagram of the interface. Program- 
mable logic was chosen because it is readily available, simple to 
use and reduces the number of devices required to implement 
specific functions. 


Interface Overview 


The implementation of this timing generator is not a general 
purpose application. It is dedicated specifically to a particular 
processor, the Intel 8088, and is limited to accessing only four 
‘banks of memory. The range of the four banks of memory is 


therefore 1 Mbyte—with 256K in each bank. This limitation is set 
by the fact that no additional decoding circuitry has been added. 
Decoding is done with the two address lines (A18—A19) of the 
8088, thus allowing only four banks to be selected. 


The clock to the 8088 and the two PAL devices is provided by 
the 8284A. The CLK signal from the 8284A is connected directly 
to the CLK input of the 8088 and also to CLK of RASER 
(PAL16R4). The PCLK of the 8284A connects to the PCLK input 
of HLDR (PAL16R6) to operate the 6-bit counter. Note that the 
clock operates on a 1/3 and 2/3 duty cycle. 


Description of PAL Device Function 


The function of RASER (PAL16R4) is to create RASI (Row Ad- 
dress Strobe) and CAS! (Column Address Strobe) at the appro- 
priate time. RASI is generated directly from the device, whereas 
CASI is generated from MRASI via a delay line. Figure 2 shows 
the timing delays between the controller and processor interface 
signals. The two modes of operation which the timing is focused 
on is Refresh w/o Scrubbing and Read/Write. By toggling the 
state of the mode control pin MC1 to either a LOW or HIGH 
respectively, and with MCO tied LOW (at the Am2968), the 
desired mode is generated. The mode control pin MC1 is 
generated based on the state of the processor's interface pins: 
1O/M, DT/R and SSO. The combination of these signals decodes 
the processor's current bus cycle and indicates the ongoing 
memory activity. The MUX Select pin (MSEL) is also generated 
from the delay line. It determines whether a Row or Column 
address is sent to the memory address input based on the mode 
contro! inputs, MCO and MC1. 
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Figure 1. Am2968 to 8088 Interface Block Diagram 
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Figure 2. Interface Timing 


A two-bit counter implemented internal to the (PAL16R4) 
RASER is used to keep track of the internal state of RASI during 
the Read/Write or Refresh operations. In a normal memory 
Read/Write operation, the appropriate RASn outputs from the 
DMC will be activated in response to RASI going HIGH, as 
shown in the memory timing in Figure 3, during this time ALE 
initiates the cycle. In Refresh mode, receiving RASI will force all 
the RASn outputs of the DMC to go Low. FALE initiates the 
refresh cycle, since no ALE occurs during this time, as shown in 
refresh timing of Figure 3. The counter is also configured such 
that, during memory operations, sufficient time has been allotted 
within the cycle for RAS precharge (required in DRAMs) to 
occur. 


b) HLDR (PAL16R6) 


The function of the (PAL16R6) HLDR is to generate the two 
signals—FALE (False Address Latch Enable) and HLD (Hold). 
FALE controls the internal latches (ALS and ALR) of the 
(PAL16R4) RASER and initiates the refresh cycle independent 
of the processor's ALE (Address Latch Enable) signal. 





The (PAL16R6) HLDR is essentially a 6-bit counter that gener- 
ates 64 T-Clock cycles for the 8088 processor. When a hold 
request (HLD) is made and a hold acknowledge (HLDA) is 
received from the processor, FALE will be generated for the 
(PAL16R4) RASER to initiate the refresh cycle. Refresh is per- 
formed every 8 js which is double the required frequency. This 
time is derived in the Refresh Calculations section of this appli- 
cation note. The 8088 processor allows request only at the end 
of the bus cycle. A bus cycle duration is 4 T-Clocks, thus refresh 
is performed at cycles “60-63” as noted in the function table in 
the HLDR design specification. Figure 2 refresh timing shows 
this critical portion of the timing activity when RASI is generated 
for refresh. During CLK (0 - 0*), (this corresponds to PCLK count 
value 62), the processor acknowledge is received and RASI 
becomes active. While RASI is HIGH, the internal 2-bit counter 
will cycle through its count sequence (0* - 3*); at the end of the 
sequence as indicated by PCLK count value 63 (i.e., CLK 3°), 
RAS! will become inactive. 
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Figure 3. Refresh Timing 


Refresh Timing Calculations 


The timing calculations are based on 8088 processor being in 
minimum mode (8 MHz) and a cell array of 128 rows. 


The 8088 clock period is 8 MHz, thus 
t = 1/f = 1/8 M = 125 ns. 


Refreshing for DRAMs are performed on 128 rows every 2 ms. 
The calculations for the above memory array is determined 
to be: 


Required refresh time = 2 ms/128 = 15.6 us. 
Thus the required time for 256 row cell array will be: 


Required refresh time = 4 ms/256 = 15.6 us. 


From the above calculations, refreshing needs to be performed 
at least every 15 ps. But for this application, memory is being 
refreshed every 8 us as shown below: 


Refresh time = 64 T-clock cycles x 125 ns = 8 ys. 


Delay Line Taps Computation 

The calculations for the assignment of the delay line taps is 
based on the parameters CASI and MSEL relative to RAS! 
making a transition to the active state. See the timing in Fig- 
ure 4. 
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Figure 4. Delay Line Timing Diagram 
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MSEL may switch some time after To. To is calculated to be as Thus, 64 ns after RASI initiates its transition, CASI may begin to 
follows: go active. 
To = tran + tog = 20 + 11 = 31 ns. For simplicity, delay line taps may be assigned as follows: 
where T2 = delay time from RASI to MSEL RASI ---------- MSEL ~ 30 ns 

thaw = Row Address Hold Time of the DRAM RASI ---------- CASI ~ 60 ns 

to4 = skew time for Qn to RASn of the DMC 


Thus, after 31 ns MSEL should initiate its active transition. 





CASI is calculated to make its change; T3 ns relative to RASI. Counter and Mode Function Table 
The calculated value is as follows: The following function tables describe the activity of the 
= = = RASI/MRASI during the specific modes of the processor's inter- 
lige te les se VASE Sha eS O04, face signals (IO/M, DT/R and SSO) and the state of the internal 
where Tg. = delay from RASI to CASI counter which controls RASI/MRASI. Note: RASI will be active 
tos = skew time for Qn to CASn of the DMC during the counter sequences between 1 to 3 as shown below in 
tasc = column setup time of the DRAM Figure 5. 


Two Bit Counter 










Two Bit Counter 





PASSIVE 
CODE ACCESS 
RD 
WR 





Note: This design does not allow the user to insert Wait States 
during memory operations! Wait States may be inserted in 
interrupt or I/O cycles. 


Figure 5. 
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TITLE 
PATTE 
REVIS 
AUTHO 





COMPANY ADVANCED MICRO DEVICES 
DATE 11/06/87 
CHIP RASER PAL16R4 
CLK RES ALE FALE IOM DTR SSO CLK2 NU2 GND 
TRI MRASI RASI /MCl NC /B /A /ALR /ALS' VCC 
EQUATIONS 
ALS = FALE*/ALE*/ALR 
+ RES 
ALR = /A*B*/ALS 
+ A*/ALS 
/RASI = /A*/B*/RES 
+ RES*CLK2 
/MRASI = IOM*/DTR 
+ IOM*DTR 
+ /IOM*DTR*SSO 
+ /A*/B 
A := /A*/RES*/ALS 
+ /BR*/RES*ALE 
B := A*/B*/RES*/ALS 
+ /A*B* /RES*/ALS 
MCl := IOM*/DTR*ALE*/RES 
+ DTR*SSO*ALE*/RES 
+ IOM*DTR*/SSO*ALE* /RES 
+ MC1*/ALE*/RES 
; SIMULATION NOT INCLUDED 
Figure 6. Source Listing for the 8088 to Am2968 Interface (AmPAL16R4) 
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TITLE Am2968 TO 8088 INTERFACE 
PATTERN HLDR 

REVISION O01 

AUTHOR G. SPEARS 

COMPANY ADVANCED MICRO DEVICES 
DATE 11/06/87 


CHIP HLDR PAL16R6 


PCLK RASI MRASI HLDA RES CLK MCl NU2 NU3 GND 
TRI /FALE /F /E /D /C /B /A HLD VCC 


EQUATIONS 
A := /A*/RES*MC1 


B := A*/B*/RES*MC1 
/A*B* /RES*MC1 


+ 





fib pide EA A 


it 
md eens gh 












A*B* /C*/RES*MC1 
/B*C* /RES*MC1 
/A*C*/RES*MC1 


+ + il 


A*B*C* /D* /RES*MC1 
/C*D* /RES*MC1 
/B*D*/RES*MC1 
/A*D* /RES*MC1 


++ 4h 


A*B*C*D* /E* /RES*MC1 
/D*E* /RES*MC1 
/A*E* /RES*MC1 
/BRE*/RES*MC1 

/C*E* /RES*MC1 


+++4il 


/A* PR /RES*MC1 
/C*P* /RES*MC1 
/B*P* /RES*MC1 
/D*F* /RES*MC1 
A*B*C*D*E* /P* /RES*MC1 
/E*XP* /RES*MC1 


++etettil 


JHLD = /C + /D + /E + /F 
FALE = HLDA*CLK*RASI*/MRASI 


; SIMULATION NOT INCLUDED 


Figure 7. Source Listing for the 8088 to Am2968 Interface (AmPAL16R6) 
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Introduction 


This application note shows how the general-purpose timing 
generator can be configured to interface between MC68000 and 
the Am2968. Figure 1 shows the block diagram of the interface. 
The correct interface signals must be adapted from the proces- 
sor to the TimGen (PAL16L8) and the Am2968. Note, from Fig- 
ure 2, that AS (Address Strobe) replaces CYCREQ, and UDS 
(Upper Data Strobe) and LDS (Lower Data Strobe) replace DS 
and B/W. 


Asserting the Address Strobe signal (AS), which is connected to 
the LE input, latches the valid address into the Am2968. UDS 
and LDS are controls indicating the flow of data on the data bus 
(DO00—D15) during memory read/write operations. By asserting 
the correct polarity on these two signals (UDS & LDS), either a 
byte or a word may be accessed. 


The following sections will show the timing requirements which 
are being considered—that is, the desired processor operating 
frequency and the appropriate DRAM. Timing diagrams are also 
included to show the status of the interface signals during 
read/write operations for various processor frequencies. 


MC68000 Timing Requirements 


In generating the hardware for the timing interface for the 
MC68000 to the Am2968, the overall system timing require- 
ments must be considered to guarantee that the desired 
memory access time can be met. 


The following general equation formulates the parameters which 
must be considered in generating the read cycle time for the 
68000 processor. The write cycle time may be similarly 
generated. 


tREAD = 2tcyc + tcH — tcHsLx — toicL — 2tpaL — ta — tLatcH 
+ Ntcyc 


tREAD = read cycle time 

tcyc = clock period of the processor 

tou = clock width High of the processor 

tcHsLx = Clock High to AS, DS Low of the processor 
topic. = data in to clock Low (setup time) of the processor 
tpaL = programmable logic access time 

to = delay from RAS! to RASn of the DMC 

tLATCH = delay through latch 

N = number of inserted Wait States. 





For simplicity, zero Wait States have been assumed in selecting 

the appropriate DRAMs. If Wait States had been asserted, the 

appropriate DRAM must be selected to meet the overall taeap 

cycle time. Note that the access time of the DRAM (tacc) must 

be less than or equal to treap cycle time or else access to 

memory will be missed. The following evaluations show the cal- 

culated tacc based on the various processor operating frequen- 

cies: 8 MHz, 10 MHz and 12.5 MHz. 

MC68000 W/WAIT STATES 

8 MHz 

treap = (250 + 55 — 60 — 15 — 30 — 23 —- 138 + N* 125) ns 
= (164 + 125N) ns 

DRAM tacc = 150 ns 

10 MHz 

tREAD = (200 + 45 — 55 — 10 — 30 — 23 —- 13 + N« 100) ns 
= (114 + 100N) ns 

DRAM tacc = 100 ns 

12.5 MHz 

tReap = (160 + 35 — 55 — 10 - 30 ~ 23 —- 18 + N*« 80)ns 
= (64 + 80N) ns 
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Figure 1. 68000 System Diagram 
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Figure 2. Interface Diagram 
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Figure 3. Read and Write Cycle Timing Diagram 


to the processor's cycle time. The 8 MHz and 10 MHz are shown 
on the same diagram because of their similarity. Figure 4 shows 
the read and write timing for the processor operating at 12.5 
Figure 3 shows the read and write cycle timing diagrams relative MHz with Wait States asserted during these operations. 


For 12.5 MHz frequency, asserting at least one Wait State will 
allow treap = 144 ns, thus DRAM tacc = 120 ns. 
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Figure 4. Read and Write Cycle Timing Diagram 
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TITLE GEN INTERFACE FOR 68000 
PATTERN TIMGEN 

REVISION 01 

AUTHOR LEE/YEE 

COMPANY ADVANCED MICRO DEVICES 
DATE 11/06/87 


CHIP TIMGEN PAL16L8 


/CS /AS /FR /SUP RW 80 # /UDS /LDS T1100 GND 
NC /DTACK TO FRH /RFRQ /RWRQ /WEL /WEH /OE VCC 


EQUATIONS 


/FRH FR*/FRH + TO*RFRQ 


RWRQ = CS*AS*/RWRQ*/T100 + RWRQ*/T100 
RFRQ = FR*FRH*/RWRQ*/T100 + RFRQ*/T100 
/TO = /RWRQ*/RFRQ + T80 

DTACK = CS*RWRQ*/RFRQ*T80 


DTACK.TRST = CS*RWRQ*/RFRQ*T80 


I 


WEH /SUP*RWRQ*CS* /UDS* /RW 


WEL /SUP*RWRQ*CS*/LDS*/RW 
OE = CS*RW*UDS + RW*CS*LDS 


; SIMULATION NOT INCLUDED 


Figure 5. Source Listing for the 68000 Interface 





2-214 ol Monolithic ER Memories cl 


General-Purpose Dual-Port Arbiter 





Introduction 


This application note shows the implementation of a general- 
purpose dual-port arbiter interfacing three processors, a 
Dynamic Memory Controller (Am2968A DMC), and dynamic 
memory. The arbiter, implemented with two programmable array 
logic PAL devices, and a delay line, can provide a simple and 
complete arbitration solution operating in asynchronous mode. 


The interface consists of two PAL devices (TIMGEN2A and 
TIMGEN2B), a timer, a delay line, and two multiplexers. Figure 1 
shows the block diagram of the system which includes the ar 
biter logic implemented with the PAL devices. These two 
devices, TIMGEN2A and TIMGEN2B, provide the control sig- 
nals to the DMC as well as to the dynamic memory. The purpose 
of the arbiter is to mediate when two processors request for 
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memory cycles, or when refresh cycles need to be performed. 
The first PAL device, TIMGENZ2A, generates the signals for the 
DMC and the Address MUx; it also contains the arbitration logic. 
The second device, TIMGEN2B, generates the control signals 
for the memory and the data bus latches. 


The Am2968A Dynamic Memory Controller (DMC) interfaces 
between the processors and the dynamic memory array. The 
DMC provides the required addresses either for the memory or 
the refresh cycles. For memory cycles, the addresses are 
generated by the processor and latched into the DMC. For 
refresh cycles, the addresses are generated by a 20-bit counter 
internal to the DMC. When the DMC receives the correct contro! 
signals, it generates the appropriate address (Qn) and the Row 
and Column Address Strobes (RASn and CASn) that are re- 
quired to perform memory read/write and refresh. 


DYNAMIC 
RAM 





DOOA-D15A 


DOOB-D15B 


Figure 1. Block Diagram 
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Interface Overview 


The dual-port arbiter can be interfaced to various processors, 
using the proper control signals from the particular processor. 
Typically, the processor interface signals are CS (Chip Select), 
CYCREQ (Cycle Request), RD (Read), and WR (Write), see 
Figure 1. 


The two PAL devices, TIMGEN2A and TIMGENZ2B are designed 
to operate in the following modes: 


(1) generate proper control signals when cycle request is made 
(2) arbitrate between two processor requests 

(3) arbitrate between a refresh request and processor request 
(4) arbitrate between a refresh request and two cycle requests 


Requests by the processor are made when CS and CYCREQ 
are valid and the memory address is present. During dual-port 
arbitration, the occurrence of simultaneous requests, for in- 
stance, CSA \ (Chip Select Port A), CYCREQA (Cycle Request 
Port A) and FR (Forced Refresh), will result in the granting of a 
refresh cycle and the appropriate address being generated 
accordingly by the DMC. 


TIMGEN2A generates the appropriate control signals to the 
data and address multiplexers, the delay line, and the memory 
controller. These control signals are: RASI (Row Address Strobe 
Input), SEL (Processor Address MUX Select), and MC1 (Mode 
Control Input). RAS! is generated for all read/write and refresh 
cycles. It is also used as the delay line input from which MSEL 
(MUX Select for the DMC) and CASI (Column Address Strobe 
Input) are generated. SEL allows the correct port address, A or 
B, to be the input to the DMC depending upon which port is 
acknowledged. MC1 specifies the operating mode of the DMC, 
read/write or refresh (see Am2968A data sheet). 


The arbitration logic prioritizes and grants the requests for 
read/write cycles from either port, A or B, and refresh cycles. 
The Mode Table (Figure 2) shows the order of precedence. 
When simultaneous request for a read/write or refresh cycle 
occurs, the refresh cycle will be given priority. The connection of 
RFRQ (Refresh Request) from TIMGEN2A to MC1 of the DMC, 
as shown in the block diagram, implicitly gives refresh the 
precedence. When simultaneous read/write requests are made 
by processor A and processor B, processor A will be given 
priority over B. Again, the order of precedence is set implicitly by 
the connection of RWRQA (Read/Write Request Port A) to the 
select input signal of the processor address MUX. 


GRANT 


FR (refresh) 

FR (refresh) 

FR (refresh) 

FR (refresh) 
PORT A (pP Req) 
PORT A (pP Req) 
PORT B (pP Rea) 
no activity 





CYCREQ 


FR Port A Port B 


L 
L 
L 
L 
H 
H 
H 
H 


Figure 2. Mode Select Table 





The second programmable logic device, TIMGEN2B generates 
the OE (Output Enable) signal for the data bus latches and the 
WE (Write Enable) signal for | for memory. The inputs to TIMGEN2B 
are the control signals—CSA, CSB, RDA (Read A), RDB (Read 
B), WRA (Write A), and WRB (Write B). Signals such as BHEN 
(Byte High Enable) and AO0, the least significant bit of the ad- 
dress, have been included to show their function (if available). 
These two signals are provided by the processor and together 
they determine whether a word or byte transfer is to be per 
formed during memory operations. 











Figure 3. Refresh Timer Diagram 


Refresh Timer and Calculations 


The refresh timer below shows one method of implementing an 
external refresh clock. This section is optional if an alternate 
source is used. The refresh timer, implemented with a 555 timer, 
needs to generate an active-LOW edge-triggered signal at least 
once every 15 ps. This means that a refresh occurs when a 
LOW-going edge on the FR signal is detected. The 15 ys value 
is derived from the dynamic memory refresh timing require- 
ments; refresh is normally required, over 256 rows, every 4 ms. 
This equates to a required refresh cycle every 15.6 ys. In this 
application, the refresh timer generates FR approximately once 
every 9.8 ps. Figure 3 shows the circuit for the forced refresh 
(FR) timer. The timer is a commercially-available 555 timer. The 
delay equation shown below must be solved for R1, R2, and C 
values. This satisfies the refresh requirement of less than, or 
equal to, 15.6 ns 


T = 0.693 (R1 + 2R2) C < 4 ms/256 = 15.6 ps 

For the following values of R1, R2 and C 

R1 = R2 = 10K and C = 470 pF 

the timer generates a refresh every 9.8 ys which allows some 


margin in the refresh requirement. The specified values for R1, 
R2, and C allow for 30% total discrete component tolerance. 
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TIMGEN2A 


The function of TIMGEN2A is to generate RASI (Row Address 
Strobe Input) and arbitrate between processor and refresh 
cycles. Based on the arbitration, TIMGEN2A also generates 
RWRQ (Read/Write Request), or RFRQ (Refresh Request), that 
determines the type of cycle to be performed. TIMGEN2A can 
arbitrate between refresh and processor requests and also arbi- 
trate between Port A and Port B processor requests. 





When simultaneous refresh and processor requests are 
generated, refresh (RFRQ) will be given priority. The priority is 
implicit by design because the RFRQ output of TIMGEN2A is 
connected to MC1 of the Am2968A DMC. When RFRQ be- 
comes active (LOW), MC1 forces the DMC into refresh mode. 








When simultaneous processor requests are made, RWRQA 
(Read/Write Request of Port A) will be granted before RWRQB 
(Read/Write Request of Port B). This priority is also implicit by 
design because the RWRQA output is connected to SEL of the 
processor's address multiplexer. When RWRQA becomes ac- 
tive, address from Port A will be selected for input to the Row 
and Column Address latches of the Am2968A, also, the data 
from Data Bus A will be allowed to flow to the inputs of the 
DRAMs. !n either case, RAS! must be generated to initiate any 
of the cycles. Once RASI is generated, the control signals, 
MSEL and CASI, will be generated from the delay line relative 
to RASI. 


TIMGEN2B 


The function of TIMGEN2B is to generate WE (Write Enable) for 
memory and also OE (Output Enable) to control the flow of data 
through the data bus latches. The inputs to TIMGEN2B are, 
CSA, RDA (Read Strobe A), WRA (Write Strobe A), CSB, RDB 
(Read Strobe B), and WRB (Write ie B). In addition, there 
are BHENA (Byte High Enable A) and BHENB (Byte High 
Enable B), AOOA (least significant address bit for Port A) and 
AOOB (least significant address bit for Port B). These signals are 
generated by the processor and are used to specify either byte 
or word transfers. 


The following description outlines the operation of TIMGEN2B. If 
CSA and RDA become active (LOW), TIMGEN2B will cause 
OEA to become active (LOW), enabling the data bus latches 
and allowing data corresponding to address A to flow onto the 
system data bus (read operation). The identical procedure oc- 
curs for Port B when the control inputs for Port B is active. The 
block diagram shown in Figure 1 shows a multiplexer for the 
system data bus. This multiplexer controls the data flow into 
memory from either processor A or processor B, during write 
operations. The select to the MUX is controlled by RWRQA, the 
same signal that controls the select to the processor address 
MUX, allowing address Port A to implicitly have higher priority 
during process memory access. 


The following sections provide general information about 
interfacing the arbiter to various major types of processors; they 
are: the iAPX-type, the AmZ8000, and the MC68000-type 
processors. The PAL devices for the three types of processors 
will be identified as follows: TIMGEN2A and TIMGEN2B are for 
the iAPX-type; TIMGEN3A, TIMGENSB for the AmZ8000; and 
TIMGEN4A and TIMGEN4B are for the MC68000-type. When 
mixing different processor in an interface, some of the control 
signals mentioned for the particular groups may need to be 


modified to be applicable to the processor under consideration. 
The designer can tailor the design to meet specific needs with 
simple modification to the PAL device logic equations provided. 


Interfacing the iAPX-Type Processors 


The control signals required for TIMGEN2A are shown in Fig- 
ure 4. These signals are: CS, CYCREQ, FR and the delay line 
outputs (TAP1—TAP5). CS and CYCREQ are provided by the 
processor. FR is the refresh request from the refresh timer. 
Sources to this input can be provided either by the timer 
described in this application note or other appropriate sources. 
TAP1—TAPS are the timing delay outputs used to regulate many 
of the signal generation. The outputs generated from this PAL 
device are TAPO (RASI), XACKA, XACKB, and the qrants for 
processor and refresh requests. 


The signals for TIMGEN2B are RD, WR, BHE, and AOo. For 
byte/word transfers, the designer should consult the appropriate 
processor data sheet. In general, this processor group uses 
BHE in conjunction with A00 to define the transfer function of the 
data bus. The outputs from this PAL device are: Write Enables, 
Latch Enables, and Output Enables. 


Interfacing the AmZ8000 


The control signals required for this interface are shown in Fig- 
ure 5. For TIMGENSA, the general controls are CS and 
CYCREQ, FR and the delay line outputs (TAP1—TAPS5). CS and 
CYCREQ are generated by the processor. FR is the refresh 
request from the refresh timer. TAP1—TAP5 are the delay line 
outputs used to regulate the signal generation. The outputs from 
this PAL device are WAITA, -WAITB, TAPO (RASI), and the 
grants for processor and refresh requests. 


The contro! signals for TIMGEN3B are R/W, DS, A00 and B/W. 
For byte/word transfers, the designer should consult the proces- 
sor data sheet for correct signal generation. The outputs from 
this PAL device are: Write Enables, Latch Enables and Output 
Enables. 


Interfacing the MC68000-Type 
Processors 


The control signals required for this interface are shown in Fig- 
ure 6. For TIMGEN4A, the general controls are CS and 
CYCREQ, FR and the delay line outputs (TAP1—TAPS). CS and 
CYCREQ are generated by the processor. FR is the refresh 
request from the refresh timer. TAP1—TAPS5 are the delay line 
outputs used to regulate signal generation. The outputs 
generated from this PAL device are: DIACKA, DTACKB, TAPO 
(RASI) and the grants for processor and refresh request. 


The contro! signals for TIMGEN4B are R/W, UDS, and LDS. For 
byte/word transfers, the designer should consult the respective 
processor data sheet for correct signal generation. For example, 
to obtain either byte or word transfers with the MC68000, three 
signals, UDS, LDS and R/W, must provide the correct levels to 
guarantee the correct data transfers, see MC68000 Data Bus 
Control Table in the User’s Manual. The outputs generated from 
this PAL device are: Write Enables, Latch Enables, and Output 
Enables. 


The programmable logic device equations for all three interfaces 
are shown in Figures 7, 8, and 9. 
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Figure 4. Interface for |APX-Type Processors 
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Figure 5. Interface for Z8000-Type Processors 
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Figure 6. Interface for MC68000-Type Processors 
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TITLE INTERFACE TIMING PAL DEVICE #1 
PATTERN A3-249A 

REVISION 01 

AUTHOR J. ENGINEER 

COMPANY ADVANCED MICRO DEVICES 

DATE 12/15/87 


CHIP TIMGEN2A PAL16L8 


/CSA /CYREQA /CSB /CYREQB TAP1 TAP2 TAP3 TAP4 TAP5 GND 
/FR /XACKA /XACKB TAPO FRH /RFRQ /RWRQA /RWRQB NC vcc 





EQUATIONS 

/FRH = FR*/FRH + TAPO*RFRQ 

RFRQ = FR*FRH*/RWRQA*/RWRQB*/TAPS + RFRQ*/TAP5 

RWRQA = CSA*CYREQA*/RFRQ*/RWRQB*/TAP5 + RWRQA*/TAP5 

RWRQB = CSB*CYREQB*/RFRQ*/RWRQA*/TAP5 + RWRQB*/TAP5 

/TAPO = /RFRQ*/RWRQA*/RWRQB + TAP4 ;PAL device for 8086,80186 


z;and 80286 processors 
XACKA = CSA*RWRQA*/RFRQ*TAP 4 
XACKA.TRST = CSA*RWRQA*/RFRQ*TAP4 


XACKB = CSB*RWRQB*/RFROQ*/RWRQA*TAP4 
XACKB.TRST = CSB*RWRQB*/RERQ*/RWROA*TAP4 


TITLE INTERFACE TIMING PAL DEVICE #2 

PATTERN A3-249B 

REVISION 01 

AUTHOR J. ENGINEER 

COMPANY ADVANCED MICRO DEVICES 

DATE 12/15/87 

CHIP TIMGEN2B PAL22V10 

/CSA /RDA /WRA /BHENA AOOA /BHENB /WRB /RDB /CSB NC TAP3 GND 
AOOB /OEA /OEB /SUP /RFRQ /RWROA /RWRQB /WEL /WEH LEB LEA VCC 
GLOBAL 

EQUATIONS ;PAL DEVICE FOR 8086 PROCESSORS 


WEH = /SUP*RWRQA*/RFRQ*CSA*WRA*BHENA 
+ /SUP*RWROB*/RERQ* /RWRQA*CSB*WRB*BHENB 


WEL = /SUP*RWRQA*/RFRQ*CSA*WRA*/AQD0A 
+ /SUP*RWRQB*/RFERQ*/RWRQA*CSB*WRB*/A00B 


OEA = CSA*RWRQA*/RFRQ*RDA 
OEB = CSB*RWRQB*/RFRQ*RDB 
LEA = CSA*RWRQA*/RFRO*RDA*TAP3 


LEB = CSB*RWRQB*/RFRQ*RDB*TAP3 


Figure 7. PLPL Specification for the Example of Figure 4 
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General-Purpose Dual-Port Arbiter 








TITLE INTERFACE TIMING PAL DEVICE #1 
PATTERN A3-250A 

REVISION 01 

AUTHOR J. ENGINEER 

COMPANY ADVANCED MICRO DEVICES 

DATE 12/15/87 


CHIP TIMGEN3A PAL18P8 


/CSA /CYREQA /CSB /CYREQB TAP1 TAP2 TAP3 TAP4 TAPS GND 
/FR /WAITA /WAITB TAPO FRH /RFRQ /RWRQA /RWRQB NC vcc 


EQUATIONS 

/FRH = FR*/FRH + TAPO*RFRQ 

RFRQ = FR*FPRH*/RWRQA*/RWRQB*/TAP5 + RFRQ*/TAPS 

RWROQA = CSA*CYREQA*/RFRQ*/RWROB*/TAP5 + RWRQA*/TAPS 

RWROB = CSB*CYREQB*/RFRQ*/RWRQA*/TAP5 + RWROQB*/TAPS 

/TAPO = /RFRQ*/RWRQA*/RWROB + TAP4 ;PAL device for 28000 
;processor 


/WAITA = CSA*RWRQA*/RFRQ*TAP4 
WAITA.TRST = CSA*RWRQA*/RFRQ*TAP4 


/WAITB = CSB*RWRQB*/RFRQ* /RWRQA*TAP4 
WAITB.TRST = CSB*RWRQB*/RFRQ*/RWRQA* TAP 4 


TITLE INTERFACE TIMING PAL DEVICE #2 
PATTERN A3-250B 

REVISION 01 

AUTHOR J. ENGINEER 

COMPANY ADVANCED MICRO DEVICES 

DATE 12/15/87 


CHIP TIMGEN3B PAL22V10 


/CSA  RWA /DSA BWA AOOA BWB  /DSB RWB /CSB NC TAP3 GND 
AOQOB /OEA /OEB /SUP /RFRQ /RWRQA /RWRQB /WEL /WEH LEB LEA VCC 
GLOBAL 


EQUATIONS ;PAL DEVICE FOR Z8000 PROCESSOR 
WEH / SUP *RWRQA* /RFRQ*CSA*/RWA*/BWA 

/ SUP *RWRQA* /RFRQ*CSA* /RWA*BWA*A00A 

/ SUP *RWRQB* /ikKFRQ* /RWRQA*CSB*/RWB*/BWB 

/ SUP *RWROB* /Rv'2.9* /RWRQA*CSB* /RWB*BWB*A00B 


+++ il 


/ SUP *RWRQA* /RFRQ*CSA*/RWA*/BWA 

/ SUP *RWRQA* /RFRQ*CSA*/RWA*BWA*/A00A 

/ SUP *RWROB* /RFRQ* /RWRQA*CSB* /RWB* /BWB 

/ SUP *RWROQB* /RFRQ* /RWRQA*CSB*/RWB*BWB*/A00B 


WEL 


+++ 


OBA = CSA*RWRQA*/RFRQ*RWA*DSA 
OEB = CSB*RWRQB*/RFROQ*RWB*DSB 
LEA = CSA*RWRQA*/RFRQ*RWA*TAP3 


LEB = CSB*RWRQB*/RFRQ*RWB*TAP3 


Figure 8. PLPL Specification for the Example of Figure 5 
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General-Purpose Dual-Port Arbiter 





TITLE INTERFACE TIMING PAL DEVICE #1 
PATTERN A3-251A 

REVISION 01 

AUTHOR J. ENGINEER 

COMPANY ADVANCED MICRO DEVICES 

DATE 12/15/87 


CHIP TIMGEN4A PAL16L8 


/CSA /CYREQA /CSB /CYREQB TAP1 TAP2 TAP3 TAP4 TAP5 GND 
/FR /DTACKA /DTACKB TAPO FRH /RFRQ /RWRQA /RWRQB NC vcc 


EQUATIONS 


FR*/FRH + TAPO*RFRQ 


/FRH 


RFRQ FR*FRH*/RWRQA*/RWRQB*/TAP5 + RFRQ*/TAPS5 
RWRQA = CSA*CYREQA*/RFRQ*/RWRQB*/TAP5 + RWRQA*/TAPS 


RWRQB = CSB*CYREQB*/RFRQ*/RWRQA*/TAP5 + RWRQB*/TAPS5 





/RFRQ*/RWRQA*/RWRQB + TAP4 ;PAL device for 68000 
;processor 


/TAPO 


DTACKA = CSA*RWRQA*/RFRQ*TAP4 
DTACKA.TRST = CSA*RWRQA*/RFRQ*TAP4 


DTACKB = CSB*RWRQB*/RFRQ*/RWRQA*TAP4 
DTACKB.TRST = CSB*RWRQB*/RFRQ*/RWRQA*TAP4 


TITLE INTERFACE TIMING PAL DEVICE #2 

PATTERN A3-251B 

REVISION 01 

AUTHOR J. ENGINEER 

COMPANY ADVANCED MICRO DEVICES 

DATE 12/15/87 

CHIP TIMGEN4B PAL22V10 

/CSA RWA /UDSA /LDSA NC5 /LDSB /UDSB RWB /CSB NC TAP3 GND 
NC10 /OEA /OEB /SUP /RFRQ /RWRQA /RWRQB /WEL /WEH LEB LEA VCC 
GLOBAL 

EQUATIONS ;PAL DEVICE FOR 68000 PROCESSOR 


WEH = /SUP*RWRQA*/RFRQ*CSA*/RWA*UDSA 
+ /SUP*RWRQB*/RFRQ*/RWRQA*CSB*/RWB*UDSB 


WEL = /SUP*RWRQA*/RERQ*CSA*/RWA*LDSA 
+ /SUP*RWRQB*/RFRQ*/RWRQA*CSB*/RWB*LDSB 


OEA = CSA*RWRQA*/RFRQ*RWA* (UDSA + LDSA) 
OEB = CSB*RWRQB*/RFRQ*RWB* (UDSB + LDSB) 
LEA = CSA*RWRQA*/RFRQ*RWA*TAP3 


LEB = CSB*RWRQB*/RFRQ*RWB*TAP3 


Figure 9. PLPL Specification for the Example of Figure G 
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Dynamic Memory Control 


State Sequencer 





An example of a control path application for a PAL device is ina 
memory system. Most large memory systems use MOS 
dynamic RAMs. Their high density allows packing a large 
memory size irito a small board area. Dynamic RAM prices also 
make them very cost effective. 


Dynamic RAMs require external logic for address multiplexing, 
timing generation and refresh control. This application note 
shows the use of a PAL16R8A and an Am2964B to provide the 
necessary external logic for a typical dynamic memory system. 
The PAL device is used as a state sequencer for timing genera- 
tion and the Am2964B provides specialized control circuitry and 
reduces timing skew between control signals. This implementa- 
tion replaces about 20 SSI/MSI packages. 








ADDRESS BUS 


BS; 





AmPAL16R8A 
MREQ 








RFCK 


CONTROL BUS 


Design Requirements 


A system block diagram is shown in Figure 1. The control bus 
provides most of the inputs to the PAL state sequencer. These 
include: Memory Request (MREQ), READ/WRITE (RW), 
RESET (RST), Refresh Clock (RFCK), and Read-Modify-Write 
(RMW). Two upper address lines of the address bus serve as 
board selects (BS;, BSg), and one local signal, SLOW/FAST 
Memory (FAST), allows use of either slow or fast memory. A 
READ/WRITE sequence is initialized by MREQ ANDed with the 
proper board select conditions and a refresh sequence is in- 
itialized by RFCK. If both sequences are requested at the same 
time, a refresh sequence is performed. RW when HIGH selects 
a READ operation and when LOW selects a WRITE operation. 
RMW when HIGH selects a Read-Modify-Write cycle. 








RAS 






DYNAMIC 
MEMORY 
ARRAY 











CAS 












DATA BUS 
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Figure 1. Dynamic Memory Controller 
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Dynamic Memory Control State Sequencer 





The outputs of the PAL device provide the timing and control 
inputs to the Am2964B. These are: Row Address Strobe (RAS), 
Address Multiplexer Select (MS), Column Address Strobe 
(CAS), and Refresh (RFSH). In addition, the PAL device pro- 
vides the Write Enable (WE) to the Memory Array. Figure 1 
shows the timing for fast READ/WRITE cycles. The memory 
cycle is initiated by MREQ going LOW. The PAL device re- 
sponds by bringing RAS LOW at to, followed by MS going LOW 
at t;, and finally bringing CAS LOW at to. If RW is LOW, WE is 








25 MHz CLK 


also brought LOW at tz. WE is held LOW until tz. RAS, MS and 
CAS are brought HIGH at ts. The rising edge of any of these 
three signals may be used to latch output data during a Read 
operation. The state sequencer then disabled for three states to 
allow for memory precharge. 


By holding the FAST input LOW, an extended memory cycle is 
available to accommodate slower RAMs. The timing appears in 
Figure 3. 





WE en ees ; 
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Figure 2. Fast READ/WRITE Cycle 


to ty te ts ts ts 


t7 ta tg tio 11 t12 413 t14 
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Figure 3. Extended Memory Cycle 
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Dynamic Memory Control State Sequencer 





RAS-Only refresh cycle timing is shown in Figure 4. The refresh 
cycle is initiated when RFCK goes HIGH. The RFSH output 
goes LOW at to, followed by RAS at t;. The Am2964B supplied 
the necessary refresh address. RAS is brought back HIGH at ts 
and precharge is then timed out. An extended refresh cycle for 
slower memory is available also. Burst refresh can be accom- 
plished by leaving RFCK HIGH for as many refresh cycles as 
desired. 


to tt t2 t3 ta 


Read-Modify-Write cycle timing is activated by setting RMW 
HIGH. This is especially valuable in systems with Error Detec- 
tion/Correction (EDC) capability. Data can be read, modified by 
the EDC circuitry (Am2960), and if necessary, written back to 
memory in a single memory cycle. Read-Modify-Write cycle 
timing is shown in Figure 5. Note that WE goes LOW at the end 
of the cycle. 


te t7 tg tg tio 11 t12 
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Figure 4. RAS-Only Refresh Cycle 


to t t2 t3 ts ts te 


t7 3 tg tro th t12 13 t14 











ve My 
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Figure 5. Read-Modify-Write Cycle 
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Dynamic Memory Control State Sequencer 








Design Approach 


The first step in the state sequencer design process is to define 
the timing waveforms for all of the functions desired. Figures 2, 
3, 4, and 5 are the result. Next, characteristics of the resulting 
waveforms are examined. Initially, the sequencer is waiting on 
the MREQ or RFCK input. if MREQ goes LOW, the RAS to MS 
to CAS sequence is initiated. If RFCK goes HIGH, the RFSH to 
RAS sequence is initiated. Both sequences are equivalent to a 
simple “shift” function. Once the shift sequence is completed 
and the signals are asserted, they must stay asserted for a 
specific time depending on the selected function. To time the 








FAST 


FUNCTION 


eee SELECT/ 
MREQ ———P! CONTROL 4 






length that signals must stay asserted requires a “counting” 
function. The precharge sequence at the end of all cycles also 
requires “counting”. This partitions most of the design into two 
smaller functional blocks; a shifter and a counter. The remaining 
function select and control logic is partitioned into a “multiplexer 
like” functional block. Figure 6 shows the PAL device partitioned 
into functional blocks. By dividing the design into blocks, its 
implementation becomes simple. 


Figure 7 shows PAL device equations for the PAL16R8A dynamic 
memory state sequencer. 


cP 


SHIFTER 1 
S195 


COUNTER 1 
S161 
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Figure 6. Partitioned Design/PAL Device Equivalent 
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Dynamic Memory Control State Sequencer 





TITLE DYNAMIC MEMORY STATE SEQUENCER 
PATTERN AMP3-285 
REVISION 01 


NAME JOE ENGINEER 
COMPANY ADVANCED MICRO DEVICES 
DATE 11/25/87 


CHIP MEMORY1 PAL16R8 


CK RFCK /RST RW /MREQ RMW FAST BS1 BSO GND 
/E /Q0  /Q1 /Q2 /RFSH /WE /CAS MS /RAS VCC 


EQUATIONS 
Q0 /RST* /MS*/Q0 
/RST*RFSH*RAS* /QO 
/RST* /FAST* /Q0*Q2 
/RST*/PAST* /Q0*Q1 
/RST*/FAST* Q1*Q2 
/RST* FAST*/RMW*Q0*/Q1 
/RST* FAST* /RMW*Q0* /Q2 


++eetettgiu 


Ql /RST*RAS*/QO0* Ql 
/RST*RAS* QO0*/Q1 
/RST*RAS* QO* Ql 


/RST*RAS*/QO0* Q2 


+++ 


Q2 /RST*RAS* Q2 
/RST* Q0*Q2 


/RST*RAS*QO* Ql 


++ 4 


RFSH /RST*RFCK* /Q2*/Q1*/Q0*/RAS 
/RST*RFSH*RAS 
/RST*RFSH*/FAST* Q1 


/RST*RFSH* Q2 


++4 


WE /RST* /RW* /MS* /RFSH* /RMW*/Q0*/Q2 
/RST* /RW* /MS* /RFSH* /RMW*/Q1*/Q2 
/RST*/RW*/MS*/RFSH* RMW*/Q0* Ql *Q2 


/RST*/RW*/MS*/RFSH* RMW* QO*/Q1 *Q2 


+++ 


CAS /RST* /RFSH* /MS* /QO 
/RST* /RPSH* /MS*/Q1 


/RST* /RFSH* /MS* /Q2 


+++ Hi 


/MS /RST* /RFSH*RAS* /QO 
/RST* /RFSH*RAS*/Q1 


/RST* /RFSH*RAS* /Q2 


++ tt 


RAS /RST* /RFCK*/Q0*/Q1 */Q2*MREQ* /BS1*/BS0 
/RST* /RFSH*/Q0*/Q1*/Q2*MREQ* /BS1*/BSO 
/RST* RFSH*/Q0*/Q1*/Q2 

/RST*RAS*/QO 

/RST*RAS*/Q1 

/RST*RAS* /Q2 


++ ete 


Figure 7. Source Listing for Dynamic Memory Control State Sequencer 
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8-Bit Error Detection 


and Correction 





Single bit error detection and correction for an 8-bit data word 
requires 4 check bits, making a 12-bit code word. The simplest 
code to design is a 12-bit Hamming code. To arrive at the code, 
we set up the following matrix: 


B7 B6 BS B4 B3 B2 Bi BO | C3 C2 C1 CO 
S3 X X X X | X 
$2 X X X X | X 
$1 xX xX xX xX X | xX 
So X xX X X xX xX 


The vertical columns are in a counting pattern, excluding 
the single bit values of 8, 4, 2, 1. The single bit values are 
assigned to the check bits C3-CO. By reading horizontally 
across the rows of the matrix, we get the check equations 
by exclusive OR’ing the data bits with X’s in that row and 
equating that to the check bit with an X in that row: 


C3 = B7 ® BB ® BS @ B4 
C2 = B7 ® B3 ® B2 @ BI 
C1 = B6 ® BS ® B3 @® B2@ BO 
CO = B6 ® B4 ® B3 ® BI @ BO 


The check bits are stored along with the data bits in the 
following message format: 


M12 M11 M10 M9 M8 M7 M6 MS M4 M3 M2 M1 
B7 B6 BS B4 C3 B3 B2 BI C2 BO C1 CO 


When a read occurs, the check bits are recalculated and 
compared with the stored bits to generate the 4 bit syndrome: 


S3 = B7 ® B6 @ BS ® B4 @ CG 
S2 = B7 @ B @® B2 @ BI @ C2 
S1 = B6 @ BS @ B3 @ B2 @ BO @ Cl 
SO = B6 @ B4 @ B3 ® B1 @ BO @ CO 


The 4 syadrome bits indicate the location of any single bit errors 
in the 12-bit message format which may than be corrected by 
inversion. 


The Hamming code works by introducing enough other code 
words to create a difference of exactly 3 bits between legal code 
words. All other code words are illegal. If, in storage, one bit 
flips, the result is an illegal word. In addition, there is only one 
word in the set of legal code words from which it could have 
come, hence the correction. 


The Hamming matrix and resultant sets of check bit and 
syndrome equations are selected so that when a single bit error 
occurs, the syndrome gives the position of that bit (either data 
or check) in the 12-bit message format. 

Example B7 Bo 

random data word: 1 1 011410 0 

check bits: 


C3 = B7 @ B6 @ BS @ B4 
= 1@1@0@1=+1 
C2 = B7 @ B3 @ B2 @ Bi 
= 16160616 0 = 1 
Ci = B6 @ B5 @ BS @ B2 @ BD 
= 1@ 0 @1@6%1@600 8 1 
CO = B6 @ B @ B3 @ B1 @ BO 
=1@1616¢0@0=F1 
message: 


M12 M11M10 M9 M8 M7 M6 MS M4 M3 M2 Mi 
1 $1414 0 14 747 +41 1:0 1 0 7 1 
B7 B6 BS B4 C3 B3 B2 Bi C2 BO C1 CO 


EDAC System Block Diagram 


ADDR DATA 
BUS BUS 














CHECK BIT CONTROL 
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8-Bit Error Detection and Correction 





assume no error: 


S3 = B7 @© B6 @ BS @ B4 @ C3 
= 106016060006 16 1 = 0 
S2 = B7 ® B @ B2 @ B1 @ C2 
= 1461060616 06 1 0 
S1 = B6 @ B5 @ B3 @ B2 ® BO @ Cl 
= 1000606106106 06%1 = 0 
SO = B6 © B4 @ B3 @ BI @ BO @ CO 
=1060106061606 06 1 == 0 
syndrome 0000 indicates no error 
assume data bit B7 flips (1 — 0): 
$3 =-0060160 0616 1 == 1 
S2=-006010601606 1 = 1 
S1 = 1460606016016 06 1 = 0 
SsoO=- 1061010600806 1 = 0 
syndrome 1100 indicates M12 or B7 is in error 
assume check bit C3 flips (1 — 0): 
S3 = 1461460 @ ® QO = 1 
S2 = 1@ 1 © ® 061 = 0 
Si=1@6¢006016061 0606 1 = 0 
SsO= 1060106160060 46 1 = 0 


syndrome 1000 indicates M8 or C3 is in error 


While the check bits can be generated with a 256x4 PROM if 
the tolerances are loose, high performance systems will need 
to latch or register the check bits to meet cycle time require- 
ments. Similarly, the syndrome bits can be generated with a 
4096x4 PROM but in both cases the PAL16X4 is the high 
performance choice. 


The worst case equations are S1 and SO with a 6 term exclusive 
OR. We use 2 properties of the exclusive OR to fit the equations 
into the 16X4: 


1. Associativity 
A®B@C=(A@B)OC 
2. A®B @C = ABC + ABC + ABC + ABC 
Since a 3 term exclusive OR can be realized with a 4 product 
sum in sum of products form, a 6 term exclusive OR can be 


realized by exclusive OR of 2 4 product sums. This is exactly 
the 16X4 configuration. 


Check Bit Equations 
C3= B7 @ B6 @ B5@ B4 
= (B7 - B6 + B7 - BG) @ (BS - B4 + BS - BA) 
C2= B7 ® B3@ B2@ BI 
= (B7 - B3 + B7 - B3) @ (B2- Bi + B2- B1) 
C1 = B6 @ BS @ B3 @ B2 @ BO 
= (B6 -B5- B3+ B6- 85: B3 + B6- BS: B3 + B6: BS - B3) 
® (B2 + BO + B2 -BO) 
CO=B6 © B4 @ B3@ B1®@ BO 
= (B6 -B4-B3+ B6-B4-B3+ BG - B4- B3 + B6- B4- B3) 
@(B1 - BO + Bi - BO) 


Syndrome Bit Equations 


S3= B7 ® B6 ® BS @ B4@ C3 
= (B7 - B6- BS + B7- B6- BS + B7 : BG B5 + B7 « B& BS) 
® (B4 - C3 + B4 - C3) 
S2= B7 @ B3 @ B2@ BI ® C2 
= (B7 - B3- B2 + B7 - B3- B2 + B7- BS - B2 + B7 - BS - B2) 
@ (B1 - C2 + BT: C2) 
S1= B6® BS © B3 @ B2@ BOS C1 
= (B6 -B5- B3 + B6- B5- B38 + B6- BS - B3 + B6 - BS - B3) 
® (B2 - BO- C1 + B2- BO: C1 + B2: BO: Ci + B2- BO) 
SO = B6 @ B4 @ B38 @ B1@ BOSCO 
= (B6 -B4- B3 + B6- B4- B3 + B6- B4- B3 + BG - B4 - B3) 
® (B1- BO- CO + B1 - BO- CO+ BI: BO- C0+ B1 - BO + CO) 


The error correction block decodes the 4 syndrome bits, and, if 
they are not 0000, inverts the indicated bit in the message 
format. The equations: 


M12 (= B7C) = S3-S2-S1+ S0@B7 
=§3-S2-+S1-S0-B7+(S3°S2- 51-50) -B7 
=§3-S2-S1-S0-B7+ S3-B7+S2-B7 +S1 
- B7+S0°B7 

M11 (= BEC) = S3-S2-S1-S0-B6- S3-B6-S2-B6 +S1 
- B6+S0-B6 

M10 (= B5C) = S3-S2+S1+S0-B5+ S3-B5+S2-B5 +Si 
- B5+S0-B5 

M9 (= B4C) =S3°S2-S1-SO-B4+ S3-B4+S2-B4 +S1 
* B4+S0 - B4 

M8 (=C3C) =S3+S2-S1-50-C3+ S3°C3+S2-C3 +S 
-C3+S0-C3 

M7 (= B3C) = S3-S2-S1+S0-B3+ S3-B3+S2-B3 +S1 
- B3+S0 -B3 

M6 (= B2C) = 53-S2-S1-50 -B2+ S3-B2+S2-B2 +Si 
+ B2+S0+B2 

M5 (=B1C) = S3-S2-S1-S0- Bi+ S3-B1+S2-B1 +S1 
* B1+S0- Bi 

M4 (=C2C) = S3-S2-S51-S0 -C2+ S3-C2+S2-C2 +S1 
-C2+S0-C2 

M3 (= BOC) = 53-S2-S1-S0-BO+ S3-B0+S2-B0 +S1 
- B0+ 50 - BO 

M2 (=C1C) =53-S2-S1+S0-Ci+ S3-C1+S2-C1 +S) 
*C1+S0-C1 

M1 (=COC) =S3-52-S1-SO0 -CO+ S3°CO+S2-C0+S1 
-CO-S0-CO 

ERROR =S3-S2-S1-S0 

ERROR is an active high error indicator available for error 

logging along with the 4 syndrome bits. 


To use 2 PAL16L8’s for the error correction block, we need 
only invert the message bits to get active true outputs. 





2-230 


ra] Monolithic Rd Memorles al 


8-Bit Error Detection and Correction 





Description 


This PAL device generates 4 check bits in a 12 bit hamming code word to provide error detection 
and correction on an 8 bit data word. 


PAL16X4 





CHECK 
BITS 








1 mononthic KK memories GN 2-231 


8-Bit Error Detection and Correction 





Description 


This PAL device generates the syndrome bits for a 12 bit hamming code word as a function of the 
8 data bits and the 4 check bits to point to any single bit in error. 


PAL16X4 


SYNOROME 
BITS 


CHECK 
BITS 
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8-Bit Error Detection and Correction 





Description 


This PAL device performs error correction of bits B2-B7 based on the 4 bit error syndrome S0-S3. 





PAL16L8 
SYNDROME De 19} B7C 
BITS 


'18) B6C 

o——{ 17] B5C 6 MSB 
ERROR 
16} Bac { CORRECTED 


BITS 
15} B3C 





oan 


i 













3] NC 
112] NC 
1} B2 
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8-Bit Error Detection and Correction 





Description 


This PAL device performs error correction of bits BO-B1 and checks bits CO-C3 based on the 4 bit 
error syndrome SO-S3. 


PALI6L8 
20] vec 
SYNDROME 18} BIC | 2LSB ERROR 
BITS Fa] soc | CORRECTED BITs 


C2C | CORRECTED 
CHECK 

cic 

coc 

ERROR 

NC 


co 
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PAL16 X4 PAL DESIGN SPECIFICATION 
CBG B. BRAFMAN 02/16/81 
CHECK BIT GENERATOR 

MMI FIELD APPLICATIONS ENGINEER YORBA LINDA, CALIFORNIA 

CHKCLK B7 B6 NC NC NC NC B5 BA GND 

/OC B3 B2 /CO /Cl /C2 /C3 Bl BO VCC 





C3 := B7 * /B6 :B7 :+: B6 
+ /B7* B6 : tt: 
+ GND 3DO NOT BLOW THIS PRODUCT LINE 
+ GND :DO NOT BLOW THIS PRODUCT LINE 
s+: BS*/B4 3B5 :+: B4 
+ /B5S* B4 
C2 := B7*/B3 :B7 :+: B3 
+ /B7* B3 : tts 
+ GND 3;DO NOT BLOW THIS PRODUCT LINE 
+ GND 3;DO NOT BLOW THIS PRODUCT LINE 
s+; B2*/B1 7B2 :+: Bl 
+ /B2* Bl 
Cl := B6* B5* B3 :B6 3+: BS :+: B3 
+ /B6*/B5* B3 
+ /B6* B5*/B3 ; tt: 
+ B6*/B5*/B3 
s+: B2*/B0 3B2 :+: BO 
+ /B2* BO 
CO := B6* B4* B3 3B6 :+: B4 :+: B3 
+ /B6*/B4* B3 
+ /B6* B4*/B3 3 tt: 
+ B6*/B4*/B3 
s+: B1*/BO 7Bl :+: BO 
+ /B1* BO 
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a A a TE 


PAL16X4 PAL DESIGN SPECIFICATION 
SBG B. BRAFMAN 03/13/81 
SYNDROME BIT GENERATOR 

MMI FIELD APPLICATIONS ENGINEER YORBA LINDA, CALIFORNIA 

SYNCLK D7 D6 BO Bl B2 B3 D5 D4 GND 

/OC D3 D2 A3 A2 Al AO D1 DO VCC 


IN THE ABOVE PIN LIST, THE FOLLOWING SUBSTITUTIONS HAVE BEEN 
MADE TO ACCOMODATE THE SPECIFIC FORMAT (FIXED SYMBOLS) FOR THE 
ARITHEMETIC PAL DEVICES IN PALASM SOFTWARE : 


=e =e we 


? D7 MEANS B7 BO MEANS /C3 (CHECK BIT 3) 
? D6 MEANS B6 Bl MEANS /C2 (CHECK BIT 2) 
; DS MEANS B5 B2 MEANS /Cl (CHECK BIT 1) 
7 D4 MEANS B4 B3 MEANS /CO (CHECK BIT 0) 
; D3 MEANS B3 AO MEANS /S3. (SYNDROME BIT 3) 
; D2 MEANS B2 Al MEANS /S2 (SYNDROME BIT 2) 
? Dl MEANS Bl A2 MEANS /Sl1 (SYNDROME BIT 1) 
; DO MEANS BO A3 MEANS /SO (SYNDROME BIT 0) 


BO-B7 ARE THE BITS OF THE DATA WORD. 
; THE SUBSTITUTIONS APPLY BELOW WITH THE EXCEPTION OF COMMENTS. 


ve 


/A0. D7* D6* D5 3B7 3+: B6 s+: BS 
/D7*/D6* D5 
/D7* D6*/D5 
D7* /D6* /D5 
D4* (/B0) 3B4 s+: C3 
/D4* ( BO) 


ee 
+ 
oe 


t++++ 450 


eo 


/Al 


++ eee 0 


D7* D3* D2 3B7 :+: B3 s+: B2 
/D7*/D3* D2 
/D7* D3*/D2 ; t+: 
D7*/D3*/D2 

D1* (/81) 7Bl s+: C2 

/D1*( B1) 


/A2 3: D6* DS5* D3 7B6 :+: BS :+: B3 
/D6*/D5* D3 

/D6* D5*/D3 : t+: 
D6*/D5*/D3 

D2* DO*( B2) 7B2 :+: BO :+: Cl 
/D2*/D0* ( B2) 

/D2* DO* (/B2) 

D2*/DO0* (/B2) 


tee tees 


/A3 3 D6* D4* D3 7B6 :+: B4 s+: B3 
/D6*/D4* D3 
/D6* D4*/D3 ; +e 
D6*/D4*/D3 

D1* DO*( B3) 7Bl :+: BO s+: CO 
/D1*/D0* ( B3) 
/D1* DO* (/B3) 

D1*/D0* (/B3) 


++ +Hetet 
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TITLE ERROR CORRECTION UNIT NO. 1 
PATTERN ECU1 

REVISION 01 

AUTHOR  B. BRAFMAN 

COMPANY MONOLITHIC MEMORIES, INC. 
DATE 11/06/87 


CHIP ECU1 PAL16L8 


/S3 /S2 /S1 /SO B7 B6 B5 B4 B3 GND 
B2 NC NC B2C B3C B4C B5C B6C B7C VCC 





EQUATIONS 
/B7C = $3* S2*/S1*/SO* B7 ;CORRECTION OF B7 

+ /S3*/B7 ;NO CORRECTION 

+ /S2*/B7 ;NO CORRECTION 

+ S1*/B7 ;NO CORRECTION 

+ §0*/B7 ;NO CORRECTION 
/B6C = §3*/S2* S1* SO* B6 ;CORRECTION OF B6 

+ /S3*/B6 ;NO CORRECTION 

+ §2*/B6 ;NO CORRECTION 

+ /S1*/B6 ;NO CORRECTION 

+ /S0*/B6 ;NO CORRECTION 
/B5C = §3*/S2* S1*/SO* BS ;CORRECTION OF B5 

+ /S3*/B5 ;NO CORRECTION 

+ §2*/B5 ;NO CORRECTION 

+ /S1*/B5 ;NO CORRECTION 

+ §0*/B5 ;NO CORRECTION 
/B4C = §3%*/S2*/S1* SO* B4 ;CORRECTION OF B4 

+ /S$3*/B4 ;NO CORRECTION 

+ §2%*/B4 ;NO CORRECTION 

+ §1*/B4 ;NO CORRECTION 

+ /S0*/B4 ;NO CORRECTION 
/B3C = /S3* $2* S1* SO* B3 ;CORRECTION OF B3 

+ §3*/B3 ;NO CORRECTION 

+ /S$2*/B3 ;NO CORRECTION 

+ /S1*/B3 ;NO CORRECTION 

+ /S0*/B3 ;NO CORRECTION 
/B2C = /S3* S2* S$1*/SO* B2 ;CORRECTION OF B2 

+ §3*/B2 ;NO CORRECTION 

+ /S2*/B2 ;NO CORRECTION 

+ /S1*/B2 ;NO CORRECTION 

+ S0*/B2 ;NO CORRECTION 


; SIMULATION NOT INCLUDED 
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TITLE ERROR CORRECTION UNIT NO. 2 
PATTERN ECU2 

REVISION 01 

AUTHOR B. BRAFMAN 

COMPANY MONOLITHIC MEMORIES, INC. 
DATE 11/06/87 


CHIP ECU2 PAL16L8 


/S3 /S2 /Sl1  /S0 Bl BO /C3 /C2 /Cl GND 
/CO NC ERROR COC C1C C2C C3C BOC B1C VCC 


EQUATIONS 
/B1C = /S3* $2*/S1* SO* Bl ;CORRECTION OF Bl 

+ §3*/Bl :NO CORRECTION 

+ /S2*/B1 ;NO CORRECTION 

+ §1*/Bl :NO CORRECTION 
/BOC = /S3*/S2* S1* SO* BO :;CORRECTION OF BO 

+ §3*/BO ;NO CORRECTION 

+ §2*/BO ;NO CORRECTION 

+ /S1*/BO ;NO CORRECTION 

+ /S0*/BO ;NO CORRECTION 
/C3C = $3*/S2*/S1*/S0* C3 :CORRECTION OF C3 

+ /S3*/C3 ;NO CORRECTION 

+ §2%*/C3 ;NO CORRECTION 

+ §1*/C3 ;NO CORRECTION 

+ §0*/C3 ;NO CORRECTION 
/C2C = /S3* S2*/S1*/SO* C2 ;CORRECTION OF C2 

+ §3*/C2 ;NO CORRECTION 

+ /S2*/C2 ;NO CORRECTION 

+ $§1*/C2 :NO CORRECTION 

+ §0*/C2 ;NO CORRECTION 
/C1C = /S3*/S2* S1*/S0* Cl ;CORRECTION OF Cl 

+ §3%*/Cl ;NO CORRECTION 

+ §2*/Cl ;NO CORRECTION 

+ /S1*/C1 ;NO CORRECTION 

+ §0*/Cl :;NO CORRECTION 
/COC = /S3*/S2*/S1* SO* CO ;CORRECTION OF CO 

+ §3%*/CO :;NO CORRECTION 

+ §2%*/CO ;NO CORRECTION 

+ §1%*/CO ;NO CORRECTION 

+ /S0*/CO ;NO CORRECTION 
/ERROR = /S3*/S2*/S1*/S0 ;NO ERROR! 


; SIMULATION NOT INCLUDED 
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Introduction 


The growing sophistication of computer systems has dictated a 
corresponding increase in memory storage capacity. Unfortu- 
nately, the speed of these large memories has not kept pace with 
that of their associated processors. Fast memories are still 
expensive; in order to reduce this cost, a cache memory system 
can be used to provide information to the processor without sac- 
rificing speed. Located between the central processing unit and 
the main memory a cache memory stores only the blocks of 
memory currently in use by aprogram. This provides fast access 
to program and data resulting in higher system performance. 
Once anew memory location is desired, blocks of cache and main 
memory are swapped to update the cache with the memory 
blocks currently inuse. Thus a cache memory can dramatically 
increase the performance of the entire computer system while 
maintaining minimal costs. Acache system with a 35 ns access 
time and a 50 ns cycle time is described below. 


The use of a fuse-programmable controller (FPC), Am29PL141, 
simplifies the design of such acache memory system. The simple 
internal architecture and instruction set of the FPC make this 
controller easy to implement in systems, and an on-chip micro- 
programmable fuse array provides a high level of integration. The 
cache controller algorithm is programmed and stored inthis array. 
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TAG BUFFER 
DATA 


be ne 









CONTROL 


MAIN MEMORY 


DATA ADDRESS CONTROL 


ADDRESS BUS 





Because the FPC operates at 20 MHz, as opposed to the 10 MHz 
clock frequency of other currently available controllers, it is 
especially well suited for high-performance applications. 


Cache Memory Systems 


Almost any system that requires a high-speed memory interface 
can benefit from the use of acache-memory system. The system 
bottleneck is typically the memory access time. Large dynamic 
RAMs that are currently available have access times of 100-180 
ns with cycle times of about 180-400 ns. The smaller, faster 
memory devices that can be used for a cache have access times 
of 25-30 ns and cycle times of around 40-50 ns. In order to 
achieve these lower access times and still have the memory 
capacity of the larger DRAM memories, a cache memory hierar- 
chy is used. 


Any application can make effective use of caches provided that 
the concept of “locality of reference” is operative. This concept 
states that, at any particular time, the addressing pattern of a 
program tends to be localized within a block or area of the total 
available address space. In other words, if data at a given 
address is accessed, it is likely that the next memory access will 
also be in that same block of addresses. In general, for systems 
where this concept does not apply, the use of a cache system 
would be difficult. 





DRAM 












CPU 
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DATA BUS DATA 
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Figuro 1. Block Diagram of Cacho Memory System 
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Sections of main memory data that can be identified as frequently 
used canbe storedincache memories to allow decreased access 
times, thereby increasing the overall performance of the system. 
As different sections of a program become frequently used, they 
can be transferred to the cache memory, displacing other sec- 
tions that are no longer needed. Some of the application areas 
where caches are used to increase the system performance are 
for mini/micro-computers, signal processing and image process- 
ing systems. 


The main components of the cache memory system are the 
cache data memory, the cache tag buffer, the replacement logic, 
and tne cache controller (see Figure 1). 


Cache Data Memory 


The cache data memory is the small, fast memory that contains 
the most often used data, copied from the slower main memory. 
The cache data memory size directly affects the performance of 
the entire system. The larger the cache data memory, the more 
data that can be accessed at the faster speeds. However, if the 
size of the memory device gets very large, the performance of the 
memory decreases. The larger the memory devices are, the 
longer are their access times, and, consequently, their cycle 
times. The optimum data cache size will vary depending on the 
application/algorithm being executed. 


Tag Buffer 


An identifier (or tag) accompanies every block (a contiguous set 
of data words) that can be transferred to or from main memory. 
This tag is stored in the tag buffer. This tag identifies where the 
data is located in main memory. To determine whether a block of 
data is already in the cache memory, the tags stored in the tag 
buffer are compared to the desired tag (Figure 2). Various 
methods exist for searching the tag buffer for a block of data. 
Three of the most popular approaches are: 


1. Fully associative 
2. Direct mapped 
3. Set-associative 


Infully associative caches, any memory word can be found in any 
of the cache-datablocks. The memory address is divided into two 
sections: the TAG and the BLOCK fields. The TAG is used to 
determine if the block of data is in the cache and the BLOCK 
indicates a data word within that block. 


For adirect-mapped cache the memory word can be found in only 
one of the cache blocks. The memory address is sectioned into 
three fields: the TAG, the INDEX, and the BLOCK. The BLOCK 
indicates the data word in the block. The INDEX selects the 
program block that contains the word, and the TAG is used to 
determine if the block is in the cache. 


The third way of searching the cache for data words is the set- 
associative method. This divides blocks into sets and allows the 
cache block to be located in any one of these sets. Typically, the 
number of sets used are two, four, or eight. The TAG, INDEX and 
the BLOCK fields are similar to those in the direct mapped 
techniques. Figure 2 shows the three different address tech- 
niques and their fields. 


When a tag search is performed in the tag buffer, either a hit or 
miss condition results. A hit indicates that the desired word is in 
the cache; a miss indicates that the desired word must be ac- 
cessed from the main memory or a transfer from main memory to 
cache is needed. The performance of the cache system is 
dependent on the ratio of the number of hits to the total number 
of memory references made, which is called the hit ratio. The hit 
ratio depends on the size of the cache memory selected and the 
application of the system using the cache memory. Ahigh hit ratio 
implies that most of the data accessed by the CPU was in the fast 
data-cache memory when requested. The performance of the 
entire system gets degraded if a poor hit ratio occurs. This is why 
“locality of reference” is important for performance. 


Replacement Algorithms 


The replacement algorithm is used to decide the cache location 
to replace with new data when a miss occurs. Three of the most 
common data replacement algorithms currently in use are: 


1. First-in First-out (FIFO) 
2. Least Recently Used (LRU) 
3. Random Replacement (RR). 


The choice between these three techniques is dependent upon 
the nature of the data being transferred to and from the cache. 
The FIFO algorithm simply selects the oldest block of data written 
into the cache to be replaced. After the cache buffer is full, data 
blocks are replaced sequentially. In this manner, even a data 
block which is used very often, will be replaced. 


The LRU technique selects the data block to be replaced by 
determining which block in the cache was least used as com- 
pared to the other blocks at that time. With this strategy, the 
chances that the block just replaced in the cache will be required 
again soon is assumed to be relatively small. 


FULLY ASSOCIATIVE CACHE 
MEMORY SYSTEM 


TAG BLOCK 


TAG IDENTI- DATA WORD 
FIES IF BLOCK = IN CACHE 
IS IN CACHE BLOCK 


DIRECT MAPPED MEMORY 


TAG INDEX 


TAG IDENTI- INDEX FOR 
FIES IF BLOCK CACHE 
IS IN CACHE 


BLOCK 


DATA WORD 
IN CACHE 
BLOCK 


SET-ASSOCIATIVE 
CACHE MEMORY 


TAG INDEX BLOCK 
TAG IDENTI- INDEX FOR DATA WORD 
FIES IF BLOCK CACHE IN CACHE 
IS IN CACHE BLOCK eee 


Figure 2. Cache Organizations 
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The third method, the random replacement approach, randomly 
chooses the block of data to be replaced. The easiest to 
implement, this technique is optimum for some applications. 


Another important design consideration for cache memory sys- 
tems is the method used for keeping the cache data and main 
memory updated. When writing to the cache memcry, the data 
change must be echoed to the main memory before that data 
word is replaced by new data in the cache block. One way this is 
handled is the “store-through” method. On awrite command, the 
data is written to the cache as well as the main memory simulta- 
neously. In this way, the main memory data is kept current. This 
also saves the task of writing back old cache data into main 
memory when replacing a new data block into cache. 


. 
‘MICRO ADDRESS 
1 CONTROL LOGIC 










TEST : 
INPUTS ; 
T(5:0} [D i 
BRANCH CONTROL 
LOGIC (GOTO) 7 
EQ DETECT 
we 
cc [> 6 


CONDITION CODE 
SELECT LOGIC 


MICROINSTRUCTION 
DECODE LOGIC 





ro ee are aoe ocean aed aa eee 3 
2 6 MICROPROGRAM 
MEMORY 
V/ Y Y 


Cache Controller 


All of the signals that are required to keep track of the various 
functional blocks involved in the cache memory system are 
controlled by the cache controller. This controller monitors the 
state of the tag buffer search, the CPU, main memory and cache 
data memory condition, as well as handling all memory transfers 
for hits or misses and read and write operations. The high- 
performance and integrated functions available on a single chip 
make the design of a cache controller using the Am29PL141 
simple. The FPC provides 16 control signals for off-chip control 
and seven input test pins for monitoring external conditions. The 
flexible instruction set permits easy microprogramming of the 
desired function. 


DECREMENTER 
(COUNTER =1) 












COUNT REGISTER 
(CREG) 





SUBROUTINE REGISTER 
(SREG) 










*NOTE: THESE PINS AVAILABLE ONLY IN SSR MODE. 
“NOTE: THESE PINS AVAILABLE ONLY IN NORMAL MODE. 
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Figure 3. FPC Detailed Functional Diagram 
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The Am29PL141 Fuse-Programmable Controller 


The Am29PL141 is a_ high-performance, single-chip, fuse- 
programmable controller (FPC). This chip is designed to allow 
implementation of complex state machines and controllers by 
programming the appropriate sequence of microinstructions in 
the on-chip microprogram memory. With its intelligent micropro- 
gram address sequencer, high-speed 64 x 32 bit microprogram 
memory, pipeline register, and an on-chip diagnostics register, 
this chip offers the benefits of low chip count, fast operation, easy 
development, and testability. This device is available with a 20 
MHz clock rate (50 ns cycle time) in a 28 pin dual-in-line package. 
Amicroprogram address sequencer is the heart of the FPC. The 


Am29PL141 has 29 high-level microinstructions which include 
jumps, loops, subroutine calls, and multiway branching. These 
microinstructions can be conditionally executed based onthe test 
inputs. As a single chip solution, the FPC is designed to ease the 
implementation of distributed microprogrammed systems. It can 
be used to off-load the central controller by serving as an 
intelligent distributed controller for various functional units. Its on- 
chip diagnostics register is used to control and observe the 
parallel pipeline register during diagnostics mode. This capability 
provides both controllability and observability for the functional 
blocks connected to the FPC control outputs. Ablock diagram of 
the FPC is shown in Figure 3 and the micro-instruction format is 
shown in Figure 4. 


31 30 26 25 24 22 21 16 
[ OE | [ orcone | i POL | [ TEST ] [ DATA | 


WHERE: 


OE = SYNCHRONOUS OUTPUT ENABLE FOR P[15:8] 
OPCODE = A FIVE-BIT OPCODE FIELD FOR SELECTING 
ONE OF THE TWENTY-EIGHT SINGLE DATA 
FIELD MICROINSTRUCTIONS 


POL = A ONE-BIT TEST CONDITION POLARITY SELECT 


0 = TEST FOR TRUE (HIGH) CONDITION 
1 = TEST FOR FALSE (LOW) CONDITION 


TEST = A THREE-BIT TEST CONDITION SELECT 


TEST [2:0] 


000 
001 
010 
011 
100 
101 
110 
111 


UNDER TEST 


TO] 
Tt] 
Tr) 
TB) 
Tf] 
T{S] 
cc 
EQ 


DATA = A SIX-BIT CONDITIONAL BRANCH MICROADDRESS, TEST 
INPUT MASK, OR COUNTER VALUE FIELD DESIGNATED 
AS PL IN MICROINSTRUCTION MNEMONICS 


THE SPECIAL TWO DATA FIELD MICROINSTRUCTION FORMAT IS SHOWN BELOW: 


31 30 28 27 22 21 16 
[ OE | [ orcove | [ CONSTANT | [ DATA | 


WHERE: 


OE = SYNCHRONOUS OUTPUT ENABLE FOR P[15:8} 
OPCODE = COMPARE MICROINSTRUCTION (BINARY 100) 
CONSTANT = A SIX-BIT CONSTANT FOR EQUAL TO 
COMPARISON WITH T°M 


DATA = A SIX-BIT MASK FIELD FOR MASKING THE 
INCOMING 7[5:0} INPUTS 


Figure 4, Microinstruction Format 442 04 
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System Overview 


The cache scheme selected for implementation is a set-associa- 
tive cache with four sets (see Figure 5). The 18-bit physical 
memory address word is divided into 8 bits of tag, 8 bits of index, 
and 2 bits of block address. Four 256-byte deep, index-ad- 


BLOCK ADDRESS 
INDEX ADDRESS 





TAG ADDRESS 


TAG INDEX BLOCK 
ADDRESS } ADDRESS }| ADDRESS 


dressed tag buffers are used in this scheme. These tag buffers 
contain four possible sets of tags for the same index location. 
Corresponding to each tag buffer there are four 16-bit wide cache 
memories, which contain four (block) words for each index 
location. The physical block address is also used to address 
these cache memories. 


Figure 6 shows the actual implementation where the main system 
controller is the Am29PL141 FPC. There is additional glue logic 
in the form of one PAL18P8 and one PALI6R6. The PAL18P8 is 
used in the cache access path and is very fast. It generates the 
cache output enable, cache selects, and write signals for the 
cache. The PAL16R6 is used to generate the Ready signal to the 
system CPU. It is also used to inform the FPC of the status of 
read/write cycles. All the PAL devices are controlled by the FPC 
for generation of the system control signals. 


System Operation 


There are four main system cycles possible: read and hit, read 
and miss, write and hit and write and miss. The response of the 
controller is different for each system operation cycle. For read 
& hit cycle the FPC performs no action. Only on detection of write 
cycle, orread & miss cycle, the PAL16R6 asserts the CYCL signal 
initiating FPC action. : 


Read and Hit Cycle 


When the system CPU requests a read cycle, it sends out an 18- 





442 05 
TAGADDO-7 IDXADDO-7  BLOCKO~1 bit address. The most significant eight bits (Figure 6) 
TAGADDO-7 are the eight bit tag which is compared to the tags 
Figure 5. A Set-Associative Cache Scheme stored in the four tag buffers to detect a hit or a miss. The next 
eight bits IDXADDO-7 for index, address the correct tag buffer 
RDY TO CPU 
DRDY 
DOEL 
DREQ 
Am29PL141 CTRST 
DRW 
BLDBL 
BADO 
BAD1 
CLKI (40 MHz) qh CACWRT 
TAGWR 0-3 
H/M 0-5 
MREQ 
RHWL 
4 
CACWR 0-3 
IDXADD 0-7 CACS 0-3 
4 
CACWRT 
442 06 
Figure 6. Control Section for Cache System uses one FPC and two PAL Devices. 
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locations. These eight bits are also used to access the four cache 
sets simultaneously. The least significant BLOCKO—1 block 
address are directly used to address the four cache sets. A 
cache CACS (chip select) is generated by the PAL18P8 to access 
the four cache sets simultaneously. Once the tag buffer match 
occurs, the selected tag buffers asserts the hit signal. The hit 
signal is used to enable the outputs of the corresponding cache 
set, in order to provide the correct cache data to the CPU. The 
flow chart is shown in Figure 9. 


Read and Miss Cycle 


if the proper tag does not appear in any of the four tag buffers, a 
miss signal is generated by the tag buffers on H/MO-3 signallines 
(Figure 7 and 8). On detection of the four miss signals the 
PAL16R6 removes the “RDY” signalto CPU (for CPU wait cycles) 
and informs the FPC of the MISS by asserting the CYCL & HIT/ 
MISS signals. When the FPC detects this read and miss, it 
latches the current physical address by asserting the ADLATCH 
signal, and initiates a DRAM (System Memory) cycle by asserting 
the DREQ signal, to provide the data to the CPU from the system 
memory. When this data is available signalled by the DRAM data 
ready signal DRDY, the FPC signals the assertion of the RDY 
signal, allowing the CPU to read the requested data. Simultane- 
ously the FPC asserts the INHIBIT signal which disallows further 
CPU access to the cache. 


The FPC then performs an update algorithm which brings the 
data from the system memory to the cache. It asserts the CTRST 
signal which disables the CPU data lines to the cache system. It 
also disables CPU block addresses by asserting BLDBL signal 
and enables its own output address lines BADO—1 in order to 


address different memory words within a block itself. It then 
performs read cycles on the system memory by using DREQ 
(DRAM memory request), DRW (read or write) and DRDY 
(DRAM data ready) signals and write cycles on all the blocks of 
the corresponding cache set by asserting CACWRT signal. The 
replacement set is determined by the LRU control block which 
selects one of the four sets. 


Upon completion of the update of new cache data, the FPC 
removes the INHIBIT signal to allow further CPU access to the 
cache. 


Write and Hit Cycle 


The write cycle operation is differentfrom the read cycle, since the 
main memory has to be updated along with the cache. This is 
called the “write-through” strategy. Since typical programs re- 
quire only 10-15% of the processor time for write cycles, this does 
not impact the overall performance. For every write cycle for 
which the tag buffer signals a hit, the CPU writes the data to the 
cache. In addition, the FPC initiates a DRAM (system memory) 
cycle and updates it with the same data (see Figure 10). 


Write and Miss Cycle 


For awrite and miss cycle, the function of the system is similar to 
the read and miss cycle, except that a write of the main memory 
is performed instead of aread. After the write to the main memory 
is completed, the FPC executes the update algorithm to bring the 
contents of that memory block into the cache. The flow chart for 
the microcode executed by the FPC for the write cycle is shown 
in Figure 10. 
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Figure 7. Memory Organization of Cache System 
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Figure 9. Read Contro! Flow Chart 
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Figure 10. Write Control Flow Chart 








al Monolithic AR Memorles Pm | 2-247 


Fuse Programmable Controller Simplifies Cache Design 





FPC Operation 


The System Controller has five main functions: 


. Test for a cycle request (CYCL) 

. Test for different cycle types 

. Execute appropriate action for each cycle 
. Execute update algorithm if required 

. Generate update addresses if required. 


aAhOaND — 


Test for Cycle Request 


The FPC samples the CYCL signal continuously. The CYCL 
signal is generated only for read & miss, write and hit, and write 
and miss cycles. The system requires FPC intervention only for 
these three cycles. When CYCL is asserted, the FPC tests forthe 
three cycle types. 


Test for Cycle Type 


This involves sampling the CPU, memory request MREQ, and 
read or write (RHWL) signals. On the basis of these signals the 
FPC decides between the initiation of read or write cycles. It then 
samples the HIT/MISS signal generated by the PAL16R6 to 
decide on the appropriate action to be taken. 


Action for Each Cycle 


Once the FPC determines the CPU cycle type and whether the 
location exists in the cache (HIT or MISS), it then performs 
appropriate actions such as dynamic RAM (System Memory) 
cycle, and Update algorithm. The functions performed by the 
FPC for each cycle type were detailed earlier. 


Update Algorithm 


On detection of a CPU cycle and a MISS, the FPC performs the 
update algorithm. It consists of reading data from the system 
memory (DRAM) and writing that data to the appropriate cache 
set. The address for this replacement is the one on which the CPU 
cycle MISS occurs, which in turn is latched by the FPC on 
detection of the MISS. The selection of the appropriate cache set 
is dependent upon the information provided by the replacement 
logic block. The update is done for all four blocks (words) of data. 
During this update, the FPC isolates the CPU data bus from the 
cache data bus by asserting the CTRST signal. 


Update Address Generation 


The replacement address is the CPU address at which the MISS 
occurs. However, all the four blocks for the same memory 
address should be replaced in the cache. To generate these four 
addresses, two of the FPC's eight three-state lines are used. The 
two least significant bits of the CPU’s physical address (Block 
Address bits) are disabled by the FPC by asserting the BLDBL 
signal, and the two output bits of the FPC (three-state control 
lines) are enabled onto the address bus. This is done ina manner 
to allow the system CPU to continue to work with the address and 
data bus, isolated from the cache system, during the execution of 
the replacement algorithm. 


The replacement algorithm then executes read cycles on the 
memory and write cycles on the cache for all the four block 
addresses. After the replacement is complete, the control returns 
for sampling of new CPU cycles. The design file for the Cache 
controller is shown in Figure 12. 


The replacement set selection can depend upon various selec- 
tion algorithms as described before. This set selection logic has 
not been shown in this design. However, the algorithm used here 
is LRU (Least Recently Used) method, and only its interface with 
the cache controller is shown. 


Timing and Performance 


For acache read cycle when a hit occurs, the worst case access 
time for the cache system is the tag buffer delay (20 ns) along with 
the cache output enable delay (15 ns). The worst case access 
time for this cache implementation is 35 ns. The cycle time for the 
system will be a little longer than the access time. 


The 50 ns cycle time is a very high speed cache performance, 
considering that the usual cycle time for DRAMs is in the range of 
280-400 ns. Furtherimprovements inthe cycle time can be made 
by using PAL devices with a faster propagation delay of 10 ns. 


CALL REPLACEMENT 
SUBROUTINE 


SET UP BLOCK 
ADDRESS FOR 
DATA TRANSFER 


START DRAM 
READ CYCLE 


CACHE WRITE 
DATA 


REPEAT FOR THREE 
MORE DATA WORD 
TRANSFERS WITH 
CORRESPONDING 
BLOCK ADDRESSES 





READY FOR NEXT 
CPU MEMORY REQUEST wads 


Figure 11. Flow Chart for Cache Update Scheme 
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device (am29p1141) 


default = 0 
define 
mMreq = 
rhwl = 
hitm = 
drdy = 
nocdn = 
intword 
begin 
init : od 
start : od 
od 
read : od 
latch : od 
od 
od 
od 
od 
write od 
wrtht : od 
od 
od 
od 
wrtms : od 
od 
od 
od 
od 
rplce : od 
oe 
oe 
oe 
waitl : od 
oe 
oe 
oe 
wait2 : od 
oe 
oe 
oe 
wait3 : od 
oe 
oe 
oe 
od 
end. 


. 
¢ 


to 
tl 
t2 
t3 
t4 
= 0O7d#h; 


intword, 


intword, 


intword, 


intword, 


079#h, 


069#h, 
063¢h, 
063#h, 


intword, 


intword, 


Ofdth, 


O04d#th, 
O4f#h, 
O4f#h, 


intword, 


049¢h, 
O4b#h, 
O4b#h, 


intword, 


O7b#h, 


022#h, 
022th, 
Oa2#th, 
l3at#h, 


122#h, 
122#h, 
la2th, 
23afth, 


“222#h, 


222¢h, 
2a2¢#h, 
33a¢#h, 


322¢h, 
322#h, 
3a2¢th, 
O3at¢th, 





“No condition" 
“Initialize output word" 


continue ; "Initialize Outputs" 
if (mreq) then goto pl(start); 

"Check for memory cycle" 
(not rhwl) then goto pl(write); 

"Check for read or write" 


if 


if (hitm) then goto pl(start); 


"Read & hit goto start” 
if (not drdy) then goto pl(latch); 

"Read & miss latch address" 
continue; "DRAM req for CPU data" 
continue; “DRAM output enb & CPU cycle" 
if (nocdn) then call pl(rplce); 

"Goto replacement subroutine" 
if (nocdn) then goto pl(start); 
“Read cycle complete" 


if (not hitm) then goto pl(wrtms); 
"Write hit or miss" 


if (not drdy) then goto pl(wrtht); 

"Hit write to cache" 
continue; "Write to DRAM" 
continue; 
if (nocdn) then goto pl(start); 

“Write hit complete" 


if (not drdy) then goto pl(wrtms); 

"Miss check DRAM for write" 
continue; 
continue; "Latch address & DRAM write" 
if (nocdn) then call pl(rplce); 

"goto replacement algorithm" 
if (nocdn) then goto pl(start); 

“Write miss complete" 


if (not drdy) then goto pl(rplce); 

"This is replace routine” 
“Isolate CPU databus" 

“DRAM data enb for cache write" 
“Write to cache at address 00" 
goto pl(waitl); 

“Change Block address” 


continue; 
continue; 
continue; 
if (not drdy) then 


continue; “Repeat as before address 01" 
continue; 
continue; "Write to cache at address 01" 


goto pl(wait2); 
“Change Block address" 
“Repeat as before address 02" 


if (not drdy) then 


continue; 
continue; 
continue; 
if (not drdy) then 


"Write to cache at address 02" 
goto pl(wait3); 
"Change Block address" 


continue; “Repeat as before address 03" 
continue; 
continue; "Write to cache at address 03" 


if (nocdn) then ret;"Return from replace routine" 


Figure 12. Cache Controller Design File 
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PAL22RX8A Provides Control and 
Addressing for a 32-Location-Deep 


RAM-Based LIFO 


AN-164 








Two components used in the temporary storage of data are 
LIFO (Last In, First Out), and FIFO (First In, First Out) buffers. 
Usually, a FIFO would be used to store data between asyn- 
chronously operating devices. For example, data sent from 
one microprocessor system to a second might reside tempo- 
rarily in a FIFO until the second microprocessor is ready to 
read the data. Sometimes the FIFO is known as an asynchron- 
ous temporary buffer store. 


The LIFO also offers temporary storage and will store data in 
the same way as a FIFO but the sequence of reading the data 
will be reversed. LIFO applications in digital processing sys- 
tems would usually be to store return addresses and essential 
registers when the processor is executing a branch to subrou- 
tine or interrupt service routine. The processor will retrieve this 
information when it executes return instructions. The LIFO is 
an essential component in the processor's ability to perform 
context switching. 


The LIFO is sometimes called a stack, and the process of writ- 
ing to the stack is called a PUSH operation and reading data is 
a POP operation. In the central processor dedicated instruc- 
tions for PUSH and POP operations are decoded and executed 
to save and restore return addresses, contents of accumula- 
tors, status registers, and index registers. Other LIFO applica- 
tions could be found in data buffering where the order of data 
reception should be reversed. 


The concept of the stack is used in performing Reverse Polish 
Notation (RPN) mathematics, and it is claimed by some to be 
superior in performance to the conventional algebraic process- 
ing technique. With RPN, instructions and operands are 
pushed onto the stack, and evaluated as they are popped off; 
the result is usually left in the general purpose register. 


When a memory is used as a stack, the evaluation process 
may be repeated because the initial evaluation did not destroy 
the RAM contents. This non-destructive reading of data is a 
feature of RAM-based FIFO and LIFO architectures, and is not 
the case for register-based architectures. FIFOs and LIFOs 
made from register arrays rely on data shuffling through the 
array. Data in the registers are overwritten during each cycle. 
The RAM control for a LIFO essentially consists of an Up/Down 
counter to address locations in the memory, and a control 
sequencer to drive the memory's chip select (CS) and write 
(WR) inputs. 





The new PAL22RX8A has been programmed to perform a 
control function for a small 32-location-deep stack. A two-chip 
solution is achieved by using an Am 9128 static RAM and a 
PAL22RX8A (see Figure 1), for bytewide applications. 
Additional RAM devices may be added in parallel for 16- and 
32-bit wide applications. A deeper LIFO may be configured by 
cascading a second PAL22RX8A to create a higher-order Up/ 
Down counter. 


PAL22RX8A 
CSX 


Am9128 
2Kx8 DATA 


vo 


RAM OPERATION 


[x_[_ 4 | tree-srare | 
He | baraour 


Figure 1. 32-Location-Deep LIFO Controller Uses PAL22RX8A 





Figure 2 shows the block diagram of the PAL22RX8A device. 
Fourteen dedicated inputs drive the single fuse array through 
true/complement buffers. One of those inputs includes the 
clock, so logic operations may be performed on the clock or its 
complement. 
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Figure 2. Block Diagram 


Note: 
PLCC pin numbers are indicated in parentheses. 
PLCC pins 1, 8, 15 and 22 are not connected. 


There exist eight macrocells in the device. The schematic view 
of the macrocell architecture is shown in Figure 3. The D-type 
register is the storage element in the device and is clocked 
from a single clock input. The D input is fed from an exclusive- 
OR (XOR) gate, the two inputs of which are driven from one 
product term and a sum of eight product terms. This XOR gate 
can be used to control the polarity at the output pin; the single 
product term input can be used to create polarity inversion. 


OE 


XOR 





Figure 3. PAL22RX8A Macrocell 


With regard to registered output configurations, the XOR gate 
may be used to configure alternative registered functions. It 
can be shown that a J-K flip-flop may be derived by using feed- 
back, the sum-of-product inputs, and the XOR input, as shown 
in the equations below. Three out of the five equations use the 
XOR gate to create the J-K function. 


1.Q:=QU+Q°K 

OQ := QS +Q*K 

.Q:= Q 24: (Q*U + Q*K) 

Q:= Q 24: (Q*U + Q*K) 

Q:=Q:4: (QS + Q*k) 

Also, S-R and other subset functions may be configured in the 
macrocell. 


There exist two multiplexers in the macrocell which select 
either a registered or combinatorial output path. If the control 
fuse is intact, the output is registered, and the feedback into 
the fuse array is through a buried feedback path from the Q 
node. A combinatorial route may be selected by programming 
the fuse; the register is flushed and feedback to the array is 
from the output pin. Programmable !/O may be achieved in this 
configuration by controlling the three-state condition of the in- 
verting output buffer. A single product term can enable the out- 
put buffer with a logic High or disable it with a logic Low. When 
in a three-state condition the output pin may be used as an in- 
put. 


When using the flip-flop in the macrocell it is possible to use 
the asynchronous Reset or Preset feature. A single input pin 
may be dedicated to a Reset or Preset function, and driven ac- 
tive to initialize the register in the device. 


af wo LP 


Toggle Function as a J-K Subset 


A frequently used subset of the J-K function used in binary 
count applications is the Toggle (T) function. If J and K are tied 
together and driven with a logic High then the flip-flop output 
will Toggle when clocked; if driven Low the output will Hold. For 
a given flip-flop (N) in the count sequence the toggle equation 
will be: 

6. QN := QN :+: QN-1*QN-2*...Q1*Q0. 

This will cause the flip-flop to Hold until the product term of all 
the lesser significant registers goes to a logic High; then, the 


flip-flop will toggle polarity. This equation is suitable for an Up 
count. A Down count is given by: 


7. QN := QN :+: QN-1*QN-2"..Q1°Q0. 
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Both equations can be combined with an Up/Down control to sequence will be "Increment then Write" for a PUSH operation 
effect an Up/Down counter. With regard to the LIFO applica- and "Read then Decrement" for a POP operation. The timing 
tion, the Up/Down control for the address input to the CMOS waveforms (Figure 4) show the sequence for PUSH and POP 
RAM will be decoded from the PUSH or POP instruction. The cycles. Note that the LIFO starts from memory address zero. 


. AyteA Leh Leh oy ae 


push = P| WA) 
WA Sa 








cs 
— 
ADDR n n+1 
——————— i SSS 
Active Write Cycle - PUSH = 
1. PUSH input goes active High at tsu or greater 3a. CS goes active Low. 
before the clock edge. 4. Third clock edge; PUSH still active High. 

2. First clock edge samples the active PUSH input. 4a. CS goes inactive High. 
2a. WR goes active Low. 5. Fourth clock edge; PUSH still active High. 
2b. Address increments. . 5a. WR goes inactive High. 
3. Second clock edge; PUSH still active High. 6. PUSH instruction is removed. 


cuK 





ADDR n n-1 
—_——_—_—_—_——— 

Active Read Cycle - POP 
1. POP instruction goes active High at tsu or 4. Third clock edge; POP still active High. 

greater before the clock edge. 4a. WR goes active Low; the memory is deselected 
2. First clock edge samples the active POP input. so no Write will take place. 
2a. CS goes active Low to select the RAM. 4b. Address lines decrement. 
3. Second clock edge; POP still active High. 5. POP instruction is removed. 
3a. CS goes inactive High. 5a. WR goes inactive High. 


Figure 4. LIFO Conirol Timing Diagrams 





2-252 ra Monolithic Kasi Memories a\ 


PAL22RX8A 





The control outputs to the RAM are CSX and WR. An additional 
register (XO) is required to enable all states to be reached; two 
registers alone are not enough. From the timing waveforms it 
can be seen that an active write cycle is initiated after an ac- 
tive PUSH input has been sampled by the system clock. The 
WR signal then toggles, causing the output pin to go active 
Low. This condition is maintained until the control register XO 
goes High, at the end of the PUSH cycle. 


The equations after the XOR symbol in the design specification 
(page 2-255) cause the toggle function; when these product 
terms become active High a toggle operation is effected in the 
associated flip-flop. For example, the CSX flip-flop is in a Hold 
state until WR goes Low and while XO is High during a PUSH 
sequence; during a POP sequence, CSX holds unless both WR 
and X0 are High. The equations are written for each flip-flop by 
using one product term after the XOR label to effect the 
change of state or toggle condition to fit the requirements of 
the state machine. Without the XOR gate, an Up/Down counter 


would require more than the eight product terms available in 
standard PAL devices. 


Instruction Decoding 


Although not shown in the design specification, it is possible 
for the PAL device to decode instructions for PUSH and POP. 
Assuming the unused input pins are converted to 10-I7, which 
come from an instruction register in a processor system, then 
decoding could be achieved in the fuse array. If an additional 
input VI qualifies a valid instruction and the code for PUSH is 
given as 17*16*15*14*13*12"11*I0*VI, then this equation could 
be added to the string declaration statement of the design 
specification. The PAL device could decode POP operations 
similarly. In this way, external decoding logic may be incorpo- 
rated into the PAL device, which should increase the operation 
speed. 
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TITLE LIFO RAM CONTROLLER 
PATTERN 01. 

REVISION 04. 

AUTHOR CHRIS JAY. 

COMPANY MMI SANTA CLARA, CA 
DATE 23 APRIL 1987 


CHIP LIFOCONT PAL22RX8 


7;THE PAL22RX8 HAS BEEN DESIGNED TO CONTROL 32 RAM 
LOCATIONS AS A LIFO RAM CONTROLLER. THE PAL DEVICE 
DESIGN SPECIFICATION CONSISTS OF TWO STATE MACHINES. 
;ONE STATE MACHINE IS AN UP/DOWN BINARY COUNTER THAT 
*;DRIVES THE ADDRESS LINES OF THE RAM, AND THE SECOND 
;DRIVES THE /WR AND /CS LINES OF THE RAM TO CONTROL 
;WRITE AND READ OPERATIONS. DURING A PUSH OPERATION 
;DATA IS WRITTEN TO THE LIFO; THE PROCEDURE IS INC- 
;REMENT ADDRESS COUNTER AND WRITE DATA. FOR A POP 
OPERATION THE READ CYCLE IS PERFORMED FOLLOWED BY 
7AN ADDRESS DECREMENT. THE XOR GATE IN EACH MACRO- 
7;CELL ALLOWS INCREMENT AND DECREMENT OPERATIONS 

7;TO BE EFFECTED WITHOUT UTILIZING A LARGE NUMBER OF 
;PRODUCT TERMS. 


; PIN 1 2 3 4 5 6 


CLK PUSH POP NC NC NC 
PIN 7 8 9 10 11 12 
NC NC NC NC /RST GND 
;PIN 13 14 15 16 17 18 
NC NC /CSX /X0 /WR /Q4 
PIN 19 20 21 22 23 24 
/Q3 /Q2 /Q1 /Q0 NC vcc 
GLOBAL ;GLOBAL TERM 


;ENABLES THE 
;GLOBAL RESET 


STRING INC '/CSX*/WR*/X0*PUSH' ; INCREMENT LABEL 
STRING DEC '/CSX*/WR* X0*POP' ;DECREMENT LABEL 


. 
, 


Figure 6. LIFO RAM Controller Design File 
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EQUATIONS : 
GLOBAL.RSTF = RST ;RESET AT POWER ON 
QO 3= Q0 :Q0 HOLD 
z+: INC ;INC ADDRESS FOR PUSH 
+ DEC ;DEC ADDRESS FOR POP 
Ql := Ql :Q1 HOLD 
s+: QO* INC s;INC Ql 
+ /Q0*DEC ;DEC Ql 
Q2 := Q2 :Q2 HOLD 
zt: QO* Q1*INC ;INC Q2 
+ /Q0*/Q1*DEC >DEC Q2 
Q3 3= Q3 :Q3 HOLD 
tt: QO* Q1* Q2*INC ;INC Q3 
+ /Q0*/Q1*/Q2*DEC ;DEC Q3 
Q4 := Q4 :;Q4 HOLD 
Ce QO* Q1* Q2* Q3*INC sINC Q4 
+ /Q0* /Q1*/Q2*/Q3*DEC ;DEC Q4 
CSX s= CSX s;CHIP SELECT 
c+: WR* /X0* PUSH *TO RAM DEVICE 
+ /WR* /X0* POP ; 
WR := WR :WRITE CONTROL 
sche /WR* /CSX*/X0* PUSH TO RAM DEVICE 
+ WR*/CSX* X0* PUSH ; 
+ /WR*/CSX* X0* POP ; 
+ WR*/CSX*/X0*/POP*/PUSH ; 
XO = XO ;CONTROL REGISTER 
zt: WR* CSX*/X0* PUSH ;FOR READ-WRITE 
+ /WR*/CSX* X0*/PUSH ;STATE MACHINE. 
+ /WR* CSX*/X0* POP ? 
+ /WR*/CSX* X0* POP ; 


e 
’ 


Figure 6. LIFO RAM Controller Design File (Continued) 
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SIMULATION 
TRACE ON 


RST 


PAL22RX8A 


CLK Q0 Q1 Q2 Q3 Q4 
/CSX /WR /X0 PUSH POP 


SETF /CLK RST /PUSH /POP 


CLOCKF CLK 

SETF /RST 

SETF PUSH 

FOR I := 1 TO 4 DO 
BEGIN 

CLOCKF CLK 

END 

SETF /PUSH 

CLOCKF CLK 

SETF PUSH 

FOR I := 1 TO 4 DO 
BEGIN 

CLOCKF CLK 

END 

SETF /PUSH 

CLOCKF CLK 

SETF POP 

FOR I := 1 TO 4 DO 
BEGIN 

CLOCKF CLK 

END 

SETF /POP 

FOR I := 1 TO 3 DO 
BEGIN 


CLOCKF CLK 
END 
TRACE_OFF 


; SIMULATION 

; SECTION. 

;TRACE ESSENTIAL 
;SIGNALS. 

;RESET PAL DEVICE. 
;CLOCK RESET TO 
;REGISTERS. 

;ENABLE PUSH INPUT. 
;PERFORM ONE PUSH 
;CYCLE. 


' 

;PUSH AND POP MUST 
;BE INACTIVE BETWEEN 
;CYCLES. SET PUSH 
; INACTIVE. 

;SET PUSH ACTIVE 
;AND PERFORM SECOND 
;PUSH. 

;SET PUSH INACTIVE 
:;AND CLOCK DEVICE 
;THEN SET POP 
sACTIVE. PERFORM 
;POP CYCLE. 


=e te 


;SET PUSH AND POP 
; INACTIVE. CLOCK 
;SYSTEM FOR A PASSIVE 
;MODE. 


we me 


Figure 6. LIFO RAM Controller Design File (Continued) 
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Graphics systems are commonly used in most computers for 
displaying pictures and text. Functions of a graphics system 
include creation, manipulation and storage of picture databases 
and text. Usually, graphics systems also allow users to dynami- 
cally control a picture’s content, size or colors. 


Graphics systems are closely related to Image processing sys- 
tems. While graphics systems manipulate images which already 
exist as a database, image processing systems analyze images 
taken from cameras by converting them to a graphical database 
and then processing that database. The hardware used for these 
systems is very similar. In fact, it is so similar that many systems 
offer integrated graphics and image processing functions. 


In this section we will examine the possible uses of PLDs in a 
graphics and image processing system and their implementation. 
We will also determine which PLDs are suited for different 
applications in graphics and image processing systems. Laterwe 
will also see some design examples of PLD applications. 


Asimple graphics system consists of five components, as shown 
in Figure 1. 


The first component is the graphics processor, which draws the 
image (referred to as “image rendering”). It also provides an 
interface to the main system processor (also known as the host 
processor). The second component is the display controller, 
which is an interface between the graphics processor and the 
graphics frame buffer. The display controller is essentially a 
modified DRAM controller. The third component is the graphics 
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GRAPHICS BOARD 


DISK 
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SOFTWARE 


frame buffer, which is typically a memory that stores the image 
data. The data from the frame buffer is then serialized by data 
serializers which make up the fourth component of a graphics 
system. The data serializers are essentially modified shift regis- 
ters. The serial data is then converted to an analog signal suitable 
for driving the video monitors for display. This function is per- 
formed by the fifth and the final component of the graphics 
system, a digital-to-analog converter (DAC). 


Local Graphics Processor 


The graphics (or image) processor gets all the commands from 
the host processor. The interface between the graphics proces- 
sor and the host processor is through a bus (Figure 2), where 
commands and data bytes are transferred. Most of the popular 
standard bus interfaces are used for this function. Some of the 
commonly used bus interfaces are: 


e Multibus 

« VME 

¢ PC-bus 

¢ Nu-bus 

¢ Microchannel 


The bus interface performs three major functions: 
¢ Bus arbitration 


¢ Bus contro} 
¢ Address decoding 





MONITOR 










Figure 1. Components of Graphics System Hardware 
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There are two major types of buses, synchronous and asynchro- 
nous. For synchronous buses, the bus arbitration and control 
functions can be effectively performed by fast registered PLDs 
such as the PAL16R8/6/4, PAL20R8/6/4 and PAL22V10. For 
asynchronous buses, the arbitration and control functions canbe 
provided by combinatorial PLDs, or by an asynchronous PLD (the 
PAL20RA10 or AMPALC29MA16). The details for designing bus 
interface functions are given in aseparate section on page 2-325. 
The address decoding function is performed most effectively by 
a combinatorial PAL16L8, PAL20L8 or PAL22V10. Address de- 
coding is discussed along with other combinatorial functions in 
the section on page 2-35. 






LOCAL GRAPHICS 
PROCESSOR 






VME BUS 


SYSTEM BUS MULTI BUS-II 
PC BUS 
NU BUS 
HOST 
PROCESSOR 


Figure 2. System Bus Interface 
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The local graphics processor is used to interpret the database of 
the graphical image (“display list”), down-loaded from the host 
processor. The graphics processor interprets the display list into 
objects with geometrical entities. It also performs coordinate 
transformation functions for recalculating object size as the object 
moves or viewer perspective changes. The last function per- 
formed by the graphics processor is the actual image rendering 
(drawing) functions of drawing arc/vectors, moving blocks of 
image data (bit-blit), and character manipulation for handling 
large chunks of text. 


In an image processing system the local processor performs the 
role of an image processing engine. The image processing 
software tasks are very different from the graphics tasks. Image 
processing tasks include interpreting the listof commands sent by 
the host and executing required functions such as image en- 
hancement, pattern recognition, or computer vision. The most 
intensive task is image enhancement which requires generating 
image transforms in their spatial or frequency domain. Generat- 
ing transforms requires a large number of multiplication opera- 
tions. Typical image processing systems use either a special 
microprocessor or a floating point multiplier for this purpose. 
PLDs can also be used to interface this multiplier to the local 
microprocessor. Some image processing tasks are very uniform 
and repetitive, and dedicated hardware can be easily designed to 
speed up such tasks in the interest of overall system perform- 
ance. Such dedicated hardware can be readily made with PLDs. 


The Display Controller 


For implementing graphics functions the local processor must be 
able to write to the frame buffer (draw an image). Similarly for 
implementing image processing functions the local processor 
must be able to read from and write to the frame buffer. The 
reading and writing of frame buffer data is possible with the help 


of the display controller, as shown in Figure 3. The display 
controller is essentially a timing generator which performs three 
major functions: 


« DRAM timing 
* Monitor control signal timing 
* Screen refresh 


The display controller generates the read and write cycle timings 
as well as the refresh timing of the DRAMs. The read and write 
cycles are initiated by the local graphics processor. These 
functions are identical to a regular DRAM controller discussed in 
the section on memory control (page 2-179). The second function 
required for the display controller is the generation of monitor 
control signals. This function is usually integrated with the DRAM 
controller, although it can also be implemented separately. 
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Figure 3. The Display Controller 


The controller needs to generate the DOTCLK signal which 
determines the rate at which datais sentto the monitor. The video 
monitor synchronization signals HSYNC for every new line, and 
VSYNC for every new frame (Figure 4) are generated by dividing 
this DOTCLK signal. Another signal, BLANK, asserted only when 
the pixels are being displayed, is also generated by this controller. 
The HSYNC, VSYNC and BLANK signals not only drive the 
monitor, but are also used as synchronization signals by the 
subsequent stages of the graphics system. 


In an image processing system, the HSYNC and VSYNC signals 
are generated by the camera which captures the image. The 
internal DOTCLK signal is a multiple of these HSYNC and 
VSYNC signals and is generated by using a phase locked loop 
(PLL). 


The third function of the display controller is to refresh the screen. 
The controller automatically reads data in sequential addresses 
from the frame buffer at fixed time intervals, and sends it to the 
monitor at a rate determined by the DOTCLK signal. The 
sequential addresses are also generated by the display controller 
as shown in the example on page 2-261. Each chunk of data is 
called a pixel and is displayed on the screen as a dot. 


Registered PLDs can easily implement these display control 
functions. As this would be a fast state machine, the range of 
options includes the PAL32VX10/A, PAL16/20R8/6/4, 
AmPAL23S8, and the PLS105/167/168. An example for a line 
sync generator (HSYNC), and another for a frame sync generator 
(VSYNC), are discussed later on pages 2-263 and 2-265 respec- 
tively. 
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Figure 4. A Raster Scan Display Timing Control 


The Frame Buffer 


The frame buffer (Figure 5) is traditionally made from dynamic 
RAMs (DRAMs). The DRAMs provide cost effective means for 
storing image data. Graphics systems require two ports for 
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Figure 5. The Frame Buffer 
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reading and writing the frame buffer data. The first port is for the 
display update to and from the graphics processor; the second 
port is for the screen refresh to the monitor. 


The performance of agraphics system is measured by the screen 
update bandwidth. This is the rate at which the image data can 
be written to the frame buffer. The DRAMs can be organized in 
various ways to provide access to multiple bits of image data 
simultaneously, which can increase update bandwidth. Another 
related factor is the screen resolution, which determines the rate 
at which pixels have to be sent to the monitor in order to provide 
a flicker free display. The table in Figure 6 shows this pixel clock 
(DOTCLK) rate required for various screen resolutions. Varying 
DRAM organizations can also be used to meet this screen 
resolution bandwidth. 


RESOLUTION | FREQUENCY | PIXEL CLOCK (DOTCLK) 


512 X 512 25 MHz 
640 X 480 25 MHz 
50 MHz 
100 MHz 
125 MHz 


720 X 350 
1000 X 1000 
1280 X 1000 





Figure 6. Various Screen Resolutions and 
Bandwidth Required 
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Figure 7. A Video RAM 


Video DRAMs (VRAMs) are specialized memories which provide 
two ports. The VRAMs (Figure 7) include a built-in serializer 
capable of supporting high data bandwidths. These serializers 
are used to support high bandwidth requirements of the monitors. 
The VRAMs require extra control signals which are typically 
generated by the display controller. 


The performance of a graphics system is also determined in part 
by the frame buffer organization. There are two main types of 
organizations used in the industry. 


* Bit-plane mapped 
° Pixel-packed mapped 


Bit-plane mapping allows the graphics processor to access single 
bits (also called planes) from a number of adjacent pixels simul- 
taneously. The pixel-packed organization allows the graphics 
processor to access all the bits of one pixel simultaneously. The 
former was used in the monochrome type of displays whereas the 
latter is increasingly being used in the modern color displays. 


For high performance and large screen resolution systems, 
multiple pixel access is possible only with custom DRAM organi- 
zations, which require a custom display controller design. Also, 
different frame buffer organizations (bit-plane or pixel-packed), 
and the special timing requirements of VRAMs, further reduce the 
application of dedicated VLSI display controllers. PLDs provide 
an easy means of designing custom display controllers and are 
often used in such systems. 


Serializers 


The function of the serializer is to accept multiple pixels of parallel 
data, as sent by the display controller at slow speeds, and 
transform them into a serial pixel data stream. The serializer is 
required even for VRAMs, which can only support 25 MHz 
bandwidths. The serializer speed requirements are based on 
monitor resolution and are shown in Figure 5. 


The serializer is synchronized by the monitor control signals 
HSYNC, VSYNC and BLANK, which are generated by the display 
controller, Aserializer is essentially a shift register, similar to the 
one discussed earlier on page 2-90. Certain modifications are 
added to accommodate the specific design requirements of a 
graphics system. These include synchronization with the display 
contro} signals. 


Many standard video serializers (679501, 679502, and Am8177) 
are available, which can be used for standard DRAM organiza- 
tions, Fornon-standard organizations, custom serializers can be 
made with PLDs. Sometimes graphics-related functions, which 
are not available in standard serializers, can also be added to 
such custom serializers. For example, a ZOOM function can be 


easily implemented in a custom serializer by having it repeat the 
same pixel datatwo or more times. An example of aseven bit shift 
register used as a serializer is shown on page 2-271. This 
serializer has the capability of providing reverse video, where it 
inverts the outgoing pixel data stream. 


In certain character-based terminals, a dedicated DRAM or 
VRAM based frame bufferis not used. The screen characters are 
stored in a static RAM and are sent to the monitor through a font 
RAMorROMwhich stores the displayed shape of each character. 
This font RAM or ROM is also known as the character generator. 
The data from the font RAM constitutes the pixels which are sent 
to the monitor via serializers. One such application of a dual port 
serializer with changeable fonts is illustrated on page 2-275. 


PLDs which allow fast state machines are ideal candidates for 
such serializers. Due to the very high speed requirements, the 
most suitable PLDs are the registered “D" PAL devices at 55 MHz 
or the registered ECL PAL devices running at 125 MHz speed. 
Sometimes buried registers are also very useful for state machine 
designs. The serializer application on page 2-275 uses a 
PAL32VX10A as it provides buried registers for state machine 
functions. 


Look Up Table and Digital to Analog 
Converter 


The last stage of agraphics system is the look-up table (LUT) and 
digital-to-analog converter (DAC). The numbers of simultane- 
ously displayable colors or the gray scales is determined by the 
number of bits in a pixel. The LUT is used to increase the number 
of possible colors or gray scales that can be displayed on the 
monitor. 


The LUT (Figure 8) is afast RAM addressed by the pixels from the 
serializers. The data stored in the RAMs is used as the color or 
gray scale value. Since the data speeds can be very fast (as little 
as 8 ns for a 125 MHz display) and the static RAM access times 
are relatively slow, multiple static RAMs are used to store identical 
copies of look up tables. Different static RAMs are addressed for 
sequential pixels in order to maintain a fast overall data rate. 
Controllers made with such fast PAL devices as the 

PAL10H20EV8 or PAL16R8/6/4D can be used to spread sequen- 
tial pixels between various static RAMs for this function. The LUT 
output is then routed to the DAC stage of the circuit. 


The DAC is the only analog portion of the circuit, and is used to 
convert the color or gray scale data into an analog format usable 
by the video monitors. There are many standard monolithic and 
hybrid solutions available for such functions. 
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Figure 8. The Lookup Table 
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This small system video controller is designed for an 80 character 
wide terminal display of 24 rows (Figure 1). The display resolution 
is 640 X 400 pixels, with a character box of 8 X 16 pixels. The 
character font used is a dot matrix of 7 X 9 pixels which is stored 
in acharacter PROM. Thefirst seven horizontal pixels display the 
character while the eighth pixel provides the horizontal character 
separation. Similarly the first nine lines display a visible character 
while the remaining seven are blank to provide line separation. 
The frame buffer is an SRAM which stores character ASCll codes 
of 80 X 24 characters and can be addressed by the system 
processor for update or by the display controller. 


The display controller provides all the necessary timing and 
addressing for supporting this display. Synchronized to the 
monitor timing signals, it generates the row and column ad- 
dresses (Figure 2), of the character being scanned. The row and 
column addresses access the character ASCII code from the 
frame buffer at the instant the character is displayed. This ASCII 
code addresses the character PROM which stores the 7 X 9 dot 
matrix font. Suppose the letter "A” is to be displayed (Figure 3). 
The contents of the character PROM pertaining to the current 
character row and column are read and loaded into the video shift 
register to be clocked out as a video signal. The video shift 
register must be seven-bits long, and the entire character is 
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Figure 1. A Character Display 
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displayed over nine lines. To display a character 'A’, the ASCII 
code of '41’ hexadecimal selecting the letter is read from a frame 
buffer location. This character code will point to the base location 
inthe PROM. This base location holds the first line of a table of 
nine locations which hold the dot information for each scan line. 
Theline address counter first selects the RO line address, then the 
next line address R1 and then R2 to R8, until the entire character 
is displayed. 


The character column, row addresses and the line address can 
be generated by what is essentially one big counter. As can be 
seen from Figure 1, there are eight pixels in acolumn, 80 columns 
in aline, and 16 lines in a row. The characters are counted by 
dividing the pixel clock (DOTCLK) by eight, which is the number 
of pixels in acharacter box. The character column addresses are 
generated by a seven bit counter which counts up to 80 character 
columns. For every eighty columns of character addresses a line 
address (a four-bit counter) is incremented. And finally for every 
16 line addresses a character row address is incremented. This 
counter is implemented in two different devices which also gen- 
erate related monitor timing signals. 


The Implementation 


The functions of the display controller (Figure 4) are divided into 
three portions. The related tasks of generating the system 
HSYNC and BLANK signals and character column addresses are 
performed by a line sync generator. The tasks of generating the 
VSYNC signal and the line address and character row address 
are performed by the frame sync generator. Finally, the task of 
generating character period reference from the DOTCLK has 


LINE 0 
0001000 








been conveniently combined with the video shift register, which 
also uses this reference to shift out appropriate font pixels. The 
character period reference is also used by line sync and frame 
sync generators to count number of characters. 


The sequential frame buffer column address MAO-MAG for dis- 
playing these 80 characters per row, is generated by the line sync 
generator (IC2 in Figure 4). The line sync generator is a counter 
clocked by DOTCLK with a count enable function under the 
control of a character period reference. This enables it to count 
the number of character periods. The line sync generator also 
generates HSYNC and BLANK signals at the end of each line. 


The character display is divided into 24 rows of 16 lines each. The 
four-bit line address (RO-R3) is used by the character PROM to 
generate the appropriate bits of font for the different lines in a 
character row. The line address to the character PROM is 
generated by the frame sync generator (IC3), using the BLANK 
signal as anewline reference. For every newline (BLANK signal) 
the line address counter is incremented. The frame sync genera- 
tor also generates the five-bit row address (A4—A8) for the 24 
different rows of characters stored in the frame buffer. This 
address is generated by acounter which is incremented once for 
every 16 counts of the line address counter. 


The final component is the video shift register (IC1) which 
serializes the character PROM seven-bit character line data. This 
function is accomplished by using a three-bit counter which 
divides the DOTCLK frequency by eight to generate the character 
period reference. 


LINE 1 LINE 2 
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0100010 
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Figure 3. Character Generation 
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Line Sync Generator 


The PAL32VX10 used for line sync generation (Figure 5) has a 
versatile macrocell structure capable of transforming a basic D- 
type flip-flop into a more complex T-type flip-flop which efficiently 
implements counter designs. The seven-bit character column 
address counter uses seven of these T-type flip-flops. The line 
syne (HSYNC) signal is generated based on the value of the 
counter. The HSYNC signal initiates line flyback which separates 
the many lines of which a visual display is composed. The device 
also generates the blanking signal for monitor control. This 
design has been developed into state machine logic contained in 
the PAL32VX10. 
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Addressing the Frame Buffer 


The frame buffer address lines MAO-MA6 are shared by a host 
processor which updates the memory with new characters. 
These addresses are generated by a binary character period 
counter. The only system clock conveniently available is the 
DOTCLK. Since the counter has to keep track of character 
periods. One easy way is to clock the device with DOTCLK, but 
enable count only when all three bits of the pixel counter S2, S1 
and SO are high. These three bits high indicate a character 
boundary; this will effectively result in clocking the counter every 
character period. 
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Figure 4. Small System Video Controller 
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Figure 5. Schematic Diagram of Line Address 
Generator Circuit 


The count is extended up to 107 to provide a timing reference to 
generate the monitor contro! signals (HSYNC and BLANK) also. 
Seven registers in the PAL32VX10 provide this count. The 
registers are programmed as T-type flip-flops (Figure 5) for opti- 
mizing the number of product terms used. For each scanline after 
reaching 107 the T-type counters are reset to provide the start of 
the next active line. Of the ten registered cells in the PAL32VX10, 
nine registers are used for addressing memory (MAO-MA6), line 
sync generation (HSYNC) and blanking control (BLANK). 


Since the processor can not read or write to the frame buffer at the 
same time as the display controller, the address lines (MAO-MA6) 
of the display controller arethree-stated during aprocessorcycle. 
When processor signal /WR is asserted LOW, indicating a write 
cycle, the MAO-MA6 address outputs are three-stated to allowthe 
processor address bus to drive the memory address. 


The line blanking signal (BLANK) is designed to go active low 
after 80 character cycles. This is followed by the line sync signal 
(HSYNC), which goes active eight character cycles later. This 
gives an eight character back porch to the display (see Figures 6 
& 7). The line flyback time (HSYNC width) of 16 blank characters 
ensures adequate time for the CRT line circuitry to stabilize 
before starting the next active line scan. The blanking signal is 
then turned off three character periods later to provide a three 
character front porch to the display. The duty cycle for one scan 
line is (80+8+16+3)=107 character periods. 


Selecting a DOTCLK of 16 MHz, and acharacter of eight bits, the 
resulting 2 MHz character cycle can be divided downto aline duty 
cycle of: 


HSYNC Line frequency = Character freq/Line duty cycle 
= 2000 KHz /107 
= 18.7 KHz 


Video data in the frame buffer can be updated by the processor 
only for durations when video is not displayed. A BLANK signal 
is a good indicator to the processor for when to begin and stop 
updates. Most processors, however, need an advance warning, 
preferably a few microseconds earlier. 


The RDY signal is used to provide this advance warning to the 
processor. When HIGH, the RDY signal allows data modification 
inthe frame buffer. This flag is turned off before the removalofthe 
blanking pulse (BLANK) allowing a margin of four micro-seconds 
to elapse at the end of the blanking period. Itis assumed that four 
microseconds is ample time to test the flag and perform the last 
write operation. However, the time taken for a processor to read 
this flag and respond can be programmed into the PAL device 
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depending upon the software requirements of the system. The 
RDY signal is also set during vertical retrace when all processor 
updates are allowed and no display controller accesses are 
required. The design uses the system VSYNC for accomplishing 
this. 


The RDY signal is also disabled unless it is being read by the 
processor. This is done because the RDY signal is usually 
directly connected to the processor data bus which is used to 
carry data from different sources (drivers). The disabling is done 
using the CS chip-select signal asserted by the processor. If CS 
is driven LOW, the RDY output buffer is enabled and the RDY flag 
may be read by the processor. 
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Frame Sync Generator 


Another PAL32VX10 (Figures 8 & 9) is programmed to perform 
the function of a frame sync generator (VSYNC), with the ability 
to access the character PROM and the frame buffer. The VSYNC 
signal controls frame flyback for the CRT controller. Each frame 
of displayed data is terminated, and the electron beam inthe CRT 
performs a retrace operation prior to the display of the next frame. 
Clocked by the system BLANK signal, it uses a counter to 
generate the row address for the character being displayed. It 
also generates the line address to access the dot pattern for each 
scan line, stored in character PROM. 
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Figure 6. Blanking and Sync. Signals. 
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TITLE LINE SYNC GENERATOR. 
PATTERN 02. 

REVISION 02. 

AUTHOR CHRIS JAY. 

COMPANY MMI SANTA CLARA, CA. 
DATE 20TH AUGUST 1986. 


’ 

;THE PAL32VX10 HAS BEEN DESIGNED AS A LINE SYNC GENERATOR FOR A 
;THREE PAL SOLUTION TO A VIDEO DISPLAY CONTROLLER. THE DEVICE 

;HAS BEEN PROGRAMMED TO GENERATE ADDRESSING TO ACCESS A CHARACTER 
;RAM MAO - MA6 ARE CONFIGURED TO ACCESS ASCCII CHARACTERS STORED IN 
;THE RAM FOR AN EIGHTY CHARACTER WIDE DISPLAY. THESE ADDRESS OUTPUTS 
;MAY BE PUT INTO A THREE STATE CONDITION IF THE HOST PROCESSOR 
;REQUIRES TO CHANGE THE CONTENTS OF THE DISPLAYED MEMORY. WITH 

;THE /WR INPUT ACTIVE LOW THE HOST PROCESSOR MAY DRIVE THE 

;ADDRESS INPUTS TO ACCESS AND MODIFY THE CONTENTS OF THE VIDEO RAM. 
;THE INTERFACE SIGNALS TO THE VIDEO SHIFT REGISTER (PAL DESIGN 01 
;OF THE THREE PAL DESIGNS) ARE S0,S1,S2, AND BLANK. WHEN S0*S1*S2 
7;ARE HIGH THE ADDRESS OUTPUTS MAO - MA6 INCREMENT SYNCHRONOUSLY 

;AS A RESULT OF THE DOT CLOCK. S0,S1 AND S2 ARE STATE MACHINE 
;OUTPUTS FROM THE VIDEO SHIFT REGISTER THAT CONTROL THE LOADING 
;AND SHIFTING OF PIXEL DATA. THE BLANK OUTPUT IS ACTIVE HIGH AND 
7IS FED TO THE VIDEO SHIFT REGISTER TO PROVIDE A BLANKING SIGNAL 
;DURING LINE SYNC FLYBACK WITH A MARGIN FOR FRONT AND BACK PORCH. 
7A RDY REGISTER IS AVAILABLE TO INDICATE THAT THE SCREEN IS BLANK 
;DURING LINE AND FRAME FLYBACK PERIODS AND VIDEO RAM MAY BE UPDATED. 
;THE OUTPUT IS 3-STATED AND IS ENABLED BY AN ACTIVE /CS INPUT. 


; 
CHIP VIDEO 1 PAL32VX10 
7LINE SYNC GENERATOR PAL 


7PINS 1 2 3 4 5 6 


CLK /RST so $1 $2 /VSYNC 
?PINS 7 8 9 10 ll 12 

NC NC Nc NC /WR GND 
;PINS 13 14 15 16 17 18 

/CS BLANK MAO MA] MA2 MA3 
;PINS 19 20 21 22 23 24 

MA4 MAS MA6 HSYNC RDY vcc 


GLOBAL QO Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 
;STRING DECLARATIONS. 


STRING HEX6B 1Q7*Q6*/Q5*Q44*/Q3*Q2*Q1' .  }CLEAR ADDRESS COUNTER 
;AND TURN BLANK OFF 
STRING HEX67 *Q7*Q6*/Q5*/04*Q3*Q24Q1' ;TURN LINE SYNC OFF 
STRING HEX63 *07*Q6*/Q5*/04*/Q3*Q2*Q1! ;TURN RDY OFF 
STRING HEX57 *Q7*/Q6*Q5*/Q4*Q3*Q2%Q)' ;TURN LINE SYNC ON 
STRING HEX50 *Q7*/Q6*Q5*/Q4%/Q3*/Q2*/Q1' ;TURN BLANK SIGNAL ON 
STRING CEI *S0*S1*S2' ;COUNT ENABLE INPUT 
EQUATIONS 
GLOBAL.SETF = RST ; ASYNCHRONOUS RESET 
;J - K EQUATION FOR 
/Q0 1 /Q0 ;BLANK OUTPUT. 
it: /Q0*HEX50*CEI ;HEX50 = J. BLANK ON 
+ QO*HEX6B*CEI ;HEX6B = K. BLANK OFF 
BLANK = Q0 ;ASSIGN QO TO BLANK PIN 
BLANK.CMBF = GND ;ENABLE REGISTERED OUTPUT 
/Q1 t= /Ql ;COUNT EQUATION CEI = T 
it: CEI ;TOGGLE. HEX6B*CEI = K 
+ Q1*HEX6B*CEI ;TO RESET MAO AFTER ACTIVE 
MAO = Ql ;LINE. ASSIGN MAO TO Ql 
MAO.CMBF = GND ;REGISTER. 
MAO.TRST = /WR ;ENABLE THREE-STATE 
;DURING MPU/RAM ACCESS. 
/Q2 t= /Q2 ;COUNT EQUATION Q2 
it: Q1*CEI . ;Q1*CEI = T FUNCTION 
+ Q2*HEX6B*CEI ;HEX6B*CEI = K 
MAL = Q2 ;ENABLE Q2 REGISTER TO 
MAL.CMBF = GND ;MA1 OUTPUT. CONTROL 
MAl.TRST = /¥R ;THREE-STATE DURING 


;MPU/RAM ACCESS. 
Figure 7. Design File of Line Syne Generator 
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/Q3 75 /Q3 
rt: Q2*Q1*CEI*/Q7 
+ Q2*Q1*CEI*/Q6 
+ Q2*Q1*CEI*Q5 
+ Q2*Q1*CEI*/0Q4 
+ Q2*Q1*CEI*Q3 
MA2 = Q3 
MA2.CMBF = GND 
MA2.TRST = /WR 
/Q4 2= /Q4 
tts Q3*Q2*Q1*CEI 
+ Q4*HEX6B*CEI 
MA3 = Q4 
MA3.CMBF = GND 
MA3.TRST = /WR 
/Q5 25 /Q5 
i+: Q4*Q3*Q2*Q1*CEI 
+ Q5*HEX6B*CEI 
MA4 7 Q5 
MA4.CMBF = GND 
MA4.TRST = /WR 
/Q6 25 /06 
tt: Q5*Q4*Q3*Q24*Q1 *CEI 
+ Q6*HEX6B*CEI 
MAS = Q6 
MAS.CMBF = GND 
MAS.TRST = /WR 
/Q7 2= /Q7 
it: Q6*Q5*Q4*Q3*Q24Q] *CEI 
+ Q7*HEX6B*CEI 
MA6 = Q7 
MA6.CMBF = GND 
MA6.TRST = /WR 
/Q8 25 /Q8 
r+: /Q8*HEX57*CEIL 
+ Q8*HEX67*CEI 
/HSYNC = /Q8 
/HSYNC.CMBF = GND 
/Q9 := /Q9 
t+: /Q9*HEX50*CEI*/VSYNC 
+ /Q9*CEI*VSYNC 
+ Q9*HEX63*CEI*/VSYNC 
RDY = Q9 
RDY.CMBF = GND 
RDY.TRST = cs 
SIMULATION 


TRACE ON CLK RST SO Sl S2 MAO MA1 MA2 
MA3 MA4 MAS MA6é HSYNC BLANK 


RDY /VSYNC 
SETF WR /VSYNC 
SETF /CLK RST SO Sl S2 /WR CS 
CLOCKF CLK 
SETF /RST 
PRLDF /Q0 /Ql1 /Q2 /Q3 04 
/Q5 /Q6 Q7 /Q8 /Q9 
FOR I := 1 TO 60 DO 
BEGIN 
CLOCKF CLK 
END 
SETF VSYNC 
FOR I := 1 TO 8 DO 
BEGIN CLOCKF CLK 
END 
SETF /S0 
CLOCKF CLK 
SETF /S1 
CLOCKF CLK 
SETF /S2 
CLOCKF 
TRACE OFF 





;COUNT EQUATION Q3 
3Q2*Q1*CEI = T FUNCTION 
;COUNT ENABLE UNTIL 
;END OF LINE BLANKING 
;PERIOD. DE MORGAN OF 
7EQU Q7*Q6*/Q5*Q4*/Q3=K 
3;/Q9 = RESET FUNCTION. 
;ASSIGN Q3 TO MA2 
;REGISTERED OUTPUT. 
;CONTROL O/P WITH /WR 
;COUNT EQUATION Q4 
7Q3*Q2*Ql*CEI = T. 
;Q4*HEX6B*CEI = K. 
;ASSIGN Q4 TO MA3. 
;ENABLE REGISTER O/P. 
;THREE - STATE CONTROL. 


;COUNT EQUATION Q5. 
7Q4*QO3*Q2*Q1*CEI = T. 
;Q5*HEX6B*CEI = K. 
;ASSIGN Q5 TO MA4. 
;ENABLE REGISTER O/P. 
;THREE - STATE CONTROL. 
;COUNT EQUATION Q6. 
7Q5*Q4*Q3*Q2*Q1*CEI = T. 
7;Q6*HEX6B*CEI = K. 
;ASSIGN Q6 TO MAS. 

; ENABLE REGISTER O/P 
;THREE - STATE CONTROL 


7;COUNT EQUATION Q7. 
7Q6*Q5%*Q44*QO3*Q2*Q14CEI =T. 
;Q7*HEX6B*CEI = K. 

7ASSIGN Q7 TO MA6. 

;ENABLE REGISTER O/P 
;THREE - STATE CONTROL 


;J - K EQUATION FOR Q8 
;HEX5S7*CEI = J. 
7;HEX67*CEI = K. 
;ASSIGN Q8 TO HSYNC. 
;ENABLE REGISTER O/P 
;J - K EQUATION FOR Q9 


7RDY STATUS 

7 FLYBACK.HEX50*CEI = Jl 
;AND FRAME = J2 
;HEX63*CEI = Kl RESET 


;AFTER LINE AND FRAME 
;ASSIGN RDY TO Q9 
;ENABLE REGISTER O/P 
;WHEN /CS = LOW RDY 
;OUTPUT IS ENABLED. 
7START OF SIMULATION. 
;TRACE ESSENTIAL I/P 
;AND O/P SIGNALS. 


;TEST THREE ~ STATE 
;CONTROL. INITALISE 
; INPUTS. PERFORM 
;RESET. AND PRELOAD 
;REGISTERS. 


7APPLY 60 CLOCK PULSES 
;TO TRACE THE MEMORY 
;ADDRESS COUNTER, HSYNC, 
;BLANK AND CEO. 


;TEST THAT AN ACTIVE 
; FRAME WILL SET THE 
;RDY FLAG. 

;DISABLE COUNTER BY 
;SETTING S0O,S1 AND S2 
; LOW. 


me se te 


7END OF SIMULATION. 


Figure 7. Design File of Line Sync Generator (Cont’d.) 
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Figure 8. Schematic of Frame Sync Generator 


The PAL device is programmed as acounter. The least significant 
four bits count through sixteen line addresses RO-R3 for each 
character row. The higher significant bits of the counter track the 
24 rows of characters. The frame sync pulse is generated on the 
25th character row and frame flyback is turned on for sixteen line 
count periods, which is the duration of the twenty-fifth row. With 
24 character rows, each having sixteen lines, the system gave 
384 visible lines. Since the frame flyback occurs over sixteen 
lines, the total number of lines in the system is (384+16)= 400. 
The frame repetition rate is derived by dividing the line frequency 
by the total number of lines in the system. 


Frame repetition rate = 18.7 X 10° HZ/400.......(1). 
= 46.75 Hz 


The video terminal used for testing the final design functioned 
best at this frame repetition rate. If desired, other frame repetition 
rates can also be selected. For example athirteen character row 
with nine rows active andthe same 7 X 9 character box would give 
a 57 Hz frame rate. Most video terminals have a synchronizing 
circuit that determines the choice of frame frequency. This is 
similar to a phase-locked loop, providing a window of acceptable 
frame oscillator frequencies to which the display will be locked. 
Outside of this capture range, the display will not be synchro- 
nized, and the result will look like a rolling unstable picture. 


Signal Assignments 


The /VSYNC output is an active LOW signal. When displaying 
data this output is driven HIGH. As with the line sync generator 
design, the /WR input is driven from the host processor system. 
When LOW, the video memory address outputs MA7—MA11 are 
disabled. The host system may enable its output buffers and drive 
the frame buffer’s address inputs directly to modify video data. 


For normal counting, the CEl input should be driven LOW. ATST 
input was provided to minimize the number of test vectors 
required to test the device. This pin is driven LOW for normal 
operation. 
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TITLE FRAME SYNC GENERATOR. 
PATTERN 02 

REVISION 02. 

AUTHOR CHRIS JAY. 

COMPANY MMI SANTA CLARA, CA. 
DATE 20TH AUGUST 1986. 


. 
, 


CHIP VIDEO _1 PAL32Vx10 


;FRAME SYNC GENERATOR PAL 


, 


:;THE PAL 32VX10 IS THE THIRD OF A THREE PAL SYSTEM DESIGNED 
;AS A SIMPLE VIDEO CONTROLLER. THE DEVICE GENERATES ADDRESSING 
;FOR A CHARACTER MEMORY AND CPU RAM. ALSO A FRAME SYNC PULSE 
;/VSYNC IS USED TO PROVIDE FRAME FLYBACK AT THE END OF ONE 
;COMPLETE FRAME. THERE ARE FOUR ROW OUTPUTS FROM THE PAL WHICH 
;ADDRESS THE CHARACTER MEMORY. THESE OUTPUTS ARE DRIVEN FROM 
;AN INTERNAL COUNTER WHICH IS INCREMENTED AFTER EACH LINE 
;SCAN. THE RO,R1,R2 AND R3 INCREMENT AS A SIXTEEN BIT COUNTER 
;ADDRESSING CHARACTER MEMORY. MA7 - MAl1 ADDRESS VIDEO MEMORY 
;ACCESSING THE ASCII CODES OF THE CHARACTER TO BE DISPLAYED 
;ON EACH ROW OF THE VIDEO DISPLAY. THE COUNTER MA7 - MA11 

;IS INCREMENTED AFTER ONE COMPLETE CHARACTER SCAN THAT IS 
;SIXTEEN LINES. THE /WR INPUT CAN BE DRIVEN BY THE HOST CPU 
;TO 3 - STATE MA7 - MAl1 OUPUTS AND DYNAMICALLY CHANGE THE 
;CONTENTS OF THE VIDEO RAM. THIS INPUT CAN BE MAPPED INTO 
;HOST PROCESSOR'S MEMORY ADDRESS RANGE. UPDATING RAM SHOULD 
;BE DONE BY THE HOST DURING LINE AR FRAME FLYBACK. THE PAL 
;DEVICE MAY BE RESET AT POWER ON AND /CEI SHOULD BE WIRED LOW 
;TO ENABLE THE ADDRESS COUNTERS. TST IS HARDWIRED LOW, THIS 
;PIN WAS USED AS A TEST INPUT PULLED HIGH TO DISABLE SOME OF 
;LOWER ORDER COUNTER REGISTERS WHEN GENERATING TEST VECTORS 
;OF THE TEST VECTORS. THIS AVOIDED GENERATING UNECESSARY 
;REPETATIVE LOWER ORDER COUNTS WHEN TESTING THE COUNT 

; EQUATIONS FOR HIGHER ORDER REGISTERS IN THE COUNTER 


;CHAIN. 

;PINS 1 2 | 4 5 6 
CLK /RST /CEI NC NC NC 

;PINS 7 8 9 10 11 12 
NC NC NC /WR TST GND 

;PINS 13 14 15 16 17 18 
NC MA? MA8 MA MA10 MA11 

7PINS 19 20 21 22 23 24 
/VSYNC RO Rl R2 R3 vcc 


GLOBAL QO Q1 Q2 Q3 Q4 Q5 Q6 Q7 O8 Q9 
;STRING VARIABLE DECLARATIONS FOR A NON-INTERLACED SYSTEM. 


STRING HEX18NI 'Q9*Q8*0Q7*Q6*Q4*Q3*/Q2%/Q1%*/Q0' ;SET VSYNC 


STRING HEX19NI 'Q9*Q8*07*Q6*04*03*/Q2*/Q1*Q0' ;CLEAR VSYNC 
STRING ROWNI *Q6*Q7*QB*Q9 ! ;ROW COMPLETE 
EQUATIONS 
GLOBAL.SETF = RST ;RESET AT POWER UP 
7Q0 := /Q0 ;COUNT EQUATION. 
+: CEI*ROWNI ;TOGGLE WHEN CEI*ROWNI TRUE 
+ CEI*HEX19NI ;RESET AFTER FRAME SCAN. 
MA7 = Q0 ;ENABLE MA7 OUTPUT 
MA7.CMBF = GND ;SET REGISTER MODE 
MA7.TRST = /WR ;THREE - STATE O/P WHEN /WR 
;ACTIVE. 
/Ql := /Q1 ;COUNT EQUATION Ql 
+ QO*CEI*ROWNI*/Q5 ;ENABLE TOGGLE 
MAB = Ql ;ENABLE MA8 OUTPUT. 
MA8.CMBF = GND ;SET REGISTER MODE. 
MA8.TRST = /WR ;THREE - STATE O/P WHEN /WR 
;ACTIVE. 


Figure 9. Design File of Frame Sync Generator 
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/Q2 2 /Q2 
sts: Q1*Q0*CEI*ROWNI 
+ Q2*HEX1ONI*CEI 
MAQ = Q2 
MAS.CMBF = GND 
MAQ.TRST = /WR 
/Q3 = /Q3 
sts Q2*Q1*Q0*CEI*ROWNI 
+ Q3*HEXIONI*CEI 
MA10 = Q3 
MA10.CMBF = GND 
MA10O.TRST = /WR 
/Q4 25 /Q4 
sts Q3*Q2*Q1*Q0*CEI*ROWNI 
+ Q4*HEXI1ONI*CEI 
MA11 = Q4 
MA11.CMBF = GND 
MA11.TRST = /WR 
/Q5 7= /Q5 
3Fs /QS*HEXI8NI*CEI*ROWNI 
+ Q5*HEX1ONI*CEI*ROWNI 
/VSYNC = /Q5 
/VSYNC.CMBF = GND 
/Q6 2 /Q6 
$+ CEI 
RO = 06 
RO.CMBF = GND 
/Q7 := /Q7 
sks Q6*CEI*TST 
+ /Q7*/TST 
Rl = Q7 
R1.CMBF = GND 
/Q8 = /Q8 
st: Q6*Q7*CEI*TST 
+ /Q8*/TST 
R2 = Q8 
R2.CMBF = GND 
/Q9 := /Q9 
it: Q6*0O7*O8*CEI*TST 
+ /Q9*/TST 
R3 = Q9 
R3.CMBF = GND 
SIMULATION 
TRACE _ON CLK RST MA7 MA8 MA9 MA10 
MA11 /VSYNC RO R1 R2 R3 
SETF /CLK RST CEI /WR 
CLOCKF CLK 
SETF /RST /TST 
PRLDF Q0 Q1 Q2 /Q3 04 
/Q5 /Q6 /Q7 /Q8 /Q9 
FOR I := 1 TO 40 DO 
BEGIN CLOCKF CLK 
END 
SETF TST 
PRLDF Q0 Q1 Q2 /Q3 94 
/Q5 /Q6 /Q7 Q8 Q9 
FOR I := 1 TO 40 DO 
BEGIN CLOCKF CLK 
END 
SETF /CEI 
FOR I := 1 TO 8 DO 
BEGIN CLOCKF CLK 
END 
TRACE_OFF 


;COUNT EQUATION Q2 

;ENABLE TOGGLE 

;RESET AFTER FRAME SCAN 
;ENABLE MA9 OUTPUT. 

;SET REGISTER MODE. 

;THREE - STATE O/P WHEN. /WR 
sACTIVE. 

;COUNT EQUATION Q3 

;ENABLE TOGGLE 

;RESET AFTER FRAME SCAN 
;ENABLE MA10 OUTPUT. 

;SET REGISTER MODE 

;THREE - STATE O/P WHEN /WR 
;ACTIVE. 

;COUNT EQUATION Q4 

;ENABLE TOGGLE 

;RESET AFTER FRAME SCAN 
;ENABLE MA11 OUTPUT 

;SET REGISTER MODE 

;THREE ~ STATE O/P WHEN /WR 
;ACTIVE. 

iJ - K VSYNC O/P 

;J EQUATION TO SET VSYNC 

;K EQUATION TO RESET VSYNC 
;ENABLE Q5 TO VSYNC. 

;SET REGISTER MODE 


;COUNT EQUATION FOR Q6 

; TOGGLE 

;FRAME SYNC. ENABLE Q6 
;REGISTER OUTPUT FOR 

;ROW O. 

;COUNT EQUATION FOR Q7 
;ENABLE TOGGLE 

;DISABLE COUNT DURING TEST. 
;ENABLE Q7 REGISTER OUTPUT 
7;FOR ROW 1. 


;COUNT EQUATION FOR Q8 

; ENABLE TOGGLE 

;DISABLE COUNT DURING TEST. 
;ENABLE Q8 REGISTER OUTPUT 
;POR ROW 2. 


;COUNT EQUATION FOR Q9 

; ENABLE TOGGLE 

;DISABLE COUNT DURING TEST. 
;ENABLE Q9 REGISTER OUTPUT 
7FOR ROW 3. 


;SIMULATION SECTION 
;TRACE ESSENTIAL SIGNALS 


;SET INITIAL CONDITIONS 
;ENABLE COUNT ON 

;DISABLE ROW R1, R2, R3. 
;PRELOAD INITIAL STATE 

;TWO COUNTS BEFORE FRAME 
;SYNC. CLOCK INPUT TO VIEW 
;WAVEFORMS OF ROW 0, VSYNC 
;AND MEMORY ADDRESSES MA7 TO 
;MA11. 

;DISABLE TEST MODE, THE ROW 
;COUNTERS ARE ENABLED. 
;PROVIDE CLOCK INPUTS TO 
;GENERATE TEST VECTORS TO 
;VERIFY THE NORMAL COUNT 
;OPERATION OF THE ROW 
;ADDRESS COUNTER. 

;DISABLE COUNT MODE AND 
;APPLY CLOCK INPUT TO 
;VERIFY THE COUNTER 

;HOLD OPERATION. 


Figure 9. Design File of Frame Syne Generator (Cont’d.) 
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Video Shift Register 


The third component of this video system (Figures 10 & 11), 
provides a means of converting parallel data into a high speed 
serial data stream. The PAL82VX10 implements one such video 
shift register, where it receives parallel data from a character 
memory and clocks the serial data out at a rate determined by the 
pixel clock (DOTCLK). The conversion of parallel character data 
to a serial stream is synchronized to the character period of the 
system. The character period is generated by dividing the 
DOTCLK by eight and this circuit has been conveniently imple- 
mented in the same device. The design also offers blanking 
controls and a reverse video option. 


The PAL32VX10 contains ten registers, three of which are used 
as a binary counter which synchronously counts through eight 
states at a rate determined by the DOTCLK of the video system. 
This count to eight determines a character period reference since 
there are eight pixels (dots) in every character. The outputs 
S0-S2 are used by the rest of the system as a character period 
reference. 


Thethree-bit output S2,S1 and SO of the character period counter 
are also used internally as a timing reference for loading new 
character data. When all three S2, S1 and SO are high, the 
remaining seven registers of the device Q1-Q7 are loaded with 
the seven bit character data from inputs DO-D6. The registers 
used as a counter are Q8, Q9 and Q10. Q10, the least significant 
bit, is assigned to the output pin SO, Q9 is assigned to pin S1; and 
Q8 is assigned to pin S2. 


The “new character” load command is given as astring statement 
in the design specification: STRING LOAD ‘Q8*Q9°Q10’. 


This parallel load is followed by seven shift states when the data 
is transferred from one register to another. The binary sequence 
of 0-6 at counter S2-SO enable this synchronous data shift 
through registers Q1-Q7, Q7 being the last register in the chain. 
The VIDEO output of the PAL device is therefore assigned to the 
last register Q7. 


The shift command is developed from the DeMorgan expansion 
of NOT(LOAD): Q8 + Q9 + Q10. 


Additional Control Features 


The video shift register has a number of inputs which control the 
serial output of the video data stream. The BLANK input, when 
asserted, inhibits the video, causing it to go LOW independent of 
activity at the parallel data inputs. It is necessary to use the 
BLANK signal to override active video during line flyback or 
retrace. The VSYNC input also inhibits the loading of video data 
and suppresses video drive during frame flyback. 


The reverse video input REV can be used to create an inverse 
video display. For normal video this input is LOW. The characters 
displayed onthe video screen will be bright on adark background. 
When the reverse video control is set HIGH, the video data is 
inverted when itis loaded. The display background is then bright, 
and the displayed characters are dark. The BLANK and VSYNC 
controls have the same effect of driving the video output LOW 
during line and frame flyback whetheror not the video is reversed. 


THREE BIT BINARY COUNTER 


D1 LOAD SHIFT 
=a H{oIN ar hs 
> _DOUT __RSTp 
LOAD SHIFT a 


* ea 
i DOUT —s RST 
LOAD SHIFT a 
D3 is a 
DOUT RST 
LOAD SHIFT <i 
D4 DIN a4 
RST 
SHIF 
DS ee 
NS,  DOUT 
pe LOAD SHIFT Ea 
a6 
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LOAD SHIFT 
BF {DIN a7 
N DOUT RST 
REV 
BLANK 





VSYNC 


414 08 
Figure 10. Schematic of Video Shift Reglster 
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TITLE 
PATTERN 
REVISION 
AUTHOR 
COMPANY 
DATE 


VIDEO SHIFT REGISTER. 


01 
02. 


CHRIS JAY. 


MMI SANTA CLARA, 


CA. 


20TH AUGUST 1986. 


;THE PAL32VX10 HAS BEEN DESIGNED AS A VIDEO SHIFT REGISTER 

iTO SUPPORT A THREE PAL SOLUTION TO A SMALL VIDEO CONTROLLER 
;CIRCUIT. SEVEN INPUTS FROM THE CHARACTER GENERATOR ARE LOADED 
;INTO THE INTERNAL SHIFT REGISTER THROUGH INPUTS Dl - D7, 
;AND SUBSEQUENTLY SHIFTED THROUGH IT. AN INTERNAL THREE BIT 
;BINARY COUNTER CONTROLS THE LOAD AND SHIFT ACTIVITY. WHEN 
;THE BINARY OUTPUTS SO0,S1 AND S2 ARE ALL HIGH THE SHIFT 
;REGISTER IS LOADED WITH NEW DATA FROM THE CHARACTER MEMORY, 
;SHIFTING TAKES PLACE FOR ALL THE OTHER BINARY STATES. 

;THE DATA IS LOADED AND SHIFTED SYNCHRONOUSLY, AT A RATE 
;DETERMINED BY THE APPLIED CLOCK INPUT. THE DEVICE MAY BE 
;ASYNCHRONOUSLY RESET BY APPLYING A LOGIC LOW TO THE /RST PIN. 
;THERE IS A REV INPUT, WHICH CAUSES THE DATA TO BE REVERSED 
7FOR REVERSE VIDEO. THIS INPUT IS HELD LOW FOR NORMAL DATA, 
;THE BLANK AND /VSYNC CONTROLS ARE FEEDBACK INPUTS FROM THE 
;LINE SYNC AND FRAME SYNC PAL DEVICE OUTPUTS. THESE CAUSE THE 
;VIDEO DATA OUTPUT TO BE INACTIVE DURING LINE AND FRAME SYNC. 


;PERIODS. THE SO, 


Sl AND S2 OUTPUTS DRIVE THE LINE AND FRAME 


;SYNC PAL32VX10 AND ACT AS COUNT ENABLE INPUTS TO THOSE 


;DEVICES. 


CHIP VIDEO 1 PAL32VxX10 


;VIDEO SHIFT REGISTER PAL 


7PINS 1 


2 3 4 5 6 
CLK /RST Dl D2 D3 D4 
;PINS 7 8 9 10 11 12 
DS D6 D7? BLANK /VSYNC GND 
7;PINS 13 14 15 16 17 18 
REV NC NC NC NC NC 
;PINS 19 20 21 22 23 24 
NC VIDEO S2 $l so vcc 
GLOBAL Ql Q2 Q3 O04 Q5 Q6 Q7 QB QI Q10 
STRING LOAD 'Q8*Q9*Q10' ;ENABLE LOADING OF CHARACTER DATA. 
EQUATIONS ; 
GLOBAL.SETF = RST ASYNCHRONOUS RESET. 
/Q10 = Q10 ; INTERNAL STATE COUNTER 
so = Q10 ;LEAST SIGNIFICANT BIT Q10. 
SO.CMBF = GND ;ENABLE REGISTER TO PIN 
7S0. 
/Q9 = /Q9 ; INTERNAL STATE COUNTER 
+ Q10 7010 = HIGH = TOGGLE, 
sl = Q9 7 ENABLE REGISTER OUTPUT 
S1.CMBF = GND ;TO PIN Sl. 
/Q8 = /Q8 :; INTERNAL STATE COUNTER 
+ Q10*Q9 7Q10*09 = HIGH = TOGGLE. 
$2 = QB ; ENABLE REGISTER OUTPUT 
S2.CMBF = GND ;TO PIN S82. 
/Ql := /D1*LOAD* /REV* /BLANK* /VSYNC ;LOAD DATA INPUT Dl. 
+ D1*LOAD* REV* /BLANK* /VSYNC ;LOAD DATA INPUT /D1. 
+ /Q8*/REV ;SHIFT HIGH INTO Ql. 
+ /Q9* /REV ; 
+ /Q10*/REV : 
+ BLANK ;BLANK DATA DURING 
+ VSYNC ;LINE AND FRAME SYNC 


7 INPUTS. 


Figure 11. Design File of Video Shift Register 
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/Q2 = 
+ 
+ 
+ 
+ 
+ 
+ 
/Q3 25 
+ 
+ 
+ 
+ 
+ 
+ 
[QA4 35 
+ 
+ 
+ 
+ 
+ 
+ 
/Q5 := 
+ 
+ 
+ 
+ 
+ 
+ 
/Q6 z= 
+ 
+ 
+ 
+ 
+ 
+ 
/Q7 i= 
+ 
+ 
+ 
+ 
+ 
+ 
/VIDEO 


/VIDEO.CMBF 


SIMULATION 
TRACE_ON 


SETF 


SETF 


/D2*LOAD* /REV* /BLANK* /VSYNC 
D2*LOAD*REV* /BLANK*/VSYNC 
/Q1*/Q8 
/Q1*/Q9 
/Q1*/Q10 

BLANK 

VSYNC 


/D3*LOAD* /REV*/BLANK* /VSYNC 
D3*LOAD*REV* /BLANK* /VSYNC 
/Q2*/Q8 
/Q2*/Q9 

/Q2*/Q10 

BLANK 

VSYNC 


/D4*LOAD* /REV* /BLANK* /VSYNC 
D4*LOAD*REV* /BLANK* /VSYNC 
/Q3*/Q8 
/Q3*/Q9 

/Q3*/Q10 

BLANK 

VSYNC 


/D5*LOAD* /REV* /BLANK* /VSYNC 
DS*LOAD*REV* /BLANK* /VSYNC 
/Q4*/Q8 
/Q4*/Q9 

/Q4*/Q10 

BLANK 

VSYNC 


/D6*LOAD* /REV* /BLANK* /VSYNC 
D6*LOAD*REV* /BLANK*/VSYNC 
/Q5*/Q8 
/Q5*/Q9 
/Q5*/Q10 

BLANK 

VSYNC 


/D7*LOAD* /REV*/BLANK* /VSYNC 
D7* LOAD*REV* /BLANK* /VSYNC 
/Q6*/Q8 

/Q6*/Q9 

/Q6*/Q10 

BLANK 

VSYNC 

/Q7 

GND 


ion 


CLK SO Sl S2 

VIDEO /VSYNC BLANK 
REV Q1 Q2 Q3 Q4 Q5 
Q6 Q7 Dl D2 D3 D4 D5 
D6 D7 RST 

/CLK RST REV 

/D1 /D2 D3 D4 


D5 /D6 /D7 
BLANK /VSYNC 
/RST 


FOR J := 0 TO 5 DO 


BEGIN 


IF J = 1 THEN 
BEGIN SETF VSYNC /BLANK 


END 


;LOAD DATA INPUT D2. 
;LOAD DATA INPUT /D2. 
;SHIFT DATA FROM Ql 
;INTO Q2, FOR SIX 
;BINARY COUNT PERIODS. 
;BLANK DURING LINE 
;AND FRAME PERIODS. 


;LOAD DATA INPUT D3. 
;LOAD DATA INPUT /D3. 
;SHIFT DATA FROM Q2 

; INTO Q3, FOR SIX 
;BINARY COUNT PERIODS. 
;BLANK DURING LINE 
;AND FRAME PERIODS. 


; : 
;LOAD DATA INPUT D4. 
;LOAD DATA INPUT /D4. 
+;SHIFT DATA FROM Q3 
;INTO Q4, FOR SIX 
;BINARY COUNT PERIODS. 
;BLANK DURING LINE 
7;AND FRAME PERIODS. 


;LOAD DATA INPUT D5. 
;LOAD DATA INPUT /DS 
;SHIFT DATA FROM 04 
;INTO Q5, FOR SIX 
;BINARY COUNT PERIODS. 
;BLANK DURING LINE 
;AND FRAME PERIODS. 


;LOAD DATA INPUT D6. 
;LOAD DATA INPUT /D6 
;SHIFT DATA FROM Q5 
;INTO Q6, FOR SIX 
;BINARY COUNT PERIODS. 
7BLANK DURING LINE 
;AND FRAME PERIODS. 


;LOAD DATA INPUT D7 
;LOAD DATA INPUT /D7 
;SHIFT DATA FROM Q6 
;INTO Q7, FOR SIX 
;BINARY COUNT PERIODS. 
7;BLANK DURING LINE 
;AND FRAME PERIODS. 
;ENABLE REGISTER 
;OUTPUT Q7 TO VIDEO 
;PIN. 


° 
’ 


+;START OF SIMULATION 
;TRACE ALL ESSENTIAL 
;INPUTS AND OUTPUTS. 


;SET INITIAL CONDITIONS. 
;APPLY ASYNCHRONOUS 
;RESET. SET REVERSE 
;VIDEO INACTIVE AND 
;CREATE ACTIVE BLANK 
7AND REMOVE RESET. 

7 INPUTS GENERATE FIVE 

; LOOPS. 


;SET VSYNC TO TEST 
; BLANKING. 


Figure 11. Design File of Video Shift Register (Cont’d.) 
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IF J = 2 THEN 


BEGIN SETF /REV /VSYNC BLANK 


END 
IF J = 3 THEN 
BEGIN SETF Dl /D2 D3 


/D6 D7 /BLANK /REV 


END 
IF J = 4 THEN 
BEGIN SETF REV 


END 
FOR I := 1 TO 8 DO 
BEGIN 
CLOCKF CLK 
END 
END 
TRACE_OFF 

Figure 11 

DOTCLK Voc 

RST RDY 
So HSYNC 
$1 MAG 
$2 MAS 
VSYNC MA4 
NC MA3 
NC MA2 
NC MA1 
NC MAO 
WR BLANK 
GND cs 
PAL32VX10 
LINE SYNC GENERATOR 


/D4 DS 


;TEST ACTIVE BLANK 
; INPUT. REMOVE ACTIVE 
;VSYNC. CLEAR ALL 
;BLANKING CONTROLS. 
;TEST THE LOADING 
;AND SHIFTING OF 
;DATA. FOR REVERSE 
;VIDEO ACTIVE AND 

; INACTIVE. 
;GENERATE LOOPS 

;TO LOAD AND SHIFT 
;DATA. 


. 
’ 


i 
;END OF SIMULATION 


. Design File of Video Shift Register (Cont’d.) 


DOTCLK Voc 
RST R3 
CEl R2 
NC R1 
NC Ro 
NC VSYNC 
NC MA11 
NC MA10 
NC MAQ 
WR MA8 
TST MA7 
GND NC 

PAL32VX10 
FRAME SYNC GENERATOR 


DOTCLK Vcc 
RST SO 
Di $1 
D2 $2 
D3 VIDEO 
D4 NC 
DS NC 
D6 NC 
D7 NC 
BLANK NC 
VSYNG NC 
GND REV 
PAL32VX10 


VIDEO SHIFT REGISTER 


414 09 


Figure 12. Pinouts for Various Components of a Small System Graphics Controller 
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The PAL®32VX10 offers design engineers several features not 
previously found in programmable logic devices (PLDs). Many 
of these features are found in the output macrocell; which al- 
lows over thirty possible configurations. This application note 
will highlight some of these features, and show how they are 


used in a Dual Port Video Shift Register. 

The PAL32VX10 is a 24-pin PLD with twelve dedicated inputs. 
The ten I/O macrocells have dual feedback, providing twenty 
additional array inputs. Each macrocell has the architecture 
shown in Figure 1. 


CLOCK INPUT 








SR 
FUSE SYNCHRONOUS 
ARRAY RESET 







OUTPUT ENABLE 





REGISTER 
BYPASS CONTROL 









REGISTER 
+ POLARITY 
CONTROL 


VARIED 
PRODUCT TERMS, 
8, 10, 12, 14 OR 16 





BURIED REGISTER 
FEEDBACK 


FEEDBACK FROM INPUT PIN 







AP 
ASYNCHRONOUS 
PRESET 


Figure 1. PAL32VX10 Macrocell 


Macrocell Description 


A D-type flip-flop is clocked by a dedicated input which also 
feeds the programmable array. The flip-flop data input is fed by 
a varied number of product terms, from eight to sixteen per out- 
put. The sum of these products feeds an Exclusive-OR (XOR) 
gate controlled by an additional product term. Global reset and 
preset signals are also controlled by product terms. 


A fuse controls flip-flop output polarity. A product term allows 
the flip-flop to be bypassed, and another product term enables 
the output. Feedback signals come from the flip-flop and the 
1/O pin, providing dual feedback. 


The device is manufactured with Monolithic Memories’ ad- 
vanced oxide-isolated process, providing high speed and low 
power consumption. The propagation delay is 30 ns maximum 
for the standard version and 25 ns maximum for the A-speed 
version. Power consumption is only 180 mA maximum. 


J-K Programmable Function 


The XOR gate in each of the ten macrocells provides the de- 
signer with the option to extend the functionality of the D-type 
register to perform J-K and subset functions, such as T-type 
toggle flip-flops for binary counter applications. This option can 
aid product term economy considerably. In many state ma- 
chine designs large numbers of product terms are needed as 
holding functions for a D-type flip-flop. The hold function is not 
inherent to the truth table of a D-type register, but in a J-K 
function it is, so a logic designer can invoke the hold condition 
without using up any of the product terms in the macrocell. 


It can be shown that there exist five logic equations for a J-K 
function using a D-type flip-flop, an XOR gate, and sum of 
product inputs; 


1.Q:=QU+Q°K 

2.Q:= QJ +Q’K 

3. Q := Q :+: (QU + Q*K) 

4.Q:=Q:+: (Q’U + Q'K) 

5.Q:=Q:4: (Q'S + Q*k) 

See Appendix 1 for the derivation of these equations. 


The key feature to note about these equations is that the first 
two can be realized from the sum of two product terms but ei- 
ther J or K must be inverted. A DeMorgan expansion of a com- 
plex J or K input might result in using too many product terms, 
and may not fit in the PAL32VX10. This limitation can be cir- 
cumvented by using equation 3; here, J and K inputs need no 
inversion but there is a requirement for the XOR operator, as 
shown in equations 3, 4 and 5. Alternatively, the non-inverted J 
and K functions may use more terms than the inverted ver- 
sions, pointing to the use of equations 1, 2, or 5. The macrocell 
Structure is capable of making use of any one of these five 
equations, allowing the designer to choose the one that makes 
optimum use of product terms. 


The fuse array was extended to 9,738 programmable fuses to 
accommodate ten additional feedback paths from the ten ma- 
crocells. These are the ten buried feedback paths (from the 
flip-flop) that make it possible to realize buried register state 
machine designs. Any macrocell whose output pin is pro- 
grammed as an input still has available its internal feedback 
path, and the registered cell's functionality need not be lost. 
The cell could be programmed as a buried register for use ina 
buried state machine. This feature extends the application of 
this device to input-intensive designs such as the Dual Port 


Video Shift Register. 
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PROM CLOCK SHIFT 
REGISTER AG 

RO DB1 A2 A7 
R1 DA2 DB2 A3 AB 
R2 A DA3 DB3 A4 AQ 
R3 A3 D3 DA4 DB4 AS A10 

MDO A4 D4 DA5 ——DBS5 AG RO 

MD1 A5 D5 DAG DB6 AT 

MD2 AG D6 DA7 DB7 A8 

MD3 A7 SELA so AQ 

MD4 A8 VSYNC S1 A10 

MD5 AQ BLANK S2 CHARACTER 

MD6 A10 VIDEO 
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GENERATOR 














eee 


OUTPUT 





Figure 2. Dual Port Video Shift Register in Video System 


Dual Port Video Shift Register 


This design highlights the the advantages of using PAL devices 
forimplementing custom designs. This design is shown inthe PAL 
device Design Specification file on page 2-279. Figure 2 shows 
how the device would be configured to a Microprocessor 
Unit (MPU) system as a parallel-to-serial converter in video 
applications. 


The PAL32VX10 in the system is capable of supporting one of 
two parallel input paths from character storage elements. The 
selected port input is converted to a serial video data stream 
through the VIDEO output for display on a CRT screen. Events 
of loading and shifting are synchronized to a Dot Clock input 
which is applied to pin 1 of the device, and is the rate at which 
the character dots are displayed on the CRT. There are two in- 
put ports, DA1-DA7 and DB1-DB7. One of these inputs is select- 


ed by the SELA control. When SELA is High the DA1-DA7 input 
is loaded, and subsequently shifted to the VIDEO output. When 
SELA is Low the DB1-DB7 input is loaded and shifted. 


The DA1-DA7 port could be configured to a character PROM, 
the content of which is a fixed character font. A second character 
memory could be configured to the DB1-DB7 port, which could 
be a RAM circuit. While the standard character font of the PROM 
is being displayed on the CRT, the RAM memory could be pro- 
grammed by the host CPU with a specialized character font. The 
contents of the RAM could be changed at any time, so a wide 
variety of custom fonts could be loaded, while the system is dis- 
playing the regular standard font stored in the PROM. Switching 
of the SELA input is controlled by the host system to select the 
required character memory. 
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Each of the two input ports is seven bits wide and is designed 
to interface with character matrix sizes of 7 by 9. However, if 
the SELA input is controlled by an externa! register which de- 
codes the state outputs SO, S1 and S2, the two ports may be 
configured as one single port, fourteen bits wide. 


The inputs BLANK and VSYNC are used to force a video blank- 
ing output during line and frame flyback. The BLANK input 
overlaps the line sync signal in a practical application to create 
front and back porch blanking on the CRT screen. 


Of the ten registered macrocells in the PAL32VX10, seven are 
connected as a parallel load serial shift register to the VIDEO 
output. The three remaining registers are configured as a 
three-bit binary counter. The counter outputs SO, S1 and S2 
determine whether the device is in load or shift mode. When 
S0-S2 are al! High the shift register loads character informa- 
tion; otherwise, shifting takes place for the seven remaining 
states of the binary counter. The output pins SO, S1 and S2 are 
connected to the buried registers Q10, Q9 and Q8 respective- 
ly, and are assigned in the PAL device Design Specification by 
SO = Q10, etc. Similarly, the VIDEO output pin is assigned reg- 
ister Q7. Registers Q1 to Q6 remain buried; the output pins 
normally associated with these register cells have been as- 
signed as the dedicated DB port inputs DB2-DB7. 


The three-bit binary counter has been designed using a T-type 
toggle register. Choosing Equation 4 and equating J = K = 1 for 
the Toggle function and J = K = 0 for the Hold function, in gen- 
eral J = K = T. The new input for a Toggle function becomes T 
and the equation becomes: 


6. Q:= Q +: (Q+Q)'T 
but since 

7.(Q+Q)=1 

the equation becomes: 


8.Q:=Q:+:T. 

The basic function in a binary counter is that a flip-flop must re- 
main in a Hold state until the product term made up of all of the 
less significant flip-flops in the count sequence becomes a log- 
ic High. So, the equation for Q8 in the design specification be- 
comes: 


9, Q8 := QB :+: Q9*Q10. 
When Q9*Q10 = 0 flip-flop Q8 holds its contents, and when 
Q9*Q10 = 1 Q8 changes state. 


The PAL device Design Specification shows that eighteen in- 
puts and four outputs are required, but the buried register fea- 
ture has not wasted the macrocells whose associated pins 
have been used as inputs. The design has been programmed 
with two other PAL devices (not shown here), one as a line 
sync generator and one as a frame sync generator, to form the 
basis of a low-cost video system. 


A simulation section has been included with the design to veri- 
fy functional operation before the Design Specification is com- 
mitted to the fuse array of the PAL device. It is recommended 
that the A-speed part is used in this specific application to ac- 
commodate good resolution of the displayed video information. 


The oscilloscope photograph in Figure 4 shows the SO, S1 and 
$2 outputs, running from the top trace down, and the serial 
VIDEO data on the bottom trace. The Load event takes place 
when all state outputs are High. For the first section of the 
trace, port DB7-DB1 is loaded and subsequently shifted 
through the buried register file. The cursor on the right marks 
the end of one Load/Shift cycle. The bit pattern loaded into 
DB7-DB1 is HLLHHHL. In the second half of the the trace, port 
DA7-DA1 is selected. The bit pattern of LLHLHLH is clocked 
into the DA7-DA1 port and shifted through to the VIDEO out- 
put. 


LOAD DB LOAD DA 





so EgReae 
S1 


$2 


VIDEO & 


Figure 4. Video Shift Register Waveforms 
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Dual Port Video Shift Register 
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Appendix 1 
Derivation of the J-K Function 


The complete truth table of a J-K function is shown in Table 1. 
Inputs J and K are tabulated with the current state of the regis- 
ter (QN) and the required next state (QN+1). Equations 1 and 2 
from the text are derived by directly grouping ones (S and s) or 
zeros (R and r) in the truth table. The other equations for the J- 
K function are derived by tabulating active 'SET' and 'RESET' 
functions and passive 'set' and 'reset' functions. 








Q=/Q*J + Q*/K EQ.1 
/Q = /Q*/S + Q*K EQ.2 
Q= Q:+: (/Q*J + Q*K) EQ.3 
/Q = /Q:3+: (/Q*J + Q*K) EQ. 4 
Q=/Q:+: (/Q*/3 + Q*/K) EQ.5 


Table1. Truth Table of J-K Function Using Change of State. 


Identifying Transition and Hold conditions as well as logic High 
and Low states enables the designer to draw a Karnaugh map 
for an XOR-gate register input. The XOR gate has the capabili- 
ty of a Hold function for the remain in 'set' and 'reset' states, 
and a Change-of-State function for active 'SET' and 'RESET’. 
To derive Equation 3, 


Q:= Q 1+: (Q*U + Q’K), 

the following equation is considered: 

Q:=s :+:(R+S), 

or, "Q holds unless an active transition is true." The entries in 
the equation show Hold and Transition states. When both 'S' 
and 'R' are inactive Low the register Q is in a Hold condition by 
the 's' entry in the equation. If 's' and 'R' become High simulta- 
neously the result will be an active Reset after the following 
clock edge; for this reason minimization of the 's' equation can 
include any 'R' entries in the Karnaugh Map. Also, the states of 
'S' and 's' are mutually exclusive, so with 's' inactive and 'S' ac- 
tive the output will transition to a Set condition. 

Minimize all the Hold 'set' states but include the active 'RESET' 
states for the entry to the left of the XOR function.This gives Q 
as shown in group 2 of the Karnaugh map. Then identify the 
change-of-state entries shown by the capital letters 'S' and 'R’. 
Minimization achieved in groups 1 and 3 are shown to the right 
of the equation. 

With regard to the equation for Q a similar procedure applies: 


Qi=rit:(S +R) 


This time all the passive ‘reset’ conditions are minimized with 
active 'SET' states, and XORed with minimization of active 
‘SET' and 'RESET' states (group 4). This leads to Equation 4. 


Equation 5 is derived by examining the equation for "Q will tog- 
gle unless a Hold is true", or 


Q:=Q:4:(s +1); 


the derivation is similar to those above. 
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TITLE DUAL PORT VIDEO SHIFT REGISTER. 
PATTERN 01B 

REVISION 01 

AUTHOR CHRIS JAY. 

COMPANY MMI SANTA CLARA, CA. 

DATE 15TH MARCH 1987. 


a‘ 

;THE PAL32VX10 HAS BEEN DESIGNED AS A VIDEO SHIFT REGISTER 

;WITH A DUAL SEVEN BIT PORT INPUT. PARALLEL CHARACTER DATA 

;MAY BE LOADED FROM A FIXED CHARACTER ROM VIA PORT INPUTS 

;DAl - DA7, WHILE THE SELA INPUT IS HIGH. PORT INPUTS DB1-DB7 

;MAY BE CONNECTED TO A CHARACTER RAM, THE CONTENTS OF WHICH 

;CAN BE CHANGED BY THE HOST MICROPROCESSOR. THIS ENABLES THE 

; VIDEO SYSTEM DISPLAY A STANDARD CHARACTER FONT WHILE DYNAMICALLY 
;CHANGING CUSTOM CHARACTER FONTS IN THE CHARACTER RAM. WHEN A 
;NEW CHARACTER DATA IS LOADED INTO THE RAM THE SELA INPUT CAN BE 
;DRIVEN LOW TO SELECT THE DB1-DB7 PORT INPUT. THE ON CHIP 

;STATE COUNTER SO, Sl AND S2 FORMS A SYNCHRONOUS BINARY COUNT. 
;STATES 0 TO 6 REPRESENT SHIFT ACTIVITY IN THE SEVEN BIT VIDEO 
;SHIFT REGISTER. WHEN THE COUNTER REACHES A COUNT OF SEVEN THE 
;NEXT CHARACTER DATA IS LOADED INTO THE REGISTER VIA PORT DA OR DB. 
;THERE ARE TWO BLANKING INPUTS /VSYNC AND BLANK. WHEN /VSYNC IS 
;LOW THE CONTENTS OF THE VIDEO SHIFT REGISTER ARE LOADED WITH LOGIC 
;ZEROS. THIS IS ALSO THE CASE WHEN BLANK IS ACTIVE HIGH. THE 
;VSYNC AND BLANK INPUTS SHOULD BE USED DURING FRAME AND HORIZONTAL 
;FLYBACK, RESPECTIVELY. 





CHIP VIDEO 2 PAL32VxX10 


; 
;DUAL PORT VIDEO SHIFT REGISTER PAL 


; 

;PINS 1 , 2 3 4 5 6 
CLK DAL DA2 DA3 DA4 DAS 

;PINS 7 8 9 10 11 12 
DA6. DA7 BLANK /VSYNC SELA GND 

;PINS 13 14 15 16 17 18 
DB1 DB2 DB3 DB4 DB5 DB6 

;PINS 19 20 21 22 23 24 
DB? VIDEO s2 Sl SO vcc 


GLOBAL Ql Q2 Q3 Q4 Q5 Q6 Q7 QB QI Q10 


STRING LOAD 'Q8*Q9*Q10' ;ENABLE LOADING OF CHARACTER DATA. 
EQUATIONS ; 
/Q10 3= Q10 ; INTERNAL STATE COUNTER 
SO = Q10 ;LEAST SIGNIFICANT BIT 
SO.CMBF = GND 7;Q10. ENABLE REGISTER 
;OUTPUT FOR PIN SO. 

/Q9 = /Q9 :; INTERNAL STATE COUNTER 

st: Q10 7Q10 = HIGH = TOGGLE. 
Sl = Q9 ;ENABLE PIN Sl 
S1.CMBF = GND ;REGISTERED OUTPUT. 
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/Q8 


S2 
S2.CMBF 


/Ql 


/Q2 


/Q3 


/Q4 


/Q5 


/Q6 





2-280 


won + tt 


+++eetei 


t++et¢4¢41 


+e etesti 


++ ¢etetil 


t++ete testi 


t++eeetseil 


; INTERNAL STATE COUNTER 


/Q8 

Q10*Q9 7Q10*Q9 = HIGH TOGGLE. 
Q8 ;ENABLE PIN S2 AS A 

GND ;REGISTERED OUTPUT. 


/DA1*LOAD* SELA* /BLANK* /VSYNC 
/DB1*LOAD* /SELA* /BLANK* /VSYNC 
/Q8 

/Q9 

/Q10 

BLANK 

VSYNC 


/DA2*LOAD* SELA* /BLANK* /VSYNC 
/DB2*LOAD* /SELA* /BLANK* /VSYNC 
/Q1*/Q8 
/Q1*/Q9 
/Q1*/Q10 

BLANK 

VSYNC 


/DA3*LOAD* SELA* /BLANK* /VSYNC 
/DB3*LOAD* /SELA* /BLANK* /VSYNC 
/Q2*/Q8 
/Q2*/Q9 

/Q2*/Q10 

BLANK 

VSYNC 


/DA4*LOAD* SELA* /BLANK* /VSYNC 
/DB4*LOAD* /SELA* /BLANK* /VSYNC 
/Q3*/Q8 

/Q3*/Q9 

/Q3*/Q10 

BLANK 

VSYNC 


/DAS5*LOAD* SELA* /BLANK* /VSYNC 
/DB5*LOAD* /SELA* /BLANK* /VSYNC 
/Q4*/08 
/Q4*/Q9 
/Q4*/Q10 

BLANK 

VSYNC 


/DA6*LOAD* SELA* /BLANK* /VSYNC 
/DB6*LOAD* /SELA* /BLANK* /VSYNC 
/Q5*/Q8 

/Q5*/Q9 

/Q5*/Q10 

BLANK 

VSYNC 
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;LOAD DATA INPUT 
;LOAD DATA INPUT 
;SHIFT HIGH INTO 
;WHEN Q10,Q9 AND 
;ARE NOT ALL HIGH 
; SIMULTANEOUSLY. 
;LINE/FRAME BLANK 


;LOAD DATA INPUT 
;LOAD DATA INPUT 
;SHIFT DATA FROM 
;INTO Q2. 


;LINE AND FRAME 
; BLANKING. 


;LOAD DATA INPUT 
;LOAD DATA INPUT 
;SHIFT DATA FROM 
sINTO Q3. 


’ 
;LINE AND FRAME 
; BLANKING. 


‘ 

;LOAD DATA INPUT 
;LOAD DATA INPUT 
;SHIFT DATA FROM 
;INTO Q4. 


;LINE AND FRAME 
; BLANKING. 

;LOAD DATA INPUT 
;LOAD DATA INPUT 
;SHIFT DATA FROM 
; INTO Q5. 

;LINE AND FRAME 
; BLANKING. 


’ 


;LOAD DATA INPUT 
;LOAD DATA INPUT 
;SHIFT DATA FROM 
; INTO Q6. 
7;LINE AND FRAME 
; BLANKING. 


’ 


DAl. 
DBl. 
Ql. 
Q8 


ING 


DA2. 
DB2. 
Ql 


DA3. 
DB3 
Q2 


DA4. 
DB4 
Q3 


DAS. 
DB5 
Q4 


DA6. 
DB6. 
Q5 


PAL32VX10 Uses Buried Register for Input-Intensive State Machine Designs 


/Q7 = /DA7*LOAD* SELA* /BLANK* /VSYNC 
+ /DB7 *LOAD* /SELA* /BLANK* /VSYNC 
+ /Q6*/Q8 
+ /Q6*/Q9 
+ /Q6*/Q10 
+ BLANK 
+ VSYNC 
/VIDEO = /Q7 
/VIDEO.CMBF = GND 
SIMULATION 
TRACE _ON CLK SO Sl S2 
VIDEO /VSYNC BLANK SELA 
Ql Q2 Q3 Q4 Q5 Q6 
DAl DA2 DA3 DA4 DAS DA6 DA7 
DB1 DB2 DB3 DB4 DBS5 DB6 DB7 
PRLDF Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 Qin 
SETF /CLK 
DAl /DA2 DA3 /DA4 DAS /DA6 DA7 
/DB1 DB2 /DB3 DB4 /DB5 DB6 /DB7 
SELA BLANK /VSYNC 
FOR I := 0 TO 7 DO 
BEGIN CLOCKF CLK 
END 
SETF /SELA 
FOR I := 0 TO 7 DO 
BEGIN CLOCKF CLK 
END 
SETF /BLANK SELA 
FOR I := 0 TO 7 DO 
BEGIN CLOCKF CLK 
END 
SETF /SELA 
FOR I := 0 T0 7 DO 
BEGIN CLOCKF CLK 
END 
SETF VSYNC 
FOR I := 0 TO 7 DO 
BEGIN CLOCKF CLK 
END 
TRACE OFF 
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;LOAD DATA INPUT DA6. 
;LOAD DATA INPUT DB6. 
;SHIFT DATA FROM Q6 

; INTO Q7. 


;LINE AND FRAME 
; BLANKING. 


;OUTPUT Q7 TO VIDEO. 


;START OF SIMULATION 
;TRACE ALL ESSENTIAL 


INPUTS AND OUTPUTS. 


me Ne Ne NO 


;PRELOAD REGISTERS 

;SET INITIAL CONDITIONS. 
;ON DATA A INPUT PORT 
;AND DATA B INPUT PORT. 
;ENABLE A PORT AND 
;BLANK INPUT. 

;BEGIN THE DOT CLOCK 

; INPUT FOR EIGHT CLOCK 
;CYCLES. SELECT B INPUT 
; ENABLE CLOCK FOR EIGHT 
;CLOCK CYCLES. 


;REMOVE BLANKING INPUT 
;SELECT A PORT CLOCK 
;FOR EIGHT CYCLES. 


, 


;ENABLE B INPUT PORT 
;CLOCK FOR EIGHT CLOCK 
;CYCLES. 

; ENABLE VSYNC INPUT 
;CLOCK FOR EIGHT 
;CLOCK CYCLES. 


; 
7;END OF SIMULATION 
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The field of Digital Signal Processing (DSP) was originally limited 
to the digital filtering and processing of analog signals. Today, the 
field encompasses a wide range of numeric processing applica- 
tions, primarily due to the versatility of the Fast Fourier Transform 
algorithm with its accompanying repetitive addition and multipli- 
cation operations. 


The ability to perform multiplication and addition in a single clock 
cycle is the key to achieving high-speed mathematics in DSP 
hardware. When executing math-intensive algorithms, even the 
slowest DSP components are at least ten times faster than con- 
ventional microprocessors. When a DSP solution is used, it is 
usually for speed considerations. Conventional microprocessors 
are not optimized for arithmetic, and have to simulate math 
through software. This simulation tends to require several clock 
cycles. DSP systems use specialized hardware, including a 
multiplier and adder, that allow them to execute numeric opera- 
tions in a single cycle. A typical DSP system is shown in 


Figure 1. 
MAIN SYSTEM BUS 


INTERFACE 
ARITHMETIC 


ADDRESS 
CONTROL 


MICROPROGRAM CONTROL 


Figure 1. High-Performance DSP System 




























411 01 


In addition, the architecture must try to provide data to the 
multiplier and adder at every clock cycle, for the highest efficiency. 
This can be done by using an architecture where data and 
instructions are stored separately and have their own buses, one 
for data and one for instructions. If two data buses are used, two 
data elements can be moved to the multiplier or adder in parallel 
for execution, while the CPU is fetching an instruction. Another 
efficient architecture uses pipelining, allowing the processor to 
work on subsections of the operation in parallel. 


Application Areas 


The algorithms implemented in DSP components are being 
applied to a wide variety of systems. Examples include standard 
signal processing for applications from compact disc players to 
artificial intelligence, matrix-manipulation algorithms for graphics 
systems, filtering algorithms used in image-processing systems, 
and floating-point algorithms used in workstations and scientific 
computers. Many of these applications are often categorized 
under “numerical processing” or “number crunching’ rather than 
DSP, but any algorithm that makes extensive use of repetitive 
multiplication and addition operations can benefit from DSP 
hardware. The primary application area is communications, with 
military applications also a major part of the market. The 
application areas are listed below. 


Communications 

—Codecs 

—Modems 
—Multiplexers/Demultiplexers 

—Echo Cancellation 

—Filtering 

~—Gain equalization 

Military 

—-Radar/Sonar 

—Guidance and Control 

—Telemetry 

—Electronic Warfare/Countermeasures 
Computers 

—PC-Board Add-Ons for PCs 
—Minicomputers/Workstations 
—Parallel/Array Processors 

Graphics and Image Processing 
-—CAD/CAE 

—Image Enhancement and Restoration 
-Image Subtraction, Compression, Decompression 
—CAT Scanning 

Industrial Automation 

—-CAM 

—-Vision Systems/Robotic Vision and Other Senses 
—Artificial Intelligence 

Voice Data Entry/Synthesis 

Test Equipment 

Process Control 

Consumer Products 

—Television 

—Compact Disc Players 
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Figure 2. Typical General-Purpose DSP Processor System 


Speed versus Other Requirements 


As mentioned, most DSP designs are chosen because of speed. 
However, speed is not always a critical element once a DSP 
solution is selected: the slowest DSP design willstillbe fasterthan 
the fastest conventional microprocessor design. 


Speed remains a critical element in most communications appli- 
cations. An exception is modems, which require single-chip 
solutions for size reduction and do not require high speed to 
handle voice frequencies. Speed is also critical in computer 
applications, but price and performance advantages are also 
important. DSP add-in boards require small size, and therefore 
single-chip solutions. In military applications, however, price is 
less critical, and designs are typically customized for 
performance. 


Consumer applications use custom single-chip solutions for both 
size reduction and cost reduction; these are usually produced by 
a captive supplier. Many of the other applications require a 
flexible solution and a short development time, both efficiently 
provided by programmable logic devices. 
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Single Chip versus Building Block 


Similar to the beginnings of the conventional microprocessor, 
dedicated DSP processors are being introduced along with 
development software. Single-chip DSP solutions offer easier 
software development, smaller size, and lower cost. Several 
DSP processors are commercially available, and are taking the 
place of conventional microprocessors in DSP systems (Figure 
2). Most are general-purpose processors, but application-spe- 
cific processors are also being developed. 


In addition to these dedicated DSP processors, custom proces- 
sors can be built with an ASIC methodology. Custom processors 
are especially efficient for high-volume applications. 


The alternative to a single-chip DSP processor is a building-block 
solution using multiple components. These components would 
typically involve bit-slice, multiplier/accumulators, ALUs, and 
program memory, as performed in the application note on page 
2-307. These solutions provide higher speed and are efficient in 
speed-critical or low-volume applications. An example is medical 
image processing, where only a few dedicated image filtering 
routines are performed repeatedly on the images—the building 
blocks can be designed to perform quickly asmall set of functions. 
Many applications use a mix of single-chip solutions with accom- 
panying building blocks for added functionality (see Figure 3). 
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Figure 3. Am29500 Processor Family Architecture. 
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PLD Applications 


PLDs are useful both in single-chip and building block designs. If 
DSP components are used as accelerators in conjunction with 
standard microprocessors, the interface logic can be placed 
into PLDs. 


When supporting DSP processors, PLDs perform functions 
similar to those supporting a general purpose microprocessor. 
These include address decoding, interrupt control, DMA control, 
and bus control. See the section on Microprocessor-Based 
Systems for more detail. 


Programmable logic devices are also useful in the building-block 
applications. PLDs often perform functions similar to their use in 
standard bit-slice designs, as counters or sequencers. 


The need for a continuous supply of data to the numeric compo- 
nents of the DSP system provides opportunities for PLDs in both 
building-block and single-chip designs. Handshaking between 
components, especially between the numeric chips and the 


memory or sequencer, is also required, and with multiple buses 
in the system, complex bus interface logic is needed. 


The application note on page 2-307 shows PAL devices providing 
several functions for a building-block DSP system which imple- 
ments an audio spectrum analyzer. The PAL devices create the 
sequencer for program execution, multiplex the data outputs to an 
oscilloscope, and provide several miscellaneous control func- 
tions. 


PLDs provide an effective solution in any type of DSP architec- 
ture. General-purpose DSP processors need PLDs to perform 
typical processor support functions, while ASIC processors may 
need additional logic to provide the proper interface to other 
chips. In building block systems, PLDs perform both the basic 
control functions and interface functions. With limited availability 
of software development tools for most DSP applications, pro- 
grammable logic devices offer the flexibility to implement last- 
minute design changes to adapt to the needs of the software and 
application. 
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Asimple waveform generator can be built with a PAL device ad- 
dressing locations within a PROM. Using a PAL20X10 device to 
address locations in a registered PROM (Monolithic Memories 
63RA1681), digital data may be accessed and fed to a Digital to 
Analog Converter (DAC). The locations in the memory are pro- 
grammed with appropriate data to provide a sinusoidal, triangu- 
lar, sawtooth or step function output. A square wave output may 
be generated directly from the PAL device itself. 


Digital vs. Analog Waveform 
Generation 


There are advantages and disadvantages in generating analog 
functions by this method when considered against a dedicated 
analog circuit design. The main advantages favoring the digital 
method of waveform generation involve controllability. In this 
design example phase shift generation can be controlled within a 
tolerance of one degree. The PAL device has been programmed 
as a 180 degree up/down counter. An initial count and up/down 
contro! may be loaded into the counter so precise phase shifts can 
be generated. When considering the phase shift of a sinusoidal 
waveform in an analog circuit, complex Resistor-Capacitor (RC) 
networks might need to be added to generate phase shifting. 


Another advantage is the ability of the system to generate a wide 
range of waveform types, and the convenience of generating new 
waveforms by simply re-programming a PROM, or programming 
blank locations in the existing PROM. Also, if it were a require- 
ment to generate a burst of any number of cycles, this could be 
controlled by a digital interface from a microprocessor. 


The disadvantages associated with the digital circuit would be the 
generation of quantization noise due to the encoding of a digital 
code into a close approximation of the analog conversion. For 
example, an eight-bit digital code can be converted to 256 differ- 
ent voltage levels. Supposing a 10 volt signal were to be evenly 
encoded into 256 discrete levels, the difference between each 
level would be about 39 mV. The approximation of the digital 
signal to its analog equivalent gives rise to quantization noise, in 
this example 39 mV. 


A way of reducing the quantization noise is to generate a larger 
digital code. Fora 10-volt dynamic range encoded by a fourteen- 
bit digital code, the quantization noise is reduced to 0.61 mV. Of 
course there are problems in increasing the resolution of the 
digital code in the respect that a much higher sampling frequency 
would be required to generate the analog output of a given period. 


Additional to the problem of quantization noise is the noise 
generated by the super-imposition of elements of the master 
sampling clock on the analog wave output. Fortunately the 
frequency of this clock is much higher than the analog output, for 
aresolution of eight bits, and may be filtered out with a simple RC 
filter. 


In summary, the advantages in a digital waveform generator are 
in the controllability of phase shifting, and the number of cycles 
generated. Also, the ability to guard against drift might be an 
advantage. Withan analog system, purity of signal output and the 
ability to generate higher frequencies are the main advantages. 


PAL Device Implementation 


The diagram in figure 1 shows the circuit used to create the 
waveforms. Photographs of the waveforms are provided in figure 
2. The different functions generated were selectable by a unique 
digital code applied to the higher order address inputs A8, A9, and 
A10 of the PROM. The lower order addresses were driven from 
the PAL20X10; outputs Q0-Q7 were fed to address inputs 
A0-A7. The design specification of the 180 Degree Up/Down 
counter (page 2-288) was encoded into the PAL device. 


Look-Up Table Generation 


To generate the hexadecimal look-up table, which was pro- 
grammed into the PROM, a small TURBO PASCAL program was 
written to provide a conversion from the angle (degrees) to the 
sine of the angle, in a hexadecimal value (page 2-290). The 
program generated the nearest hexadecimal value to 100 
(sine(x) +1), for a range.of 180 degrees, at a resolution of one 
degree. The sawtooth and triangular waveforms were simple 
identity functions; the PROM location address was programmed 
into its own contents. The step waveform was programmed as 
constant values in a number of successive adjacent locations. 
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TITLE 180DEGREE COUNT 
PATTERN 01 

REVISION 01 

AUTHOR CHRIS JAY 
COMPANY MMI SANTA CLARA 
DATE 17 OCTOBER 1986 


CHIP DEGREE PAL20X10 


7PIN 1 2 3 4 5 6 
CLK /UD pole) D1 D2 D3 
*;PIN 7 8 9 10 11 12 
D4 DS D6 D7 DOWN GND 
7PIN 13 14 15 16 17 18 
/OE Qo Ql Q2 Q3 Q4 
7PIN 19 20 21 22 23 24 
Q5 Q6 Q7 CUP CDN vec 


¢ 

;THE PAL20X10 HAS BEEN DESIGNED AS A 180 DEGREE 
;LOADABLE UP/DOWN COUNTER. THE COUNTER OUTPUTS 
;DRIVE THE ADDRESS INPUTS OF A PROM WHICH IS 
#PROGRAMMED WITH A 'LOOK UP! SINE TABLE. A COUNT 
;UP FROM ZERO TO 180 WILL GENERATE ONE HALF 
;CYCLE FROM THE PROM CONTENTS. THE COUNTER WILL 
;AUTOMATICALLLY SWITCH TO A DOWN COUNT AND GENERATE 
7;THE SECOND HALF OF ONE COMPLETE CYCLE. IF THE 
;PROM OUTPUTS ARE REGISTERED AND FED TO A D/A 
s;CONVERTER A CONTINOUS SINUSOIDAL SIGNAL WILL BE 
?GENERATED. COUNTING CAN BE INTERRUPTED WITH A 
7;LOAD. PHASE SHIFTING OF THE SINUSIODAL WAVEFORM 
;MAY BE ACHIEVED WITH A RESOLUTION OF 1 DEGREE. 


’ 
EQUATION 


720 t= Q0*/LD +COUNT 
+ /DO*LD ;LOAD 
; 
/Q1 := /Q1*/LD ;HOLD 
+ /D1*LD ;LOAD 
t+: QO*/LD*CUP ;UP COUNT 
+ /Q0*/LD*/CUP ;DOWN COUNT 
; 
/Q2 s= /Q2*/LD ;HOLD 
+ /D2*LD ;LOAD 
t+: Q0*Q1*/LD*CUP ;UP COUNT 
+ /Q0*/Q1*/LD*/CUP ;DOWN COUNT 
: 
/Q3 s= /Q3%*/LD +HOLD 
+ /D3*LD ;LOAD 
s+: Q0*Q1*Q2*/LD*CUP ;UP COUNT 
+ /Q0%/Q1*/Q2*/LD*/CUP ;DOWN COUNT 
; 
/24 s= /Q4%/LD }HOLD 
+ /D4*LD ;LOAD 
t+: Q0#Q1*Q2*Q3%/LD*CUP ;UP COUNT 


+ /Q0%/Q1*/Q2*/Q3*/LD*/CUP 
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/Q5 = /Q5*/LD 
+ /D5*LD 
S+3 Q0*Q1*Q2*Q3%Q4*/LD*CUP 
+ /Q0*/Q1*/Q2*/Q3*/Q4*/LD*/CUP 


/Q6 := /Q6*/LD 
+ /D6*LD 
t+: Q0*Q14*Q24%Q3*Q4*Q5*/LD*CUP 
+ /QO0*/Q1*/Q2%/Q3*/Q4*/Q5*/LD*/CUP 


/Q7 t= /Q7*/LD 
+ /D7*LD 
t+: Q0*Q1*Q2*Q3%Q4*Q5*Q6*/LD*CUP 
+ /Q0*/Q1*/Q2%/Q3*/Q4%*/Q5%*/Q6*/LD*/CUP 


/CUP z= /CUP*/LD 
+ DOWN*LD 
s+2 /Q7*/Q6*/Q5*/Q4%/Q3%/Q2%/Q1*Q0*/LD*/CUP 
+ = Q7#/Q6*Q5%*Q4%/Q3 */Q2*Q1*Q0*/LD*CUP 


/CDN = CUP*/LD 
+ /DOWN*LD 
+2 /Q7*/Q6%/Q5*/Q4%/Q3%/Q2%*/Q1*Q0*/LD*/CUP 
+ — Q7®/Q6*Q5*Q4%/Q3*/Q2*Q1*Q0%/LD*CUP 


SIMULATION 
TRACE ON CLK /DOWN LD CUP CDN 
QO Ql Q2 Q3 Q4 Q5 Q6 Q7 
SETF /CLK LD OE DO Dl D2 
/D3 /D4 /D5 /D6 /D7 DOWN 


SETF /LD 

CLOCKF 

FOR I := 0 TO 15 DO 

BEGIN CLOCKF 

END 

SETF /CLK LD /DO /D1 /D2 
/D3 D4 DS /D6 D7 /DOWN 

CLOCKF 

SETF /LD 

CLOCKF 

FOR I := 0 TO 15 DO 

BEGIN CLOCKF 

END 

TRACE_OFF 


C1 mononthic Mi memories ot 





+HOLD 

7? LOAD 

;UP COUNT 
*DOWN COUNT 


e 

*HOLD 

* LOAD 

7;UP COUNT 
*DOWN COUNT 


;UP COUNT 
7DOWN COUNT 


;COUNT UP 
*REGISTER 


e 
‘ 


=e te 


7COUNT DOWN 
*REGISTER 


we Se Se Se te Se 


;LOAD 03H 
?AND DOWN 
;COUNT. SET 
;INTO COUNT 
*MODE AND 
#GENERATE 
?VECTORS TO 
#TEST DOWN 
?TO UP TRANS- 
;~ITION. LOAD 
?BOH AND UP 
;COUNT. 
;GENERATE 
;VECTORS TO 
;TEST UP TO 
;DOWN TRANS- 
; eITION. 
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program sinetable; 


(BARR RRR RRR RIK KIRKE RARER RARER ERE EERE RARER AKER REAR RERRREREKKER ) 


(* *) 
(* The program 'sinetable' has been designed to generate *) 
(* the closest possible HEXIDECIMAL value to 100 X [sine(x) + 1 x) 
(* for -90 <= x => +90, in steps of delta x = 1 degree. The *) 
(* HEXIDECIMAL PROM address locations have been evaluated as *) 
(* as n = x + 90. The program will generate a SINE.DAT file *) 
(* which will contain a table of PROM address locations from *) 
(* from 00 to B4 HEX, and the contents of each location as the *) 
(* closest possible sine value as computed in the above equation. *) 
(* Program written by: *) 
(* Chris Jay and Ser-Hou Kuang. MMI Santa Clara. *) 
s *) 


(BAAR RRR RRR RRR RRR ARIA RRR RARER RR RARE RE RRR RRR IIR R RR ERR KARR ARIE ) 


const 
hexsize = 10; 
type 
hex_type = packed array [1..hexsize] of char; 


var 

SinFile: text; 
i,k:integer; 

s,x,f:real; 

hexar,hexcon : hex_type; 


(RRR RI RII TR RR IRR TR RRR RIK RR KIKI RRR RRR RK RIKER RARER ERE K KY 
( *) 
(* Procedure to convert decimal values into hexadecimal values *) 
(* x 
(FOGG III GIGI GIICIIGIIUIGICIOIIIGITICIOIDII ISIC GTO IOIGIOIO IOI IO IIIT) 


Procedure DecToHex (dec : integer; var tohex: hex_type); 
var 
i, dig, idx, next : integer; 


Function hex (i : integer): char; 


begin 

case i of 
0 : hex := '0'; 
1: hex := '1'; 
2: hex := '2'; 
3 3: hex := '3'; 
4: hex := '4'; 
5 : hex := '5'; 
6 : hex := '6'; 
7 : hex := '7'; 
8 ; hex := '8'; 
9 : hex := '9'; 
10 : hex := ‘A’; 
11; hex := 'B'; 
12 : hex := 'C'; 
13 : hex := 'D'; 
14 : hex := 'E'; 
15 : hex := 'F'; 

end; 

end; 
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begin 
for i := 1 to hexsize do 
tohex [i] := ' °'; 


idx := hexsize; 
dig := dec; 
next := dig div 16; 


while (next <> 0) and (idx > 0) do 
begin 
ToHex {idx] := hex (dig - (next * 16)); 
idx := idx -1l; 
dig := next; 
next := dig div 16; 
end; 
ToHex [idx] := hex (dig); 
end; 


begin 
Assign(SinFile, 'B:SINE.DAT'); 
Rewrite(SinFile); 





x 3= 0; 
f := 0; 
i := -90; 
s := pi/180; 
while i < 91 do 
begin 
k := i + 90; 
x 2= (i*s); 
f := 1 + sin(x); 


f := 100*£; 
DecToHex (round (f), hexar); 
DecToHex (k,hexcon); 
if i = -90 then 
begin 
writeln ('DEGREES LOC 100*SIN X + 1 HEXLOC HEX'); 
end; 
write (i:3,' “ ke3, S34 
writeln (£:10:5,' ',hexcon,' ‘ ,hexar); 
if i = -90 then 
begin 
writeln(SinFile,'DEGREES 100*SIN(X)+1 HEXLOC 100*SIN(X)+1[HEX]'); 
end; 
writeln(SinPile,k:5,' ',£:10:5, ' ',hexcon,' ',hexar); 
i := 1+ 1; 
end; 
Close(SinFile); 


end. 
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The PAL®32VX10 offers design engineers several features not 
previously found in programmable logic devices (PLDs). Many 
of these features are found in the output macrocell, which al- 
lows over thirty possible configurations. This application note 
will highlight some of these features, and show how they-are 
used in Dual Binary Rate Multipliers. 


The PAL32VX10 is a 24-pin PLD with twelve dedicated inputs. 
The ten I/O macrocells have dual feedback, providing twenty 
additional array inputs. Each macrocell has the architecture 
shown in Figure 1. 
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Figure 1. PAL32VX10 Macrocell 


Macrocell Description 


A D-type flip-flop is clocked by a dedicated input which also 
feeds the programmable array. The flip-flop data input is fed by 
a varied number of product terms, from eight to sixteen per out- 
put. The sum of these products feeds an Exclusive-OR (XOR) 
gate controlled by an additional product term. Global reset and 
preset signals are also controlled by product terms. 


A fuse controls flip-flop output polarity. A product term allows 
the flip-flop to be bypassed, and another product term enables 
the output. Feedback signals come from the flip-flop and the 
1/0 pin, providing dual feedback. 


The device is manufactured with Monolithic Memories’ ad- 
vanced oxide-isolated process, providing high speed and low 
power consumption. The propagation delay is 30 ns maximum 
for the standard version and 25 ns maximum for the A-speed 
version. Power consumption is only 180 mA maximum. 


Dual Binary Rate Multipliers 


The availability of the clock input in the fuse array avoids the 
addition of external logic in applications where it is necessary 
to use the system clock as a logic input. In many synchronous 
PLDs the system clock input is connected only to the register 
file. The rising edge of the clock input synchronizes data flow 
through the D-type registers, but the clock is not available as a 
Boolean signal in the fuse array. 


The application of the Dual Binary Rate Multipliers uses the 
buried register capability, the XOR function to build counters 
with a T-type register, and the presence of the clock input in 
the fuse array. The circuit requires sixteen data inputs. There 
are two data byte inputs DAO-DA7 and DBO-DB7. The two serial 
outputs are QA associated with the DA port and QB with the DB 
port. The clock input is divided by sixteen and is available at 
the QA or QB outputs multiplied by the binary data inputs DAO- 
DAS or DA4-DA7, and DBO-DB3 or DB4-DB7. 


The output SELOP (Select Output) selects one half of each A 
or B byte as the multiplying vector. When SELOP is High the 
lower half of each input byte is selected to multiply the clock in- 
put. When Low, the upper half of each input is used to multiply 
the clock input. 


For example, if data inputs DA4-DA7 and DB4-DB7 were binary 
encoded as six and three, respectively, then six and three 
clock pulses would be gated through to the outputs QA and QB 
respectively while SELOP is Low. When SELOP goes High the 
data inputs DAO-DA3 and DBO-DB3 are selected as the multipli- 
ers. If DAO-DA3 contained the binary encoded input of eight, 
then eight clock pulses would be gated to QA. At the same 
time, if DBO-DB3 contained fifteen, fifteen clock pulses would 
appear at the QB output. These waveforms are shown in the 
oscilloscope photograph in Figure 2. 
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Figure 2. Dual Binary Rate Multipliers Waveforms 
Dual Binary Rate Multipliers 


Figure 3. Pin Layout 
The two additional outputs, LSB and MSB, are byte-select out- 


puts. This internal state machine has four unique states, per- 
forming a binary count zero to three. Just as the SELOP output 
can be used to select the multiplying half of each byte, LSB 
and MSB may be decoded externally to select a one-of-four 
byte data input to be applied to parallel inputs A or B. A pro- 
grammable controlled pulse train from four byte inputs can then 
be available at the QA and QB outputs. 
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TITLE DUAL BINARY RATE MULTIPLIERS 
PATTERN 05 

REVISION 01 

AUTHOR CHRIS JAY 

COMPANY MMI SANTA CLARA, CA 

DATE AUGUST 6, 1986 


° 
g 


CHIP BINRATE PAL32VX10 

;THE PAL32VX10 HAS BEEN PROGRAMMED TO PERFORM BINARY 
;RATE MULTIPLICATION. THE OUTPUTS AT 

7;QA AND QB ARE ONE-SIXTEENTH OF THE CLOCK INPUT 
;MULTIPLIED BY THE BINARY VALUE OF THE INPUTS 
;DAOQ-DA3 AND DBO-DB3, RESPECTIVELY, WHEN SELOP 

71S HIGH; OR, ONE-SIXTEENTH OF THE CLOCK INPUT 
;MULTIPLIED BY THE BINARY VALUE OF THE INPUTS DA4-DA7 
;AND DB4-DB7, WHEN THE SELOP SIGNAL IS LOW. 


;PINS 1 2 3 4 5 6 
CLK DAO DAl DA2 DA3 DA4 
:PINS 7 8 9 10 11 12 
DA5 DA6 DA7 DBO DB1l GND 
;PINS 13 14 15 16 17 18 
DB2 DB3 DB4 DB5 DB6 QA 
;PINS 19 20 21 22 23 24 
QB DB7 SELOP LSB MSB vcc 


GLOBAL Q0 Ql Q2 Q3 NC NC NC SEL LSBI MSBI ;BURIED NODES 


STRING Kl 'QO* Q1* Q2*/Q3' ;STRING STATEMENTS 
STRING K2 'QO* Q1*/Q2' ;TO DECODE USABLE 
STRING K3 'Q0*/Q1' ;STATES OF INTERNAL 
STRING K4 'Q0*/Q1* Q2* Q3' ;DIVIDE BY SIXTEEN 
STRING K5 'Q0*/Q1*/Q2*/Q3' ;BINARY COUNTER. 
STRING CKNL '/CLK* SEL' ;CLOCK SELECTION 
STRING CKNH '/CLK*/SEL' ;STRING STATEMENTS 
EQUATIONS 

/Q0 := Q0 ;CLOCK DIVIDE BY TWO. 


;BURIED REGISTER. 


/Ql := /Ql :+: QO ;CLOCK DIVIDE BY FOUR. 
;BURIED REGISTER. 


/Q2 2= /Q2 :+: Q0*Q1 ;CLOCK DIVIDE BY EIGHT. 
;BURIED REGISTER. 


/Q3 := /Q3 :+: Q0*Q1*Q2 ;CLOCK DIVIDE BY SIXTEEN. 
;BURIED REGISTER. 
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/SEL := /SEL :+: Q0*Q1*Q2*Q3 ;SELECT O/P HIGH TO SELECT 
SELOP := SEL ;DATA 0-3, LOW TO 
;SELECT DATA 4-7. 
QA = DAO * K1*CKNL ;COMBINATORIAL OUTPUT OF 
+ DA3 * /QO*CKNL ;CLOCK MULTIPLIED BY THE 
+ DAL * K2*CKNL ;DAO-DA3 INPUT WHEN SEL 
+  DA2* DA3 * K2*CKNL ;IS HIGH, AND DA4-DA7 
+  DA2*/DA3 * K3*CKNL ;WHEN SEL IS LOW. 
+  DA1* DA2 * K3*CKNL ; 
+ /DA1* DA2* DA3* K4*CKNL ; 
+ /DA1* DA2* DA3* K5*CKNL ; 
+ DA4 * K1*CKNH ; 
+ DAT * /Q0*CKNH ; 
+ DAS * K2*CKNH ; 
+  DA6* DAT * K2*CKNH ; 
+  DA6*/DA7 * K3*CKNH ; 
+  DAS5* DAG * K3*CKNH ; 
+ /DAS5* DA6* DA7* K4*CKNH ; 
+ /DA5* DA6* DA7* K5*CKNH ; 
QA.CMBF = VCC ; ENABLE COMBINATORIAL 
;OUTPUT FUNCTION. 
QB = DBO * K1*CKNL ;COMBINATORIAL OUTPUT OF 
+  DB3 * /QO0*CKNL ;CLOCK MULTIPLIED BY THE 
+  DB1 * K2*CKNL ;DBO-DB3 INPUT WHEN SEL 
+  DB2* DB3 * K2*CKNL ;IS HIGH, AND DB4-DB7 
+  DB2*/DB3 * K3*CKNL ;WHEN SEL IS LOW. 
+  DB1l* DB2 * K3*CKNL ; 
+ /DB1* DB2* DB3* K4*CKNL ; 
+ /DB1* DB2* DB3* K5*CKNL ; 
+  DB4 * K1*CKNH ; 
+  DB7 * /Q0*CKNH ; 
+ DBS * K2*CKNH ; 
+ DB6* DB7 * K2*CKNH ; 
+  DB6*/DB7 * K3*CKNH ; 
+  DB5* DB6 * K3*CKNH ; 
+ /DB5* DB6* DB7* K4*CKNH ; 
+ /DB5* DB6* DB7* K5*CKNH ; 
QB.CMBF = VCC ;ENABLE COMBINATORIAL 
;OUTPUT FUNCTION. 
/USBI := /LSBI :+: SEL*Q3*Q2*Q1*Q0 ; 
LSB := LSBI ; 
/MSBI := /MSBI : 
:+: LSBI*SEL*Q3*Q2*Q1*Q0 ; 
MSB := MSBI : 
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SIMULATION 

TRACE ON CLK QA QB QO Q1 Q2 Q3 
SELOP LSB MSB DAO DA1 
DA2 DA3 DA4 DAS DA6 DA7 
DBO DB1l DB2 DB3 DB4 DB5 
DB6 DB7 


PRLDF /Q0 /Q1 /Q2 /Q3 

SETF /CLK /DAO /DA1 
DBO /DB1 
/DA4 DAS 
DB4 DBS 

FOR J := 0 TO 4 DO 

BEGIN 

IF J = 1 THEN 


SEL /LSBI /MSBI 
/DA2 /DA3 
/DB2 /DB3 
/DA6 /DAT 
/DB6 /DB7 


BEGIN SETF DA2 DB2 DA6 DB6 


END 
IF J = 2 THEN 


BEGIN SETF /DA2 DA3 /DB2 DB3 
/DA6 DA7 /DB6 DB7 


END 
IF J = 3 THEN 


BEGIN SETF DA2 DB2 DA6 DA6 


END 
FOR I := 1 TO 32 DO 
BEGIN | 

SETF CLK 

SETF /CLK 
END 
END 
TRACE_OFF 


;SIMULATION SECTION. 
;TRACE ESSENTIAL 
; WAVEFORMS. 


. 
’ 


;PRELOAD INITIAL VALUES. 
;SET INPUT DAO-DA3 0. 
;SET INPUT DBO-DB3 Ls 
;SET INPUT DA4-DA7 2. 
;SET INPUT DB4-DB7 3s 


;GENERATE FOUR LOOPS. 
;ON SECOND LOOP SET 
;DAO-DA3 = 4, DBO-DB3 
;DA4-DA7 = 6, DB4-DA7 
;ON THIRD LOOP SET 

; DAO-DA3 8, DBO-DB3 
;DA4-DA7 10, DB4-DB7 = 1l. 
;ON FOURTH LOOP SET 
;DAO-DA3 = 12, DBO-DB3 
;DA4-DA7 = 14, DB4-DB7 


e 
, 


uN oul 
\o ~~ 
= e 


tow 
~ 
Ww 
= 


;ON EACH LOOP APPLY 

;32 CLOCK PULSES. 

;OUTPUTS QA AND QB 

;WILL GENERATE THE 

;CLK DIVIDED BY 16 MULTIPLIED 
;BY THE BINARY CODE AT THE 
;SELECTED INPUT. 
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It is often necessary to interface analog inputs into a digital sys- 
tem. In digital instruments the signal being measured is usually 
an analog level. Transducers are used to provide a linear relation- 
ship between a measured input signal and a voltage output. An 
example of a transducer would be a strain gauge. In this case the 
amplitude of the voltage output would be a measure of mechani- 
cal stress. If the measuring device were microprocessor-based 
or minicomputer-based, the voltage level would have to be 
converted to adigital binary code before it could be read and proc- 
essed. Ananalog to digital converter would be required to accom- 
plish this task. 


Conversion Techniques 


There are a number of techniques used to provide analog to 
digital (A to D) conversion. The slowest methods of A to D 
conversion are based on counters, and would probably be used 
in instruments such as multimeters. An example would be the 
self-calibrating Dual Slope Integrator. The fastest Ato D convert- 
ers are based on look-up tables and might be used in video 
applications. In the case of a Flash Converter high conversion 
speed is obtained at the expense of a very large number of input 
comparator circuits. An eight-bit digital output code would require 
2°-{ comparator circuits at the input. The comparator outputs 
would drive a look-up table from which the closest binary weight- 
ing to the analog input would be referenced. Techniques to 
simplify the input circuitry would reduce the number of compara- 
tors required. A Half Flash converter would accomplish this but 
with some degradation in conversion speed. 


Successive Approximation 


A medium to high conversion speed can be realized with the 
Successive Approximation technique of conversion, and many 
circuits exist to perform this type of conversion. Figure 1 shows 
a diagram of an Ato D converter that uses a PAL20RS10 as a 
Successive Approximation Register. The advantage of using the 
successive approximation technique of conversion is that the 
circuit is relatively simple when using a PALdevice, and the cycle 
of conversion is constant. Conversion using counters is no more 
simple, and does not give aconstant conversion time because the 
time required to produce the closest possible binary output is 
dependent on the amplitude of the input signal. 


The PAL20RS10 has been programmed as a state machine to 
perform the approximation cycle. The Design Specification title 
SAR8 (page 2-300) refers to the eight-bit Successive Approxima- 
tion Register. To provide more bits of resolution, a fourteen-bit 
SAR14 has been programmed into a PAL32R16 (page 2-303). 


Quantization Error 
Why would a system require additional bits of data in the SAR 


circuit? Suppose a two-digit code is used to represent a dynamic 
voltage range of 0 V to 9 V. A linear mapping would be: 


snanvoone [| [® | a” 





EIGHT BIT DIGITAL OUTPUT 


SYSTEM CLOCK INPUT CLK 
CMP 
POL 


PAL20RS10 


601 01 


CLK 
OCTAL REGISTER 







COMPARATOR 
CIRCUIT 





ANALOG INPUT 









DTOA 
CONVERTER 


Figure 1. Eight-Bit Successive Approximation Analog to Digital Converter 
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For a two-digit code there exist 2? codes which represent four 
discrete analog voltage levels. The difference between each 
voltage levelis 3.0 V. Inthis simple example there are not enough 
binary bits to represent any voltages between each separate 
level. The difference between the actual value and the approxi- 
mation is the quantization error. The resolution can be increased 
and quantization error reduced by adding more bits to encode the 
data. With a three-digit code resolution improves: 


BINARY CODE 


VOLTAGE LEVELS (VOLTS) 





However, there remain voltage levels that cannot be accurately 
represented. In general, if we assume an input dynamic range 
(Vdr) is to be encoded into equal discrete levels (Vq) by anumber 
(n) of bits of binary data, the relationship becomes: 


Vdr/(2"- 1) = Va. 


If adynamic range of 10 V were represented by eight bits of digital 
data the input voltage level could be resolved to an accuracy of 
approximately 


10/(28- 1) = approximately 40 mV. 


For some systems this might be accurate enough. To decrease 
quantization error even further the PAL32R16 has been pro- 
grammed to perform successive approximation with 14 bits of 
binary information. With the given figures the resolution is 


10/(2'4- 1) = approximately 0.6 mV. 
PAL20RS10 Implementation 


The design shown in Figure 1 is an eight-bit Ato D system. The 
PAL20RS10 has a clock input which synchronously clocks the 
state machine programmed into the PAL device. The Q,-Q, 
outputs will contain the closest digital binary weighting to the 
analog input (to a resolution of 40 mV for a 10-volt range) after 
each approximation cycle. Q, is the most significant bit. 


The CC signal is an active-HIGH Conversion Complete signal. 
CC can be converted to provide an additional bit of code to make 
anine-bit SAR if increased system resolution were required. The 
EOC output is an End Of Count or Conversion signal. 


The input CMP is driven by the output of a very high tolerance 
comparator circuit. The TTL level output is fed to the CMP input 
and clocked into the selected register during each compare 
operation. The POL input is hard-wired HIGH or LOW; this 
influences the polarity of the CMP input. The CMP input can be 
either active-HIGH or LOW as selected by the POL control. 


The EN inputis the ENable control. If HIGH the SAR is configured 





for eight bits, and if LOW CC becomes an additional register to 
create a nine-bit SAR. The registers increase in significance by 
one and CC becomes Q,. 


An active-LOW RESET input has been added to the design so a 
synchronous reset may be applied to the circuit. When LOW, the 
clock input will set Q, HIGH, CC, EOC, and Q,-Q, LOW, shown 
in Figure 2 att,. 


The SAR output (Q, HIGH and Q,-Q, LOW) provides a binary 
input to the D to Aconverter, generating a corresponding analog 
signal. The comparator will compare this output with the current 
value of the analog input. If voltage level of the converted signal 
is greater than the current level of the analog input the binary 
weighting of the SAR contents is too large. The feedback from the 
comparator output to the CMP input will cause the contents of Q, 
to be RESET on the next clock rising edge, t, in Figure 2. If the 
level of the converted signal is less than the input, the binary 
weighting of the contents of the SAR is not large enough. The 
comparator feedback will cause the Q, register contents to 
remain SET after the arrival of the clock rising edge t,. Simulta- 
neously Q, willbe SET HIGH to perform the next lesser significant 
approximation. 


Att, the result of the next CMP input is registered at Q, while Q, 
is SET for the next lesser significant compare. The entire cycle 
is completed with the contents of Q-Q, containing the closest 
approximation of an eight-bit digital code to the current analog 
input. The end of one complete cycle finishes at t,, and starts 
again at t,,. 


After the CMP input has been loaded into a register, the contents 
of that register must be held for the duration of the approximation 
cycle. When the digital code is set up in the register file of the 
PAL20RS10 it may be synchronously loaded into an octal re- 
gister, as shown in Figure 1. CC will provide an enabling HIGH 
(Figure 2) and the digital data is registered at time t,, in the cycle. 
The input to the comparator should be held constant during the 
conversion cycle inaSAMPLE and HOLD circuit. The EOC signal 
may be used as an interrupt input to the host processor, which can 
read the data over an eight-bit data bus by enabling the output of 
the octal register. 


The PAL20RS10 was selected because some of the outputs 
require more than the eight product terms available in standard 
PAL devices. The PAL20RS10 features product term steering, 
whereby sixteen product terms are shared between two outputs. 
To fit in the device, the outputs are arranged such that an output 
using more than eight product terms is adjacent to an output using 
less than eight product terms. 


The oscilloscope trace (Figure 3) shows one complete cycle of 
successive approximation. A PAL20RS10 device was pro- 
grammed as an SAR and configured with an AD558 D to A 
converter and a comparator circuit. The top trace shows the 
output of the comparator and the bottom trace shows the output 
of the D to Aconverter. Initially Q, is set HIGH, and the CMP 
feedback holds the Q, contents HIGH as Q, is set HIGH for the 
next compare. Q, is SET HIGH as a result of the next compare. 
When Q, is set HIGH the D to A output is greater than the input 
voltage level so the content of Q, is reset LOW. When the cycle 
is complete the result HHLH HLLH is the closest binary weighting 
to the level of the analog input. 
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Figure 2. Eight-Bit Successive Approximation Register Output Waveforms 
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Figure 3. 
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;EIGHT BIT SUCCESSIVE APPROXIMATION REGISTER 
;WITH CONVERSION COMPLETE AND END OF CONVERSION. 


TITLE SAR 8 

PATTERN 01 

REVISION 01 

AUTHOR CHRIS JAY 
COMPANY MMI SANTA CLARA 
DATE 29 JANUARY 1985 


CHIP SAR_8 PAL20RS10 


;PIN 
;PIN 
;PIN 


;PIN 


1 2 3 4 
CLK /RST CMP POL 
7 8 9 10 
NC NC NC _ NC 
13 14 15 16 
/OE EOC cc Q3 
19 20 21 22 
Ql Q5 Q0 Q6 


EQUATIONS 


/Q7 


Q6 


Qo 


Q5 


Ql 


+++ tt 


+ 


+ 


teeter te 


+ 


t++eeeet 


++ et 


/RST*Q7 *CMP* /Q6*/Q5*/Q4*/Q3 
*/Q2*/Q1*/Q0* /CC*/EOC*POL 
/RST*Q7* /CMP* /Q6*/Q5*/Q4 

* /Q3*/Q2*/Q1*/Q0*/CC*/EOC* /POL 
/RST*/Q7*/EOC 


/RST*Q7* /Q6*/Q5*/04 

* /Q3*/Q2* /Q1*/Q0*/CC* /EOC 
/RST*Q6* /CMP*/Q5*/Q4 

* /Q3* /Q2*/Q1*/Q0*/CC* /EOC* POL 
/RST*Q6*CMP*/Q5*/Q4*/Q3 

* /Q2*/Q1*/Q0*/CC*/EOC*/POL 
/RST*Q6*Q5*/EOC 
/RST*Q6*Q4*/EOC 
/RST*Q6*Q3*/EOC 
/RST*Q6*Q2* /EOC 
/RST*Q6*Q1*/EOC 
/RST*Q6*Q0* /EOC 
/RST*Q6*CC*/EOC 


/RSTRQ1* /Q0* /CC*/EOC 
/RST*QO* /CMP* /CC* /EOC*POL 
/RST*QO*CMP* /CC* /EOC* /POL 
/RST*QO0*CC*/EOC 


/RST*Q6* /05*/Q44/Q3 

* /Q2*/Q1*/Q0*/CC* /EOC 
/RST*Q5*/CMP*/Q4*/Q3 
*/Q2*/Q1*/Q0*/CC*/EOC*POL 
/RST*Q5*CMP*/Q4*/Q3 

* /Q2*/Q1*/Q0*/CC*/EOC* /POL 
/RST*Q5*Q4*/EOC 
/RST*Q5*Q3*/EOC 
/RST*Q5*Q2*/EOC 
/RST*QS*Q1*/EOC 
/RST*Q5*Q0*/EOC 
/RST*Q5*CC*/EOC 


/RST*Q2*/Q1* /Q0*/CC* /EOC 
/RST*Q1*/CMP* /Q0* /CC* /EOC* POL 
/RST*Q1*CMP* /Q0* /CC*/EOC*/POL 
/RST*Q1*Q0*/EOC 
/RST*Q1*CC*/EOC 





5 6 
NC NC 
ll 12 
EN GND 
17 18 
Q2 Q4 
23 24 
Q7 vcc 


;ACTIVE LOW RESET WILL SET 
;Q7 HIGH. THE CMP OR /CMP 
; INPUT IS REGISTERED 
;DEPENDING ON POL INPUT. 
707 IS HELD UNTIL THE END 
;OF CONVERSION. 

;ACTIVE LOW RESET WILL 
;RESET Q6 LOW. Q6 GOES 
7;HIGH AFTER Q7 AND THE 
;CMP OR /CMP INPUT IS 
;REGISTERED.POL SELECTS 
;CMP POLARITY. THE DATA 
;REGISTERED IN Q6 IS 
;HELD THROUGH THE WHOLE 
*;APPROXIMATION CYCLE Q5 
;DOWN TO CONVERSION 
;COMPLETE. THE EOC LOW 
;SIGNAL, END OF CONVER- 
7-SION CLEARS THE 
;REGISTERS. 

;SET REGISTER QO 
;REGISTER CMP 

;OR /CMP INPUT 

;HOLD QO DATA 

*;SET REGISTER Q5 

;AFTER Q6. DURING 
;COMPARE CYCLE. 

;REGISTER CMP 

7;OR /CMP INPUT 


;HOLD Q5 DATA 
;DURING THE 

; APPROXIMATION 
;CYCLE FROM Q4 
7TO CONVERSION 
; COMPLETE. 


;SET REGISTER Ql HIGH 
;AFTER Q2 REG CMP OR /CMP 
; INPUT SELECTED BY POL 
;HOLD Q1 DATA FOR 

;REST OF CONVERSION 


, 
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Q4 := /RST*Q5*/Q4*/Q3*/Q2 
*/Q1*/Q0*/CC*/EOC 

+ /RST*Q4*/CMP*/Q3*/Q2 
*/Q1*/Q0*/CC*/EOC*POL 
/RST*Q4*CMP* /Q3*/Q2 
*/Q1*/Q0*/CC*/EOC*/POL 
/RST*Q4*Q3*/EOC 
/RST*Q4*Q2*/EOC 
/RST*Q4*Q1*/EOC 
/RST*Q4*Q0*/EOC 
/RST*Q4*CC*/EOC 


+ 


++ ttt 


Q2 /RST*Q3* /Q2* /Q1* /Q0* /CC*/EOC 


oe 


+++ ei 


/RST*Q2*Q1*/EOC 
/RST*Q24*Q0* /EOC 
/RST*Q2*CC*/EOC 


Q3 


a 
" 


/RST*Q4*/Q3*/Q2*/Q1*/Q0 

* /CC*/EOC 

/RST*Q3*/CMP* /Q2*/Q1*/Q0 

* /CC*/EOC*POL 

+ /RST*Q3*CMP*/Q2*/Q1*/Q0 
* /CC*/EOC*/POL 

+ /RST*Q3*Q2*/EOC 

+ /RST*Q3*Q1*/EOC 

+ /RST*Q3*00*/EOC 

+ /RST*Q3*CC*/EOC 


+ 


:= /RST*Q0*/CC*/EOC 
+ /RST*CC* /CMP* /EOC* /EN* POL 
+ /RST*CC*CMP* /EOC*/EN*/POL 


EOC := /RST*CC*/EOC 


SIMULATION 


TRACE _ON CLK /RST EN CMP POL 
Q7 06 Q5 04 Q3 Q2 Ql QO CC EOC 


SETF OE /CLK RST EN CMP /POL 
CLOCKF 

SETF /RST 

CLOCKF 

FOR I := 1 TO 9 DO 
BEGIN CLOCKF 

END 

SETF /CMP 

FOR J := 1 TO 9 DO 
BEGIN CLOCKF CLK 
END 

SETF POL 

FOR K := 1 TO 9 DO 
BEGIN CLOCKF CLK 
SETF CMP 

FOR L := 1 TO 9 DO 
BEGIN CLOCKF CLK 
END 

CLOCKF 

END 

TRACE OFF 


;SET REGISTER Q4 
;AFTER Q5 THEN 
;COMPARE CYCLE 
;REGISTER CMP OR 
;/CMP INPUT. 


;HOLD Q4 DATA 

;DURING THE 

; APPROXIMATION 

;CYCLE OF Q3 

;DOWN TO CC 

;RESET REGISTER Q3 
;SET REGISTER Q2 HIGH 


/RST*Q2*/CMP* /Q1*/Q0*/CC*/EOC*POL;AFTER Q3 REG CMP OR 
/RST*Q2*CMP*/Q1*/Q0*/CC*/EOC8/POL;/CMP INPUT SELECT W/POL 


;HOLD Q2 DATA 

7FOR APPROXIMATION 
;CYCLE TO CC. 
;RESET REGISTER Q3 
;THEN SET Q3. 
;REGISTER THE CMP 
;OR /CMP INPUTS 
;AND HOLD CONTENTS 
;OF Q3 DURING THE 
;REST OF THE APP- 
;~ROXIMATION CYCLE 
;DOWN TO CC 


, 


;RESET CC. CC GOES HIGH 
;AFTER QO TO INDICATE 
;THAT CONVERSION IS NOW 
;COMPLETE, IF EN IS LOW 
7;THEN CC CAN PROVIDE ONE 
;ADDITIONAL BIT OF 
;RESOLUTION. 


;END OF COUNT 
;GOES HIGH TO 
;RESET THE SAR. 


;SET INITIAL CONDITIONS 
;AND RESET Q6 - QO, CC 
;AND EOC, SET Q7 

;FOR NINE CLOCK CYCLES 
;PERFORM ONE PASS OF 
;SUCCESSIVE APPROXIMATION 
;WITH CMP HIGH AND POL 
;LOW. PERFORM SECOND 
;PASS WITH CMP LOW 


:SET POLARITY PIN ACTIVE 
;TEST SUCCESSIVE CYCLE OF 
;APPROXIMATION WITH THE 

; INVERSION INPUT ACTIVE 
;FOR CMP HIGH AND CMP 

; LOW 


, 


Se se se we 
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;FOURTEEN BIT SUCCESSIVE APPROXIMATION REGISTER 


TITLE SAR14 

PATTERN 01 

REVISION 01 

AUTHOR CHRIS JAY 
COMPANY MMI SANTA CLARA 
DATE 29 JANUARY 1985 


:THE PAL32R16 HAS BEEN DESIGNED AS A FOURTEEN BIT SUCCESSIVE 

; APPROXIMATION REGISTER FOR MEDIUM TO HIGH SPEED ANALOG TO 
;DIGITAL CONVERSION. THE FOURTEEN OUTPUTS ARE CONNECTED TO 

;THE INPUTS OF A FOURTEEN BIT DAC, WITH Q13 AS THE MOST SIG- 
;NIFICANT BIT DOWN TO QO, WHICH IS THE LEAST SIGNIFICANT 

;BIT. THE CC OUTPUT IS THE CONVERSION COMPLETE SIGNAL, WHEN 
;HIGH IT INDICATES THAT THE CONTENTS OF THE FOURTEEN BIT REGISTER 
;HOLDS THE CLOSEST BINARY WEIGHTING TO THE CURRENT VERSION OF 
;THE ANALOG SIGNAL UNDERGOING CONVERSION. THE DAC OUTPUT AND 
;THE ANALOG INPUTS ARE FED TO A HIGH QUALITY COMPARATOR CIRCUIT, 
;THE OUTPUT OF WHICH IS A TTL LEVEL THAT DRIVES THE CMP COMPARE 
; INPUT OF THE PAL32R16. 





CHIP SAR14 PAL32R16 


;PIN af 2 3 4 5 6 7 8 
Q13 NC Ql2 cc /OE1 /RST CMP NC 

;PIN 9 10 71 12 13 14 15 16 
NC vcc NC NC NC NC /PLD2. /CLK2 

;PIN 17 18 19 20 21 22 23 24 
Qll Qo Q10 Ql Q9 Q2 Q8 Q3 

;PIN 25 26 27 28 29 30 31 32 
/OE2 NC NC NC NC GND NC NC 

;PIN 33 34 35 36 37 38 39 40 
NC NC 7PLD1 /CLK1 Q7 Q4 Q6 Q5 

STRING H13 'Q13*/CC' ;HOLD EQUATION FOR Q13 

STRING H12 'Q12*/CC*/RST' sHOLD EQUATION FOR Q12 

STRING H11 'Q11*/CC*/RST' ;HOLD EQUATION FOR Q11l 

STRING H10 'Q10*/CC*/RST' ;HOLD EQUATION FOR Q10 

STRING H9 'Q9*/CC*/RST' ;HOLD EQUATION FOR Q9 

STRING H8 'Q8*/CC*/RST' ;HOLD EQUATION FOR Q8 

STRING H7 'Q7*/CC*/RST' ;HOLD EQUATION FOR Q7 

STRING H6 'Q6*/CC*/RST' ;HOLD EQUATION FOR Q6 

STRING H5 'Q5*/CC*/RST' ;HOLD EQUATION FOR Q5 

STRING H4 'Q4*/CC*/RST' ;HOLD EQUATION FOR Q4 

STRING H3 'Q3*/CC*/RST' ;HOLD EQUATION FOR Q3 

STRING H2 'Q2*/CC*/RST' ;HOLD EQUATION FOR Q2 
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EQUATIONS 


Q13 : 


+ 
+ 
+ 
+ 
+ 
Ql2 := 
+ 
+ 
+ 
+ 
+ 
Qll := 
+ 
+ 
+ 
+ 
+ 
Q10 := 
+ 
+ 
+ 
+ 
+ 
Q9 3s= 
+ 
+ 
+ 
+ 
+ 
Q8 3= 
+ 
+ 
+ 
+ 
Q7 := 


+ 


++ 


Q1L3*CMP* /Q12*/Q11*/Q10*/Q9%*/08*/07 
*/Q6*/Q5*/Q4*/Q3%*/Q2*/Q1*/Q0*/CC 
H13*Q1]12 + H13*Q11 + H13*Q10 


H13*Q9 + H13*Q8 + H13*Q7 
H13*Q6 + H13*Q5 + H13*Q4 
H13*Q3 + H13*Q2 + H13*Q1 
H13*Q0 + RST + CC 


Q13*/Q12*/Q11*/Q10*/Q9*/Q8* /Q7*/Q6 
* /Q5* /Q4*/Q3*/Q2*/Q1*/Q0*/CC*/RST 
Q12*CMP*/Q11*/Q10*/Q9* /Q8*/Q7*/Q6 
*/Q5* /Q4%/Q3*/Q2*/Q1*/Q0*/CC*/RST 
H12*Q11 + H12*Q10 + H12%*Q9 

H12*Q8 + H12*Q7 + H12*Q6 

H12*Q5 + H12*Q4 + H12*Q3 

H12*Q2 + H12*Q] + H12*Q0 


Q12*/Q11*/Q10*/Q9*/Q8*/Q7*/Q6*/Q5 
*/Q4* /Q3*/Q2*/Q1* /Q0*/CC*/RST 
Q11*CMP*/Q10*/Q9*/Q8*/0Q7*/Q6*/Q5 
*/Q4* /Q3*/Q2*/Q1* /Q0*/CC*/RST 
H11*Q10 + H11*Q9 + H11*Q8 

H11*Q7) + H11*Q6 + H11*Q5 

H11*Q4 + H11*Q3 + H11*Q2 

H11*Q1 + H11*Q0 


Q11*/Q10* /Q9* /Q8*/Q7*/Q6*/Q5*/04 
*/Q3*/Q2*/Q1*/Q0*/CC*/RST 
Q10*CMP*/Q9*/Q8*/Q7*/Q6*/Q5*/Q4 
*/Q3*/Q2*/Q1*/Q0*/CC*/RST 

H10*Q9 + H10*Q8 + H10*Q7 

H10*Q6 + H10*Q5 + H10*Q4 

H10*Q3 + H10*Q2 + H10*Q1 

H10*Q0 


Q10*/Q9*/Q8*/Q7*/Q6* /Q5*/Q4*/Q3 
* /Q2*/Q1*/Q0*/CC*/RST 

Q9*CMP* /Q8* /Q7*/Q6*/Q5*/Q4*/Q3 
*/Q2*/Q1*/Q0*/CC*/RST 

H9*Q8 + H9*Q7 + H9*Q6 

H9*Q5 + H9*Q4 + H9*Q3 

H9*Q2 + H9*Q] 

H9*Q0 


Q9* /Q8 */Q7*/Q6*/Q5*/Q4*/Q3*/Q2 
*/Q1*/Q0*/CC*/RST 
Q8*CMP*/Q7*/Q6*/Q5*/Q4*/03*/Q2 
*/Q1*/Q0*/CC*/RST 

H8*Q7 + H8*Q6 + H8*Q5 

H8*Q4 + H8*Q3 + H8*Q2 

H8*Q1 + H8*Q0 
QB*/Q7*/06*/05*/Q4*/Q3%*/Q2*/Q1 
*/Q0*/CC*/RST 

Q7*CMP* /Q6*/Q5*/Q4* /Q3*/Q2*/Q1 
*/CC*/RST 

H7*Q6 + H7*Q5 + H7*Q4 

H7*Q3 + H7*Q2 + H7*Q1 

H7*Q0 


;REGISTER CMP I/P 


;HOLD RESULT OF COMPARE 
;THE APPROXIMATION CYCLE 
7Q12 TO QO CLEAR ON RESE 
;CYCLE Q12 TO QO 

;AND CONVERSION COMPLETE 


;CLEAR Q12.DURING 

;FIRST APPROXIMATION 
;REGISTER COMPARE 

; INPUT. 

;HOLD CONVERSION RESULT 
;DURING APPROXIMATION CY 
3Q11 TO QO CLEAR ON RESE 
;AND CONVERSION COMPLETE 


;HOLD Q11 LOW 

;BEFORE COMPARE 

;CYCLE. REGISTER 
;COMPARE INPUT. 

;HOLD RESULT OF COMPARE 
;DURING APPROXIMATION 
;CYCLE Q10 TO Q0. RESET 
;AND CC CLEARS REGISTER. 


;HOLD Q10 LOW 
;BEFORE COMPARE 
;CYCLE. REGISTER 
;RESULT OF COMPARE 
;HOLD RESULT ON 

; APPROXIMATION 
;CYCLE Q9 TO QO. 
;RESET AND CC 
;CLEARS REGISTER. 
;HOLD Q9 LOW 
;BEFORE COMPARE. 
;REGISTER RESULT 
7OF COMPARE, 
;HOLD RESULT ON 

; APPROXIMATION 
;CYCLE Q8 TO QO. 
;RESET AND CC 
;CLEARS REGISTER. 
;HOLD Q8 LOW 
;BEFORE COMPARE 
;REGISTER RESULT 
;OF COMPARE. 
;HOLD RESULT ON 

; APPROXIMATION 
7CYCLE Q7 TO QO 


;HOLD Q7 LOW 
;BEFORE COMPARE 
;REGISTER RESULT 
;OF COMPARE. 
;HOLD CONTENTS 
;OF RESULT ON 

7 APPROXIMATION 
;CYCLE Q6 TO QO 
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Q6 Q7*/Q6*/Q5*/Q4*/Q3*/Q2*/Q1*/Q0*/CC*/RST 
Q6 *CMP* /Q5* /Q4* /Q3* /Q2* /Q1*/Q0*/CC*/RST 
H6*Q5 + H6*Q4 + H6*Q3 


H6*Q2 + H6*Ql + H6*Q0 


++4n 


Q6*/Q5*/Q4*/Q3*/Q2*/Q1*/Q0*/CC*/RST 
Q5*CMP* /Q4*/Q3*/Q2*/Q1*/Q0*/CC*/RST 
H5*Q4 + H5*Q3 + H5*Q2 

H5*Q1 + H5*Q0 


Q5* /Q4* /Q3* /Q2* /Q1*/Q0*/CC*/RST 
Q4*CMP* /Q3*/Q2*/Q1*/Q0*/CC*/RST 
H4*Q3 + H4*Q2 + H4*Q1 

H4*Q0 


Q5 


+++ 1 


Q4 


++i 


Q4* /Q3*/Q2*/Q1*/Q0*/CC*/RST 
Q3*CMP*/Q2*/Q1*/Q0*/CC*/RST 
H3*Q2 + H3*Ql + H3*Q0 


Q3 


++ I 


Q3* /Q2* /Q1* /Q0*/CC*/RST 
Q2*CMP*/Q1*/Q0*/CC*/RST 
H2*Ql + H2*Q0 


Q2 


++ 


Q2*/Q1*/Q0*/CC*/RST 
Q1*CMP* /Q0*/CC*/RST 
Q1*Q0*/CC*/RST 


Le) 
par) 


++ it 


Q0 Q1* /Q0*/CC*/RST 


QO*CMP* /CC*/RST 


+ Il 


cc 


Q0*/CC*/RST 


SIMULATION 


TRACE_ON CLK1 CMP RST Q13 Q12 Q11 Q10 Q9 
Q8 Q7 Q6 QO5 Q4 Q3 Q2 Ql Q0 cc 


SETF /CLK1 /CLK2 OE1 OE2 /PLD1 /PLD2 RST /CMP 

CLOCKF CLK1 CLK2 

CHECK Q13 /Q12 /Q11 /Q10 /Q9 /Q8 /Q7 /Q6 /Q5 
/Q4 /Q3 /Q2 /Q1 /Q0 

SETP /RST 

CLOCKF CLK1 CLK2 


FOR I := 1 T0 14 DO 
BEGIN CLOCKF CLK1 CLK2 
END 

SETF CMP 

FOR I := 1 T0 14 DO 
BEGIN CLOCKF CLK1 CLK2 
END 

TRACE_OFF 


;HOLD Q6 LOW BEFORE COMP 


;REGISTER RESULT OF 


;COMAPRE. HOLD RESULT 


;ON APPROXIMATION 
7Q5 - QO. 


;HOLD Q5 LOW BEFORE COMP 
;REGISTER RESULT OF COMP 


;HOLD RESULT ON 

; APPROXIMATION 
;CYCLE Q4 - QO. 
;HOLD Q4 LOW 
;REGISTER COMPARE 
;HOLD RESULT ON 
7Q3 - QO. 


;HOLD Q3 LOW 
7REGISTER COMPARE 
;HOLD RESULT ON 
;CYCLE Q2 - QO. 
;HOLD Q2 LOW. 
;REGISTER COMPARE 
;HOLD RESULT 

;ON Q2 - QO. 
;HOLD Ql LOW. 
;REGISTER COMPARE. 
;HOLD RESULT. 


;HOLD QO LOW. 
;REGISTER COMPARE. 


;HIGH CONVERSION 
; COMPLETE. 


;SET INITIAL 


;CONDITIONS 

; SYNCHRONOUS 
;RESET CHECK 

7 INITIAL REGISTER 
;STATE. 

;APPLY 14 CLOCK 
;PULSES WITH CMP 
7 INPUT LOW. 

3;SET CMP INPUT 
;HIGH AND APPLY 
714 CLOCK PULSES. 


‘ 
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PAL Devices, PROMs, FIFOs, and Multipliers 
Team up to Implement Single-Board 


High-Performance Audio Spectrum Analyzer 


AN-100 





Introduction 


This application note illustrates a high-performance audio spec- 
trum analyzer. This circuit can analyze high-fidelity audio sig- 
nals with a resolution of 20 Hz and an input bandwidth of 20 kHz. 
It is useful in production test, performance evaluation, or 
adjustment of high-fidelity audio equipment. The analyzer pro- 
vides a Sweep generator output for rapid analysis of audio filter 
frequency response. 


The design techniques used to implement the analyzer are quite 
general, and can be applied to a wide variety of DSP tasks. An 
understanding of the approach used will suggest solutions to a 
number of DSP problems. The architecture chosen for the 
spectrum analyzer is controlled by a microprogram stored in 
PROM. Many other applications can be accommodated by 
changing the microprogram. The high performance of this 
architecture provides an attractive price/performance alterna- 
tive to other DSP approaches. 


Spectrum Analyzer Functions 


The spectrum analyzer requires many of the functions com- 
monly used in DSP. Figure 1 shows the analyzer functions. An 
input signal is mixed with a swept audio sinewave oscillator 
(below). 


Frequency Sweep 


The frequency sweep acts as a sampler, starting from DC and 
increasing to its maximum frequency. 


Mixing is accomplished by multiplying the input signal by the 
sinewave. From basic trigonometry: 


cos wit X cos wat = 1/2 cos (wi + w2)t + 1/2 cos (w4 -wa)t (1) 


The mixing process generates two new sinewaves whose fre- 
quencies are the sum and difference of the input sinewave 
frequencies. When the sinewave oscillator matches the fre- 
quency of an input signal component, a DC term is generated in 
proportion to the amplitude of that component: 


cos wjt X cos wit = cos? wit = 1+ 2 cos wit 


The DC term is extracted by a narrow lowpass filter. Due to the 
finite bandwidth of this lowpass filter, mixer output signals 
whose frequencies fall within the filter passband also appear at 
the filter output. As a result, the analyzer output will represent 
the energy contained in a range of frequencies, from the sine- 
wave frequency minus the filter cutoff frequency, to the sine- 
wave frequency plus the filter cutoff frequency. The effective 
bandwidth of the analyzer is twice the lowpass filter bandwidth. 


MIXER 

INPUT OUTPUTS 
SIGNALo-+{ > OVERTICAL 
DC-20 kHz 







OHORIZONTAL 







SINE WAVE 
OSCILLATOR 


SWEEP 
GENERATOR 


Figure 1. Spectrum Analyzer Functions 


DC-20 kHz 


A detector converts the lowpass filter output to a DC voltage 
representing the total energy in the filter passband. If this DC 
voltage is plotted on a vertical axis with the sinewave oscillator 
frequency (represented by the sweep voltage) controlling the 
horizontal axis, a spectrum of the input signal results. 


Other mixing schemes can be used to extract the spectrum. 
However, this “direct conversion” approach has two significant 
advantages. As shown in Figure 2, the sweptoscillator output can 
be used to plot the frequency response of an audio filter. Other 
schemes require additional mixing to achieve the same results. 


OSCILLOSCOPE 


SPECTRUM 
ANALYZER 
HOR 


FILTER 
UNDER TEST 





Figure 2. Filter Test Mode Setup 


The direct conversion scheme confines the frequencies of all 
signals following the mixer to the lowpass filter bandwidth. 
Limiting the signal bandwidth has great benefit when the 
analyzer is implemented digitally. This benefit can be better 
understood with a brief review of DSP theory. 
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Digital Signal Processing 
Theory Review 


Digital signal processing is accomplished by first converting the 
continuous analog input signal to a series of digital numbers. 
The digital numbers are then manipulated to perform the 
required signal processing. The processed digital numbers are 
then converted back to a continuous analog signal, completing 
the processing. The functions required for DSP are shown in 
Figure 3. 



































® ® © 


Figure 3. DSP Functions 

Sampling 

Representing a continuous input signal would require an infinite 
array of digital numbers. A finite collection of digital numbers 
canbe obtained by considering the signal amplitude at discrete, 
periodic points in time. This process is called sampling, and is 
equivalent to multiplying the input signal by a periodic train of 
impulses of unit amplitude. The sampling theorem states that 
the input signal can be reconstructed without distortion if the 
input is bandlimited to contain no frequency components 
greater than half the sampling frequency. The sampling theorem 
means that the discrete samples completely represent the input 
signal, as long as the bandwidth constraint is met. 


Aliasing 
What is really happening during the sampling process? Consider 
the Fourier series representation of a periodic unit impulse train. 
It can be shown that: 
k = 
f(t) = SS) cos (ark fgt), k = 0, 1, 2, 3,... 
oo (2) 
_ 1 

S sample period 
The periodic impulse train is equivalent to a series of sinusoids 
consisting of all harmonics of the sampling frequency, including 


INPUT SIGNAL 


® eee CUTOFF 


= 
aR 


ee ees A eee 
SE 











pee =~-. a SIN X 
“ di ies - x poet eee, 
® an De = 
OUTPUT 
© i ) SIGNAL 
FREQUENCY-+DC {s/2 fs 2-4 3-ts 


Figure 4. Aliasing Spectra of Figure 3 DSP Functions 


ANALOG DIGITAL ———+— ANALOG 
DIGITAL 
INJALIASING SMOOTHING|OUT 
FILTER A/D akc abi FILTER 


a DC term. Recalling Eq. (1) all possible sum and difference 
frequencies will be generated when the impluse train and the 
input signal are multiplied. This process is shown graphically in 
Figure 4. Observe that if the input contains frequencies greater 
than half the sampling frequency, the spectra in Figure 4 will 
overlap. This overlap phenomenon is known as aliasing distor- 
tion, and introduces noise in the signal. 


Another consequence of the sampling process is that high- 
frequency signal components near a harmonic of the sampling 
frequency will be mixed to produce new signal components near 
DC. These new components have frequencies within the desired 
signal passband, but are really “alias” high-frequency compo- 
nents. The phenomenon is called aliasing. 


To eliminate the undesirable effects of aliasing, a continuous 
analog lowpass filter is placed before the sampler. This aliasing 
filter removes frequency components beyond the f,/2 limit. 


Quantizing 


The input samples are converted to a series of digital numbers by 
an analog-to-digital (A/D) converter. The A/D converter operates 
by quantizing the continuous sample amplitude into a finite 
number of amplitude ranges, and then assigning a digital 
number to represent the quantized amplitude value. As might be 
expected, this process introduces noise in the signal, known as 
quantization distortion. The quantization distortion is in the form 
of a “white” or broadband random noise, whose RMS ampli- 
tude is: 
2. 15-2b 
0 = 75 (3) 
where b is the number of bits in the output 
digital word, excluding the sign bit 


The effect of aliasing on quantization noise is to alias high 
frequency noise components to the DC to fs/2 range. The result- 
ing noise spectral density is equivalent to a white noise of ampli- 
tude 02, bandlimited to f,/2. 


Dynamic Range 


The A/D output contains a finite number of bits. Dynamic range 
is defined as the ratio of the maximum-to-minimum signal 
amplitude that can be represented by the digital numbers. 
Dynamic range is determined by the number of bits in the digital 
numbers, and by the noise “floor.” 


For a digital number containing b bits plus a sign bit, the 
dynamic range would be: 


Dynamic range (dB) = 10 10949 2-70 (4) 


The noise floor is the sum of all noise components that can 
appear at the DSP output. The primary noise factors are 
quantization noise and limit cycle noise (to be discussed 
shortly). Digital filtering will affect the noise floor by eliminating 
components of the noise signal. For example, the quantization 
noise at the DSP output is: 

Ng (dB) = 10 logy, [ors (5) 

s 

where BW is the net bandwidth of the digital filters 


The noise components are uncorrelated, and are therefore 
combined by adding the power of each noise component. 
Remember that 


Power (absolute) = logig [ Power (aBy/10 | (6) 
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The resulting dynamic range is: 


10ldg ce 
10 ¥ Noise Power 


where 0.5 = the maximum mean-squared amplitude 


Dynamic range (dB) (7) 


The overall dynamic range is the lesser of the result given by Eq. 
(4) or Eq. (7). in a practical system, the width of the digital 
numbers can vary. The dynamic range is usually calculated for 
all critical points in a digital system, with the overall dynamic 
range being the worst case value. 


Digital Processing 


The digital numbers from the A/D converter are manipulated to 
process the signal. Carrier generation, filtering, and nonlinear 
operations are performed by appropriate “number crunching”. 


Generation of sinusoidal carriers is easily accomplished using a 
linear ramp function (digital up/down counter) and converting 
the results to sinusoidal samples using ROM lookup tables. 
Alternately, recursive equations can produce the desired carriers. 


Nonlinear operations on the digital numbers must be handled 
with care. Since aliasing is always present in the sampled 
domain, harmonics generated by nonlinear operations can alias 
to lower frequencies. The aliasing occurs “immediately,” since it 
can be shown that performing a nonlinear operation in the 
sampled domain is equivalent to first performing the nonlinear 
operation on a continuous signal and then sampling the result 
without bandlimiting the sampler input. 


The sampling rate can be changed to improve the efficiency of 
the digital processing. For example, discarding every other 
digital number would reduce the effective sampling rate by a 
factor of two. If the processing at the higher sample rate 
includes digital aliasing filters to remove components greater 
than half the lower sample rate, the requirements of the 
sampling theory are still met. The sampling rate can be 
increased by repeating digital sample values. This repetition is 
equivalent to a “sample and hold” operation, and modifies the 
signal spectrum by 

F’ (jw) = F(jw) x S12 (wT/2) 

wT/2 (8) 
where w= 27 X freq 

T= input (longer) sample period 
The effects of changing the sampling rate are best determined 
by plotting the resulting aliasing spectra. 


Digital Filtering 


Digital filtering is accomplished using multiplication, addition, 
and delay. For example, consider the biquadratic filter section 
in Figure 5. If 271 is defined to be a unit sample period delay 
operator, then the input-to-output transfer function of the 
biquadratic section is: 


1+a4z7! +ayz~@ 
1 2 
3 a Rae a (9) 
Te b4z + doz 
The biquadric sections can be cascaded to implement higher- 
order filters. 


The Laplace transform of a unit delay is e7ST | where T is the 
delay period. Remember that z7! represents an inverse opera- 
tor, so that z X z-1 = 1. Thus, 

=0 ST 


Zz where s =a + jw (10) 


XN YN 





X' N-2 
X'N = Xpq -b1X' N-1 ~ b2X’ N-2 
YN = X'wy ta1X’ N-4 + 42 X'N-2 


Figure 5. Digital Biquadratic Filter Section 


Digital filter poles and zeroes (in the z-plane) can be mapped 
into the s-plane to determine the equivalent analog filter 
function, and vice-versa. The digital! filter section of Eq. (9) 
corresponds to an analog biquadratic filter section with, 


2 2 
S”" +a,.W,.StWw 
H(s) = ——2—2__> (11) 


Ss +04W,Stwy? 


However, the periodic nature of the eS! function causes the 
digital filter passband to repeat periodically. The effect is the 
same as aliasing. The analog filter response is mixed with the 
sampling frequency harmonics to generate the true digital filter 
response. 


Designing Digital Filters 


How does one go about designing a digital filter? One approach 
is to perform a least mean squared error optimization using a 
computer. The desired function is specified, and the computer 
adjusts the a, and b, values until the desired response is 
achieved. 


A second approach is to design an equivalent analog filter and 
then convert that design to a digital filter. This approach has 
great merit, since analog filter design theory is well developed. 
However, the digital passband will be distorted if the analog 
equivalent filter has significant response to frequencies greater 
than f,/2. The aliased passbands overlap at that point. 


To circumvent this problem, the analog filter function can be 
modified to compensate for the aliasing effects. The analog 
transfer response is modified using several transforms to 
compensate for aliasing. Unfortunately, the nature of the 
s-plane to z-plane mapping is such that no transform can 
compensate for all aliasing effects without introducing other 
forms of distortion. 


The standard (or impulse invariant) z-transform represents a 
direct mapping to the z-plane. No frequency, amplitude, or 
phase distortion is introduced, but aliasing effects are not 
compensated. This transform should be used when the analog 
filter has negligible response to frequencies greater than fo/2. 


The bilinear z-transform preserves the filter amplitude response 
in the presence of aliasing. However, the bilinear transform 
introduces a distortion or warping of the frequency axis. As a 
result, only the filter cutoff frequency can be accurately trans- 
formed, using a pre-warping technique. Frequencies within the 
filter passband remain warped, introducing phase distortion in 
the digital filter response. The bilinear transform is used when 
the filter amplitude response is more critical than the phase 
response. 
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The matched z-transform preserves the filter phase response at 
the expense of amplitude response distortion. However, this 
amplitude distortion, unlike the aliasing distortion, can be 
corrected by placing additional zeroes in the transfer function. 
The matched transform is used when the filter phase response is 
critical, and either the amplitude response is not critical or the 
additional compensation zeroes can be accommodated. 
Performing the transforms by hand is quite tedious. Fortunately, 
computer programs are widely available which handle the 
complete filter synthesis procedure, including z-transforms and 
pre-warping. 


Limit Cycle Noise 


An effect of using digital numbers with a finite number of bits is 
the generation of quantization noise. When implementing digital 
filters, the quantization noise introduces oscilaltions that are 
analogous to ringing in analog filters. These oscillations are 
called limit cycles. The limit cycle generates a noise which peaks 
at frequencies corresponding to the filter pole frequencies. The 
noise power is roughly proportional to pole Q. Limit cycle noise 
for a second order filter section of Equation (11) is given by: 


2 
2 ,-2b 1+4r 1 
N, (dB) = 10 lo —2 er iaaaiaa 
L( 310 12 1-124 +1-2r cos 2w 





) (12) 


where b = number of digital number bits (excluding sign bit) 
pole freq. = wy 


pole Q = Ia, 
_ »_pole freq. z —w 
=r f r xP (SS oole O epole q) 


The limit cycle noise must be calculated for each complex pole 
pair, and adjusted to reflect the response of subsequent filter 
stages to the limit cycle frequency. Computer programs can 
calculate limit cycle noise power, including all of these considerations. 


Output Signal Reconstruction 


Once manipulation of the digital sample numbers is complete, 
the resulting digital numbers must be converted back to a 


continuous analog signal. Referring back to Fig. 3, a digital-to- 
analog (D/A) converter transforms the digital numbers to a 
series of analog output pulses. 


A sample-and-hold (S&H) circuit eliminates transients that are 
introduced during the D/A conversion process. The spectrum of 
the S&H output is modifed as follows: 


rary t sin (wt/2) 
Sey le cae 


where t = hold time T = sample period 


An output smoothing filter completes the reconstruction by 
removing all components with frequencies greater than f</2. 
The smoothing filter is often optional, depending on the impor- 
tance of removing the high-frequency output components. 


The spectral effects of reconstruction are shown in Figure 4. 


Implementing the Spectrum Analyzer 


The architecture used for the spectrum analyzer is shown in 
Fig. 6. Input signals are digitized and buffered with FIFOs before 
interface with a common 16-bit data bus. The 16-bit arithmetic 
unit (AU) provides multiply and accumulate operations. A 16-bit 
wide RAM stores intermediate results. A 16-bit temporary 
register facilitates Zo delays and data movement. Outputs 
are provided using a D/A converter and S&H circuits. 


The VCO output is buffered using FIFOs to provide a uniform 
high-speed sample rate. The VCO output is 12 bits wide, 
providing a signal-to-quantization noise ratio of 91 dB, using 
Equations (5) and (7). The calculation assumes a 500-Hz band- 
width. Asmoothing filter at the VCO output is not necessary. The 
filter test configuration of Figure 2 allows the input aliasing filter 
to remove the effects of VCO high-frequency components, as 
long as the filter under test is a linear analog circuit. 


The vertical and horizontal outputs are intended to interface an 
oscilloscope or X-Y plotter. The sampling of these outputs can 
be non-uniform, as long as the outputs track each other. The 
elastic storage at the input and VCO interfaces permits arbitrary 
non-uniform processing of the analyzer functions. 


The 16-bit resolution of the internal data word provides 90-dB 
dynamic range according to Equation (4), or 115-dB dynamic 
range according to Equations (5) and (7), assuming 500-Hz 
bandwidth and no limit cycle noise and aliasing. 
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Figure 6. Analyzer Architecture 


Microprogram contro! was selected for the analyzer. PAL devi- 
ces can efficiently implement sequential state machines. It is 
possible to encode all control information in PAL devices, but 
only three packages would be saved (one PROM and two 


With the theoretical background and architecture in mind, the 
spectrum analyzer functions can be defined in detail. The objec- 
tive is to realize a circuit capable of high-resolution analysis of 
audio signals in the DC to 20-kHz range. Selectable bandwidth 





and linear/logarithmic output display are highly desirable. 
Detailed functions are shown in Figure 7. 


buffers). Distributing the control among several PAL devices 
would reduce flexibility and make corrections very difficult. The 
few extra packages required for microprogram control provide 
an extremely flexible architecture and greatly simplify the PAL 
device functions. 








INPUT MIXER ALIASING 
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Figure 7. Detailed Functional Diagram 
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Input Aliasing Filter 


An analog lowpass filter removes high-frequency components 
from the input signals. With a sample frequency of 50 kHz anda 
maximum input frequency of 20 kHz, the lowest aliasing fre- 
quency is (50-20) = 30 kHz. 


An eighth-order Chebychev filter with 0.1-dB passband ripple 
will provide 44 dB of attenuation at 30 kHz, and 86 dB attenuation 
at 50kHz. It is desirable to provide at least 60-dB overall dynamic 
range for high-performace analysis. To eliminate spurious 
responses above the -60-dB “floor,” the input signal should have 
all components above 30 kHz suppressed by at least (60-44) = 
16 dB. Most input signals will meet this requirement. If not, 
additional filtering must be provided. 


S&H and A/D Converter 


The input S&H maintains a constant sampled signal level while 
the A/D conversion is in progress. No sin X/X correction is made 
for this S&H since the net effect of the S&H plus A/D action is an 
impulse sample at the start of the “hold” period. 


The A/D conversion time should be less than 16 ws. The A/D 
converter output digital number should “saturate” when the 
input signal exceeds the maximum level. The digital numbers 
should be in inverted two’s complement form. The S&H 
acquisition time should be less than 4 us. 


For a full 60-dB overall dynamic range, a 12-bit A/D is required. 


Mixer 


The mixer multiplies the A/D output by the swept sinewave 
oscillator value. The multiplication produces sum and difference 
frequencies, according to equation (1). 


Two's complement fractional arithmetic is used throughout the 
analyzer. Multiplication cannot overflow, since all numbers are 
less than 1 in magnitude. 


Swept Sinusoidal Oscillator (VCO) 


A precision swept sinusoid from DC to 20 kHz must be 
generated to mix with the input signal. A technique particularly 
well suited to this application is solving the two equations: 


sin (x+y) = sinx cos y + cos x siny (13) 
cos (x+y) = cos x cos y-sinxsiny (14) 


These two trigonometric identities generate a new sin and cos 
value with y representing the phase shift per sample period. The 
technique is a “CORDIC” algorithm, based on coordinate 
rotation. Exact results are produced, but truncation and round- 
off errors due to the finite digital word length can cause a slow 
change or “drift” of carrier amplitude. Fortunately, the swept 
sinusoid is periodically reset in the spectrum analyzer, arresting 
this amplitude drift. 


The VCO frequency is swept by varying the value of y. However, 
since equations (13) and (14) require sin y and cos y, an identical 
CORDIC algorithm is used to obtain these values. To sweep the 
VCO, then, sind and cos. are placed in RAM, selected by the 
bandwidth setting. These are two fixed numbers originally stored 
in PROM, and represent the frequency shift per sample time. 
Equations (13) and (14) are then applied to calculate sin y and 
cos y, which represent the desired phase shift per sample time. 
Equations (13) and (14) are executed again to generate the 
actual sinusoidal output. 


The calculation of sin y and cos y can take place at a reduced 
sample rate to save processing time. Only the last execution of 
equations (13) and (14) must be performed at the full 50-kHz 
sample rate. 


A linear ramp is generated to provide horizontal drive for an 
oscilloscope or X-Y plotter. The ramp is incremented each time 
the sin y and cos y values are updated, tracking the VCO sweep. 
When the ramp value overflows, the analyzer sweep cycle is re- 
initialized. 





-bo/2 


Figure 8. All-Pole Digital Filter Section 


Digital Filters 


Fig. 8 shows the implementation of the all-pole digital filter 
sections. Because of the low pole Q values in all filters, the 
second order sections can be simply cascaded to implement 
high-order filters. Fig. 8 shows a technique for handling 
coefficients greater than 1 with fractional number representa- 
tion. 


Scaling must be performed to ensure maximum dynamic range. 
Filter sections with high-Q poles will show peaking of signals 
near the pole frequencies. The input to such sections must be 
scaled down to prevent overflow of the arithmetic. For a second- 
order all-pole section, this peaking factor is exactly the Q of the 
poles. Thus, when a given second-order section has a pole Q of 
2, the input signal to that section must be multiplied by 0.5 to 
prevent overflow. When the Q is less than or equal to 1, no 
scaling is performed. 


Saturation arithmetic is not provided in this architecture. Careful 
scaling eliminates the need for saturation arithmetic, since the 
A/D will saturate at a precisely known value. 


Aliasing Filters 


Two 4th-order Chebyschev filters permit reduction of the 
sample rate following the mixer. Each filter provides 0.3 dB 
passband ripple and at least 68 dB attenuation of aliasing 
components. The slightly high passband ripple is acceptable, 
since subsequent filters will dominate the composite passband 
shape. 


The first filter permits a sample rate reduction factor of 16. It is 
designed with a passband cutoff frequency of 479 Hz and a 
sample rate of 50 kHz. Eq. (12) predicts the limit cycle noise for 
this filter to be -58 dB. 
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The second filter permits a second sample rate reduction factor 
of two. Its cutoff frequency is 219 Hz, with a sample rate of 
3.125 kHz. Equation (12) predits the limit cycle noise for this filter 
to be -83 dB. This filter also provides an additional 14-dB 
attenuation of the limit cycle noise generated in the first aliasing 
filter, reducing the limit cycle noise from the first filter to -72 dB. 


These two filters permit an overall f, reduction factor of 32 
before processing the Bessell filter, detector, and linear-to-logo 
functions. This results in a very substantial throughput improve- 
ment. Net execution time is determined by the time to execute a 
given function multiplied by the sample rate for that function. 
Thus 32 instructions at the reduced rate will increase the net 
execution time by an amount equivalent to only 1 instruction at 
the full sample rate. 


Fig. 9 shows the aliasing spectra of the sample rate reduction 
process. 












INPUT FILTER 
MIXER 1ST ALIASING ce 1ST ALIASING 
OUTPUT yo FILTER FILTER 
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1ST ALIASING 
FILTER 
OUTPUT 
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2ND ALIASING 
FILTER 
OUTPUT 

(fs = 1.56 kHZ) 0 
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Figure 9. Aliasing Spectra for f, Reduction 


Lowpass Filter 


A 4th-order Bessel lowpass filter determines the overall band- 
width of the analyzer. The overall bandwidth is twice the 
bandwidth of this filter. Overall bandwidths of 20, 50, 100 or 500 
Hz are provided by loading the proper set of filter coefficients 
into RAM when the bandwidth is selected. 


The Bessel filter provides an optimal transient response for the 
analyzer. Good transient response is important, especially at 
narrow bandwidths, since the spectral peaks are swept with 
respect to the fiter passband. The net effect is similar to pulsing 
the filter input. Because the phase response is critical, the 
matched-z transform is used to convert the analog Bessel design 
to the z-domain. 


The second aliasing filter provides 3 dB of attenuation at 250 Hz. 
When cascaded with the Bessel filter, which also provides 3 dB 
attenuation at 250 Hz in the 500 Hz bandwidth mode, the 
response at this bandwidth is modified. However, since the 
overall bandwidth is relatively broad, good transient response is 
still achieved. Cascading these two filters provides a “transitional” 
filter with a Bessel response at low attenuation and a 
Chebyschev response at high attenuation. At bandwidths less 
than 500 Hz, the combination produces an optimal tradeoff 
between transient response and resolution. 


Analysis of Equation (12) reveals that limit cycle noise increases 
exponentially as the pole frequency is reduced. Operating the 
lowpass filter at the lowest possible sampling frequency (1.5625 
kHz) minimizes limit cycle noise, in addition to improving 
throughput. Limit cycle noise for the lowpass filter will be -95 dB 





mene 
rman 


at the 500-Hz bandwidth, increasing to -67 dB at the 20-Hz 
bandwidth. 


Detector 
A square-law detector provides a DC signal corresponding to 
the energy at the lowpass filter output. From trigonometry: 


A2 
(A cos wt)? a (1+ cos 2wt) (15) 


The detector output contains the desired DC term and a single 
undesired term at frequency 2w. If the square law is ideal (easy 
in the digital domain), no additional terms are produced. The 
elimination of harmonics ensures the accuracy of the detector 
with f,/32 = 1.5625 kHz. The highest component is always less 
than f,/64 with a 250 Hz maximum lowpass filter cutoff 
frequency. However, 2w can be anywhere from DC to 500 Hz as 
the VCO sweeps past the spectral component. 








AVCO = BW/n 
AHARMONIC = 2BW/n 
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Figure 10. Detector Sweep Filtering 


Fig. 10 illustrates a technique to render the effects of the 2w 
terms negligible. The analyzer passband is divided into n equal 
intervals. The VCO sweep rate is controlled so that the VCO 
sweeps BW/n per f,/32 interval. The detector is followed by a 
single-pole lowpass filter with a 3 dB frequency of BW/n. As the 
VCO sweeps a component through the passband, the DC term is 
present in all n intervals, but the 2w term can affect only one 
interval. The worst-case DC error is 1/n for an ideal cutoff, and 
is multiplied by_(29+27-1 +273 +24 +275 +2755 49-6 + 
2-65 +9-7 4277-254...) = 1.85 due to the finite 6 dB/octave 
rolloff of the single-pole filter. Further analysis reveals that: 
fs 1.85 (16) 
109/10 _, 
where e represents the resulting error in dB. For e = 0.1 dB, 
n = 80. 
In the filter test mode, the signal frequency and VCO frequency 
are the same, forcing w = 0. The detector has no error in this 
mode, but has a 3 dB gain due to the second DC term. 
The detector output represents signal energy. Each bit in the 
detector output word thus represents only 3 dB, and 21 bits are 
required to reflect a 60 dB dynamic range. Double precision 
arithmetic is required for the detector ouput and the single-pole 
filter. The 67C7560 multiplier will handle double precision calcu- 
lations with a time penalty. Fortunately, the calculations to be 
performed are simple and the operations take place at the 
minimum sample rate, reducing the impact on throughput. 


Linear-to-Log Conversion 


The architecture of Figure 6 is customized to provide an efficient 
algorithm for linear-to-logarithmic output conversion. The RAM 
address generator monitors the 8 MSBs of the data bus, and can 
provide a number indicating the MSB position of a positive 
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number. This output is used to retrieve a lookup table value. 2) Look up the appropriate scale factor, from 20 to 28. Add 
This value is used to scale the data word to quickly left-justify logy of the scale factor to the output word. The conversion 
the MSB. A second 4-point lookup table is then used to improve is now accurate to 3 dB. 


the accuracy of the resulting logg conversion. A logo con- 


haha ; 3) If the flag was not set during step 1, multiply the data word 
version is adequate, since: 


by the scale factor to left-justify the MSB position. 


logy x 
109 49x ees. (17) 4) If the flag was not set during step 1, retrieve an intercept 


logo10 and slope value from the 8-word lookup table (four pairs 


: available). Perform a linear interpolation using: 
Equation (17) demonstrates that the output can be displayed in ss ) P g 


decibels by setting the oscilloscope or X-Y plotter Y-axis gain to x =axtb 

the proper value. where ais the slope value (18) 
Two lookup tables provide .027 dB accuracy for output values bis the intercept value 

from 0 to —45 dB, and 3 dB accuracy from —45 to -84 dB. The The conversion is now accurate to .027 dB. 


logarithmic accuracy is limited by the 10-bit output word length 


to the D/A. This output can represent 0 to —84 dB in .082 dB 5) Scale the result to provide 84-dB output range with a 

















increments. The accuracy of the 4-point lookup is therefore 10-bit word. 
sufficient. 6) Subtract 2-1 from the output to convert it to two's com- 
The logarithmic conversion procedure is as follows: plement form for the D/A. 
1) If the MSB of the data word is not among the 8 MSBs into the 
address generator, multiply the word by a! = 128, and The calculations are double precision for steps 1, 2, and 3, and 
increment the output number by 7. Repeat until the data single precision thereafter. The conversion sequence can be by- 
word is greater than 27” but no more than three times. passed using a strap option to provide a linear amplitude output 
Set a flag if this step is executed more than once. from 0 to -30 dB. 
20 kHz LOWPASS FILTER INPUT BUF. DATA BUS ARITH. 
4 4 UNIT 
2 3] [4] A/D 67401 74 
Ror Hoe Hse os ese SSE) | 
a: RP oa 16 
74 
TTT LS 1 
OSIG IN 244 
(10V-PK) = 
SHI 8 MHz 
FIR 
FOR 16 
pany ins TEMPORARY REG. 
a: “4 Geellced—t 
e ENR 
32x16 RAM ic. jy 
OP1 (LLC) LL 
OPO (DLC) INSTRUCTION MEM 
OUTPUT 


8.0 MHz XTAL 
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Figure 11. Simplified Schematic Hi-Fi Audio Spectrum Analyzer 
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Control Logic 


Figure 11 shows a simplified schematic of the analyzer. All 
critical components are shown. Bypass capacitors and some 
component input connections are omitted for clarity. 


The microprogram is stored in three 63S281 PROMs. The 
microcode word formats are shown in Figure 12. A wide, highly- 
parallel instruction word ensures maximum flexibility and pro- 
gram efficiency. 


Eight PAL devices interpret the instruction word and control the 
analyzer. Two additional PAL devices generate a 50-kHz strobe 
from the 8-MHz master clock, and implement the output D/A 
multiplexer. The control PAL devices function as follows: 


Sequencer: A PAL20X8 implements an 8-bit instruction sequen- 
cer. The sequencer performs the following operations: 


. = if 
z@2¢~s26 #€2@ B 8 5g 
TYPE 1 
CONSTANT 
AU = a, a a 
INSTR a3 g 


AU LOAD 
CONSTANT 


ENABLE R 
STORE TE 
ADDR EN. 


TYPE 2 


Pa eee tek Meee eckeat ne el tem eel | 
LOAD LOOP CTR 


a AU Ss 2 ™ &_— SEQ 
Ps INSTR zs 3 < CNTRL 
a e- Ww 
2 ao wu «& 2 
2 22 8 3 

z rE < 9 

Ww on” 


TYPE 3 
RAM ADDRESS 
MODIFY 
AU = 2 Oo 5 SEQ 
INSTR = CNTRL 


AU LOAD 
ENABLE RA 
STORE TE 

ADDR ENA 
CONSTA 


x -«~ -«~ xX 
o OO O80 80 


Cx Operation 


"X Increment by 1 (execute next instruction) 
Q Increment by 2 (skip next instruction) 
Q Jump to address 

0 No increment (repeat current instruction) 


+30 0f9 


co 
0 
1 
0 
1 


The CX input conditions the sequencer. Conditional 
branches or skip operations can be implemented. The 
sequencer will increment if the conditional requirement is 
not met. 


Condition detector: A PAL16C1 monitors up to twelve status 
flags, and generates CX. The microcode word includes a 4-bit 


ino wo w vs N ad 


ao a a a aeaai@a 
o Oo 809 809 O09 OO 8 9 


3 
0 


CONSTANT VALUE 


(MSB) 


CONDITION JUMP/LOAD 


A 
VALUE 2 


(MSB) 
LOAD LC 


TEST STRAPS heii DECREMENT LC «@ 


CONDX EXEC 
JUMP/LOAD LC 


CONDITION o 0 @ ADDR @ ADDR 
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Qa = = 
lu < =. ~ 
< oO 
a = 
z Bon, f 
oS 
O Dd 
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< N @ 
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Figure 12. Microinstruction Word Formats 
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condition word, CX0O through CX. Cx will be zero under the 
following conditions: 


CX3 CX2 CX1 CXO Condition for CX = 0 


0 0 0 0 Always (unconditional operation) 
0 0 0 1 Sample strobe (SS) = 1 

0 0 1 O AU overflow (OVF) = 1 

0 0 1 1 AU busy (AUB) = 1 

0 1 0 0 Input sample ready (INR) = 0 

O 1 0 1 Loop counter timeout (LLC) = 0 
oOo 1 1 #O [LC =1 

Oo. 64 1 1 Address control (AC3-ACO) = 0 

1 0 0 0 ACO=0 

1 Oo 0 1 ACO=1 

1 0 1 Q AC1=0 

1 0 1 1 AC1=1 

1 1 0 O AC2=0 

1 1 0 1 AC2=1 

1 1 1 0 AC3=0 

1 1 1 1 AC3=1 


The assignment is made using the flexible PAL device coding, 
and is optimized for the analyzer. The user can select a different 
set of conditions by reprogramming the PAL device. 


When the microcode represents a constant (Type 1 microinstruc- 
tion — see Figure 12) the CON input forces CX = 1 to suppress 
conditional operations. CX is also used to suppress certain 
strobes in the analyzer, providing conditional arithmetic 
operations. 





Loop counter: A PAL16R6 implements a 6-bit programmable 
down counter. This counter controls iteration loops and provides 
a timeout signal to the condition detector. The counter is preset 
via a Type 2 microinstruction, and can be decremented by other 
Type 2 microinstructions. The counter will halt when zero count 
is reached. Up to 64 iterations can be accommodated with 
minimal overhead. 


Address control: A PAL22V10 provides indexed addressing for 
the 32x16 RAM, and analyzes the eight MSBs of the data bus for 
conditional operations. If D15 represents the data bus sign bit, 
then OP1-OP3 will provide the following functions: 


OP3 OP2 OP1 AC3-ACO Output Function 
Clear (0000) 


Increment 

Decrement 

Preset to D15-D12 (Sign + 3 MSB) 
Preset to D14-D11 (4 MSB) 

Preset to D11-D8 (Address load) 
MSB position 

No change 


44440000 
++00++00 
+o+o+0-0 


The ADE input enables a change in the address word. The 
address word will not change if ADE = 1. 


The MSB position function indicates the position of the MSB for 
positive numbers. AC3 represents sign bit D15. This output 
should be zero. AC2-AC1 represent the position of the first 1 
following the sign bit. Code 0000 indicates that D15-D8 are all 0. 


Input/RAM control: Miscellaneous FIFO input and RAM control 
is provided by a PAL 10L8. The 67401 FIFO includes input ready 
(FIR) and output ready (FOR) signals, which are latched using 
the input/output shift clocks to generate two flags. The first flag 
(FR) resets the FIFO when input ready (latched) goes low, 
indicating the FIFO capacity is exhausted. The latched output 
ready signal flag represents input sample ready (INR). The INR 


flag is used as a sequencer condition to synchronize wait loops. 
Use of the FR and INR flags maintains proper fill of the FIFO. 


The RAM address LSB (AO) and read-write line (R/W) are 
decoded and latched. These signals are provided directly by 
Type IV microinstructions. 


Notice that a clocked register function requires two PAL 
combinatorial outputs per bit, while a transparent latch function 
requires only one PAL output per bit. 


Arithmetic unit control: The variety of functions listed in Table 3 
indicate the utility of the arithmetic unit (AU). A PAL16R6 pro- 
vides simplified control of the AU. 


The PAL devices and AU load signal provide conditional arith- 
metic operations. Gating the load input will suppress the start of 
anew arithmetic operation. When CXE is high, the operation is 
performed unconditionally. When CXE is low, the operation is 
performed only if CX is low. Combining conditional jumps and 
conditional AU operations provides a high degree of program 
flexibility. 


The PAL device monitors the AU instructions and generates a 
busy signal (AUB). A counter in the PAL device keeps track of 
variable-length operations to provide the correct output for any 
instruction sequence. The AUB signal conditions the sequencer 
to synchronize the microprogram to the AU operation. Microp- 
rogramming is simplified as a result. 


The PAL device also gates the input FIFO shift out clock (INS) to 
eliminate transients while providing a full 125-ns pulse for proper 
FIFO operation. 


Data strobe generator: A PAL10L8 provides a number of 
transient-free, gated strobes. These strobes provide control of the 
analyzer data flow. The PAL device interprets the microinstruc- 
tion to determine the proper microinstruction type, and gener- 
ates the strobes accordingly. 


The PAL device also generates an 8-MHz buffered clock, as 
shown in Figure 11. The crystal oscillator circuit provides inde- 
pendent AC and DC feedback, permitting reliable operation with 
the PAL device. 


Strap/output sample control: A PAL16L8 generates additional 
control strobes for the output sample-and-hold circuits. 


The PAL device also provides a tristate buffer function, connect- 
ing control straps to the data bus for certain conditional jump 
operations. Two straps select the desired analyzer bandwidth/ 
sweep rate, and the third strap selects linear or logarithmic 
output. 

Signal output 


The VCO output must be sampled at precise intervals to avoid 
phase modulation effects. Three 67401 FIFOs buffer the VCO 
samples, which are generated during the 50-kHz input process- 
ing. A 12-bit D/A converter provides better than 91 dB signal-to- 
distortion ratio. The S&H circuit provides VCO outputs at pre- 
cise 50-kHz intervals, and removes spikes that are generated in 
the D/A converter. All necessary control signals are generated 
by the strap/output data control PAL device. 


The horizontal! and vertical outputs normally drive an X-Y plotter or 
oscilloscope. There is no need to buffer these signals as long as 
the two outputs track each other. The D/A used for the VCO 
output is shared by adding two PAL12H6 chips programmed as 
multiplexers. Use of PAL devices requires fewer packages than a 
TTL multiplexer. Additional S&H circuits decode the multiplexed 
D/A output to separate the output signals. 
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Microprogram 


The architecture can implement a variety of DSP functions. A 
microprogram, stored in 63S281 PROMs, customizes the archi- 
tecture to perform the spectrum analyzer tasks. The microin- 
struction formats were summarized in Figure 12. The algorithms 
to be implemented were discussed in the previous section. The 
step-by-step implementation of these algorithms is converted to 
asequence of microinstructions to form the microprogram. The 
procedure is analogous to programming a microprocessor. 


Operation of the microprogram is better understood by consid- 
ering the allocation of the 74S218 RAM locations, shown in 
Figure 13. The microprogram consists of two parts. High-speed 
input processing provides the carrier generation, mixing, alias- 
ing filter and lowpass filter functions. Figure 13a shows the RAM 
allocation during input processing. The input segment includes 
an efficient iteration loop, using the PAL device loop counter, to 
process the 50-kHz functions. The carrier frequency shift and 
lowpass filter functions are processed at the fs/32 reduced sam- 
ple rate for maximum throughput efficiency. 


The values of sin A, cos A, and the Bessell filter coefficients 
depend on the analyzer bandwidth strap selection. These values 
are stored in a “table” area in Fig. 13, and can be easily changed. 
The fixed aliasing filter coefficients are stored as constants in the 
microprogram itself. 


{A4-A1) 





yy Ty Fm [mem] 


Once the input processing is complete, coefficients located in 
the table area can be changed. This area is re-used by the output 
program segment to hold the scale factors for the linear-to-log 
conversion routine. The detector functions are processed, and 
the logarithmic conversion is started with the RAM allocation of 
Figure 13(b). The table area is then reloaded with the interpola- 
tion coefficients (Figure 13(c)) to complete the logarithmic con- 
version. Shaded areas in Figure 13 provide temporary data and 
flag storage for the routines. 


The microprogram samples the strap settings and loads the table 
area with the appropriate coefficients for input processing. The 
detector filter coefficient (b;) is also determined and loaded. 
The input processing is then repeated. This sequence repeats 
indefinitely. The coefficient loading technique makes efficient 
use of RAM capacity while eliminating elaborate jump 
sequences. All coefficient table updates are processed at the 
minimum sample rate for best efficiency. 


The PAL device controllers simplify the microprogram. A PAL 
device provides hardware iteration loops. The AU controller 
eliminates wasteful “NO-OP” instructions otherwise required to 
allow completion of AU operations. The input control PAL 
device simplifies handshaking with the input logic. With the 
benefit of the PAL device controllers, the analyzer microprogram 
easily fits into the 256-instruction capacity of the PROMs. 
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Figure 13. RAM Allocation 
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TITLE AN-100 DSP Counter 
PATTERN DSPCOUNT 

REVISION 1 

AUTHOR Marc Baker 

COMPANY Monolithic Memories 
DATE August 20, 1987 


CHIP DSPCOUNT PAL20X8 


CLK NC OP3 OP4 OP5 OP6 OP7 CO Cl /CX NC 


/OE NC A7 A6 AS A4 A3 A2 Al AO 


EQUATIONS 
LOAD = /CO* C1* CX ;FED BACK TO EQUATIONS 
/A0 := /LOAD*/A0 :HOLD/INCREMENT BY 2 

+ /CO* C1l* CX ;LOAD 0 

2+: /CO*/Cl ; INCREMENT BY 1 
7Al := /LOAD*/A1 ; HOLD 

+ /CO* C1* CX ;LOAD 0 

s+: /CO*/C1* AO ; INCREMENT BY 1 

+ CO*/C1* CX*/A0 ; INCREMENT BY 2 
/A2 := /LOAD*/A2 ;HOLD 

+ /CO* C1l* CX ;LOAD 0 

s+: /CO*/C1* Al* AO ; INCREMENT BY 1 

+ CO*/C1* CX* A1l*/A0 ; INCREMENT BY 2 
/A3 := /LOAD*/A3 ; HOLD 

+ /CO* Cl* CX*/OP3 ; LOAD 

s+: /CO*/C1* A2* Al* AO s INCREMENT BY 1 

+ CO*/Cl1* CX* A2* Al*/A0 ; INCREMENT BY 2 
/B4 := /LOAD*/A4 ; HOLD 

+ /CO* Cl* CX*/OP4 ; LOAD 

s+: /CO*/C1* A3* A2* Al* AO ; INCREMENT BY 1 

+ CO*/C1l* CX* A3* A2* A1l*/A0 ; INCREMENT BY 2 
7AS := /LOAD*/A5 ; HOLD 

+ /CO* Cl* CX*/OP5 ; LOAD 

2+: /CO*/C1* A4* A3* A2* Al* AO ; INCREMENT BY 1 

+ CO*/C1* CxX* A4* A3* A2* A1l*/AO ; INCREMENT BY 2 
/A6 := /LOAD*/A6 ; HOLD 

+ /CO* C1* CX*/OP6 ; LOAD 

2+: /CO*/C1* AS* A4d* A3* A2* Al* AO sINC BY 1 

+  CO*/C1l* CX* AS* A4* A3* A2* Al*/AO ;INC BY 2 
/A7 := /ULOAD*/A7 ; HOLD 

+ /CO* C1* CX*/OP7 ; LOAD 

st: /CO*/C1* A6* A5* A4* A3Z* A2* Al* AO ;INC BY 1 

+ CO*/C1* CX* A6* A5* A4* A3* A2* Al*/AO ;INC BY 2 


GND 


/LOAD VCC 


1 mononthic Ki!) memories &' 
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SIMULATION 


TRACE ON CLK A7 A6 AS A4 A3 A2 Al AO 


;Trace CLK and outputs 


SETF OE /CO Cl CX /OP7 /OP6 /OP5 /OP4 /OP3 


CLOCKF CLK 
CHECK /A7 /A6 /A5 /A4 /A3 /A2 /Al /A0 


SETF CO /Cl 
CLOCKF CLK 
CHECK /A7 /A6 /AS /A4 /A3 /A2 Al /AO 


CLOCKF CLK 
CHECK /A7 /A6 /AS /A4 /A3 A2 /Al /A0 


CLOCKF CLK 
CHECK /A7 /A6 /A5 /A4 /A3 A2 Al /AO 


SETF /CO 
CLOCKF CLK 
CHECK /A7 /A6 /AS /A4 /A3 A2 Al AO 


CLOCKF CLK 
CHECK /A7 /A6 /A5 /A4 A3 /A2 /Al /AO 


SETF CO Cl 
CLOCKF CLK 
CLOCKF CLK 
CHECK /A7 /A6 /A5 /A4 A3 /A2 /Al /A0 


TRACE OFF 


:;Load all Os 


;Increment by 
sto 2 


;Increment by 
sto 4 


;Increment by 
sto 6 


;Increment by 
sto 7 


;Increment by 
sto 8 


;Hold 
;Hold 
sto 8 
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TITLE AN-100 DSP Condition 
PATTERN DSPCOND 

REVISION 1 

AUTHOR Marc Baker 

COMPANY Monolithic Memories 
DATE August 20, 1987 


CHIP DSPCOND PAL16Cl 


Cx3 CX2 CXl CxO SS_ OVF AUB INR /LLC GND 
/CON ACO AC1 AC2 /CX COMP CX AC3 NC NC VCC 
EQUATIONS 
CX = /CX3*/CX2*/CX1*/CX0* CON 
/CX3*/CX2*/CX1* CXO* SS* CON 

/CX3*/CX2* CX1*/CX0* OVF* CON 

/CX3*/CX2* CX1* CX0* AUB* CON 

/CX3* CX2*/CX1*/CX0*/INR* CON 

/CX3* CX2*/CX1* CX0* LLC* CON 

/CX3* CX2* CX1*/CX0*/LLC* CON 

/CX3* CX2* CX1* CX0*/AC3*/AC2*/AC1*/ACO* CON 
CX3*/CX2*/CX1*/CX0*/ACO* CON 

CX3*/CX2*/CX1* CXO* ACO* CON 

CX3*/CX2* CX1*/CX0*/AC1* CON 

CX3*/CX2* CX1* CXO* AC1* CON 

CX3* CX2*/CX1*/CX0*/AC2* CON 

CX3* CX2*/CX1* CXO* AC2* CON 

CX3* CX2* CX1*/CX0*/AC3* CON 

Cx3* CX2* CX1* CxX0* AC3* CON 


t++eteee eee teteeest i 


SIMULATION 


SETF /CON ;SET /CX HIGH 
CHECK /CX 


SETF CON /CX3 /CX2 /CX1l /CX0 ;CX3-CX0=0 - SET /CX LOW 
CHECK CX 


SETF CX0O SS ;CX3-CX0=1 - SET /CX LOW 
CHECK CX 


SETF CXl /CX0 OVF ;CX3-CX0=2 - SET /CX LOW 
CHECK CX 


SETF CX0 AUB ;CX3-CX0=3 - SET /CX LOW 
CHECK CX 


SETF CX2 /CXl /CX0 /INR ;CX3-CX0=4 - SET /CX LOW 
CHECK CX 


SETF CxO LLC ;CX3-Cx0=5 - SET /CX LOW 
CHECK CX 
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SETF CXl 
CHECK CX 


SETF CxX0 
CHECK CX 


SETF CX3 
CHECK CX 


SETF CxX0 
CHECK CX 


SETF CX1 
CHECK CX 


SETF CxX0 
CHECK CX 


SETF CX2 
CHECK CX 


SETF Cx0 
CHECK CX 


SETF CXl 
CHECK CX 


SETF CxX0 
CHECK CX 


Audio Spectrum Analyzer 


/CX0 /LLC 


/AC3 /AC2 /AC1 /ACO 


/CX2 /CX1 


ACO 


/CxX0 


AC] 


/CX1 /CX0 


AC2 


/CX0 


AC3 


;CX3~CX0=6 


;CX3-CX0=7 


3CX3-CX0=8 


7CX3-CX0=9 


7CX3-CX0=10 


;CX3-CX0=11 


7CX3-CX0=12 


7CX3-CX0=13 


;CX3-CX0=14 


7;CX3-CX0=15 
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SET /CX LOW 


~ SET /CX LOW 


- SET /CX LOW 


SET /CX LOW 


SET /CX LOW 


SET 


SET 


SET 


SET 


SET 


/CX 


/CX 


/CX 


/CX 


/CX 


LOW 


LOW 


LOW 


LOW 


LOW 
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TITLE AN-100 DSP Address Control 
PATTERN ADDCONT 

REVISION 1 

AUTHOR Marc Baker 

COMPANY Monolithic Memories 

DATE August 21, 1987 


CHIP ADDCONT PAL22V10 


CLK OP3 OP2 OP1l /ADE D15 D14 D13 D12 NC NC GND 


NC NC Dill D10 ACO ACl1 AC2 AC3 DI 


GLOBAL 
EQUATIONS 
ACO := /OP3*/OP2* OP1*/ACO* ADE ; INC 
+ /OP3* OP2*/OP1*/ACO* ADE ;DEC 
+ /OP3* OP2* OP1* D12* ADE ;D12 
+ OP3*/OP2*/OP1* D1l1l* ADE ;D1l 
+ OP3*/OP2* OP1* D8 * ADE ;D8 
+ OP3* OP2*/OP1* D1l4* ADE ;MSB EQUATIONS 
+ OP3* OP2*/OP1*/D14*/D13* D12* ADE 
+ OP3* OP2*/OP1*/D14*/D13*/D12*/D11* D10* ADE 
+ OP3* OP2*/OP1*/D14*/D13*/D12*/D11*/D10*/D9*D8* ADE 
+ OP3* OP2* OP1* ACO ;HOLD 
+ ACO*/ADE ; HOLD 
AC1l := /OP3*/OP2* OP1* AC1*/ACO* ADE ;INC 
+ /OP3*/OP2* OP1*/AC1* ACO* ADE ; INC 
+ /OP3* OP2*/OP1* AC1* ACO* ADE ;DEC 
+ /OP3* OP2*/OP1*/AC1*/ACO* ADE ;DEC 
+ /OP3* OP2* OP1* D13* ADE ;D13 
+ OP3*/OP2*/OP1* D12* ADE ;D12 
+ OP3*/OP2* OP1* D9 * ADE ;D9 
+ OP3* OP2*/OP1* D14* ADE ;MSB EQUATIONS 
+ OP3* OP2*/OP1*/D14* D13* ADE 
+ OP3* OP2*/OP1*/D14*/D13*/D12*/D11* D10* ADE 
+ OP3* OP2*/OP1*/D14*/D13*/D12*/D11*/D10* D9* ADE 
+ OP3* OP2* OP1* ACL ;HOLD 
+ AC1*/ADE ; HOLD 
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AC2 := /OP3*/OP2* OP1*/AC2* AC1* ACO* ADE ;INC 
+ /OP3*/OP2* OP1* AC2*/AC1* ADE ; INC 
+ /OP3*/OP2* OP1* AC2* /ACO* ADE ; INC 
+ /OP3* OP2*/OP1*/AC2*/AC1*/ACO* ADE ;DEC 
+ /OP3* OP2*/OP1* AC2* AC1* ADE ;DEC 
+ /OP3* OP2*/OP1* AC2* ACO* ADE ;DEC 
+ /OP3* OP2* OP1*. D14* ADE ;D14 
+ OP3*/OP2*/OP1* D13* ADE ;D13 
+ OP3*/OP2* OP1* D10* ADE ;D10 
+ OP3* OP2*/OP1* D14* ADE ;MSB EQUATIONS 
+ OP3* OP2*/OP1* D13* ADE 
+ OP3* OP2*/OP1* D12* ADE 
+ OP3* OP2*/OP1* D1ll* ADE 
+ OP3* OP2* OP1* AC2 ; HOLD 
+ AC2*/ADE ; HOLD 

AC3 := /OP3*/OP2* OP1*/AC3* AC2* AC1* ACO* ADE ;INC 
+ /OP3*/OP2* OP1* AC3*/AC2* ADE ;INC 
+ /OP3*/OP2* OP1* AC3* /ACLK ADE ;INC 
+ /OP3*/OP2* OP1* AC3* /ACO* ADE ;INC 
+ /OP3* OP2*/OP1*/AC3*/AC2*/AC1*/ACO* ADE ;DEC 
+ /OP3* OP2*/OP1*/AC3* AC2* ADE ;DEC 
+ /OP3* OP2*/OP1*/AC3* AC1* ADE ;DEC 
+ /OP3* OP2*/OP1*/AC3* ACO* ADE ;DEC 
+ /OP3* OP2* OP1* D15* ADE ;D15 
+ OP3*/OP2*/OP1* D14* ADE ;D14 
+ OP3*/OP2* OP1* D1ll* ADE ;D11 
+ OP3* OP2* OP1* AC3 ; HOLD 
+ AC3*/ADE ; HOLD 

SIMULATION 


TRACE ON CLK AC3 AC2 AC1 ACO 


SETF ADE /OP3 /OP2 /OP1 
CLOCKF CLK ;CLEAR TO 0 
CHECK /AC3 /AC2 /AC1 /ACO 


SETF OP1 
CLOCKF CLK ; INCREMENT TO 1 
CHECK /AC3 /AC2 /AC1 ACO 


CLOCKF CLK ; INCREMENT TO 2 
CHECK /AC3 /AC2 ACl /ACO 


CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK ; INCREMENT TO 8 
CHECK AC3 /AC2 /AC1 /ACO 
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SETF OP2 /OP1 
CLOCKF CLK ;DECREMENT TO 7 
CHECK /AC3 AC2 AC1 ACO 


CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK ;DECREMENT TO 3 
CHECK /AC3 /AC2 ACl ACO 


SETF OPl /D15 D14 /D13 D12 /D1l 
CLOCKF CLK ;SET TO D15-D12 
CHECK /AC3 AC2 /AC1 ACO 


SETF OP3 /OP2 /OP1 
CLOCKF CLK ;SET TO D14-Dl1l 
CHECK AC3 /AC2 AC1 /ACO 


SETF OP1 D10 /D9 D8 
CLOCKF CLK ;SET TO D11-D8 
CHECK /AC3 AC2 /AC1 ACO 


SETF OP2 /OP1 
CLOCKF CLK ;CHECK MSB 
CHECK /AC3 AC2 ACl ACO 


SETF /D14 
CLOCKF CLK ;CHECK MSB 
CHECK /AC3 AC2 /AC1 ACO 


SETF OP1 
CLOCKF CLK ; HOLD 
CHECK /AC3 AC2 /AC1 ACO 


SETF /OP3 /OP2 /OP1 /ADE ;HOLD 
CLOCKF CLK 
CHECK /AC3 AC2 /AC1 ACO 


TRACE _OFF 


Bus Interface 











Bus interface plays an important role in computer architecture 
design. As systems become more complicated, the bus require- 
ment becomes more critical in terms of speed, power, accessibil- 
ity and feasibility. There are many different kinds of buses in the 
electronics industry. Each of the buses is specially designed to 
fit into a special architecture for various purposes. There are 
many stringent rules and regulations to follow depending on the 
application to design a bus architecture or bus interface. The 
application depends on timing (synchronous or asynchronous), 
type (military or commercial), architecture (open or closed), data 
structure (serial or parallel), computer type (micro, mini or others) 
ora mix of the above. 


There have been nearly one hundred buses since the invention 
of the computer. A great deal of them have faded away because 
the application no longer exists, or have been replaced by new 
technology. 


The following are some current major buses: 


DEC Unibus 
Future Bus 
IEEE 488 Bus 
Multibus Il 

Nu Bus 

PC Bus 

VME Bus 

Q Bus 


Disregarding the complexity and different varieties of buses, 
there are four basic sections (Figure 1) in the bus structure: 


« Address Section 
* Data Section 

* Control Section 
* Utility Section 






BUS INTERFACE 


ADDRESS SECTION 
DATA SECTION 
CONTROL SECTION 
UTILITY SECTION 
415 01 


Figure 1. Four Sections of the Bus Interface 
Address Section 


The Address Section consists of the address lines and control 
portion. The address lines are used by the processor to indicate 
to memory and other peripherals the location with which it wants 
to communicate. The address width varies from bus to bus. For 


example, VME, Nu and Multibus II are 32 bits wide. DEC Unibus, 
PC Bus and Q Bus are 18, 20 and 22 bits wide, respectively. The 
width of the address line has gradually expanded with each new 
bus announcement. As the width expands, the address decoding 
becomes more complex. One solution to this complex address 
decoding problem is to use PAL devices such as the PAL16L8/ 
20L8/20L10. The advantage of using these PAL devices is to 
utilize their wide input range (up to 16/20) to address up to 64/1M 
addresses by a single PAL device. 


Data Section 


The Data Section consists of the data lines and contro! portion. 
The data lines carry instruction and data between the processor 
and the peripherals, including the memory. The control portion 
provides support to the data lines. In most cases, data can be 
addressed as byte, word or double-word. For instance, PC Bus, 
DEC Unibus, and Nu Bus can be addressed as 8, 16 and 32-bits 
wide, respectively. VME and Multibus II can both be addressed 
as 8, 16, 2401 32-bits wide. The externalbus data communication 
requires handshaking. As long as the protocol is well defined, it 
can be implemented easily by the PAL16R8/20R8/20X10. If a 
change is needed after the circuit is designed, we can simply 
modify the PAL device file. In other words, by using PAL devices 
for data communication, flexibility can be achieved with minimum 
hardware change. 


One of the major problems when dealing with data transfer is 
synchronization. Synchronization is when events occur concur- 
rently with a regular or predictable timing relationship, typically 
based ona system clock. The data transfer control functions for 
this type of synchronous bus can be implemented by PAL devices 
such as the 20R8. In asynchronous buses, the data lines com- 
municate through handshaking instead of the data clock. This 
can be tedious when using discrete logic. The asynchronous 
devices PAL16RA8/20RA10 can be very useful for implementing 
such designs. 


Another problem is when both ends of the data lines have different 
data rates. Inthis case we need FIFO memories. FIFO memories 
match the data rates and serve as temporary buffers between 
source and destination. 


Control Section 


The Control Section coordinates the operation of all bus interface 
circuitry. This is the heart of the bus interface design. The major 
architecture differences can be observed here. Generally, any 
device that is capable of controlling bus operation is called a “bus 
master”. Devices that operate on the bus but cannot control it, i.e. 
the memory system, I/O ports, etc. are called “slaves”. Most of 
the buses are designed to allow more than one bus master ora 
variety of intelligent devices, i.e. asecond CPU, disk controller or 
network controller, on the bus. 
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Functionally, the Control Section can be split into three 
subsections: 


* Interrupt Subsection 
«+ DMA Subsection 
« Arbiter Subsection 


Interrupt Subsection 


Interrupts are important to all bus systems. These signals are 
designed to allow infrequent or important peripherals to get the 
attention of the bus master. Amaskable interrupt can be ignored 
by the bus master, while a non-maskable interrupt can not. 
Vectored interrupt inputs provide priority to the interrupt controller 
in case more than one interrupt request should occur simultane- 
ously, The number and type of interrupt inputs vary widely among 
buses. 


To determine which device caused the interrupt in order to 
execute the correct interrupt-handling routine, two methods can 
be used: 


1. Software Method—tn this method, a polling program checks, 
one by one, each of the devices connected tothe interrupt line, 
to determine which one is interrupting. When an interrupting 
device is found, the routine then branches to the appropriate 
interrupt handler. This scheme is considerably slow dueto the 
software overhead involved. 


2. Hardware Method—This method is more efficient than the 
software scheme. The interrupted peripheral will provide an 
interrupt vector which forms the branching address to the 
interrupt-handling routine at the time it occurs. This is called 
automated interrupt-vectoring (Figure 2). After the Interrupt 
Request (IR) and the vector are issued by a peripheral, the 
Interrupt Handler arbitrates, grants and serves that interrupt. 
The VME and DEC Unibus use this interrupt technique. The 
required branching address is loaded in the address register. 
Ordinary registered PAL devices such as the 20R8/20X10 to- 
gether with the 20L8/20L10 are suitable for this technique. A 
design example of the interrupt controller for the DEC Unibus 
is shown on page 2-328. This example demonstrates a near 
4to 1 chip count saving by using a PAL device design. Obvi- 
ously, the real estate and debugging time shrink and the 
reliability increases. 







INTERRUPT 


BUS 
HANDLER < REQUESTER 





INTERRUPTER 1 


In the event that several interrupts occur simultaneously, a priority 
scheme is used to decide which I/O device should be granted 
control of the bus. The most popular one is the Fixed Priority 
Scheme. This assigns a fixed priority number to each device. 
This number will be there untilthe next configuration change. The 
other scheme is more complex and is rarely used, called the 
Programmable Priority Scheme. In this scheme, the priority can 
be changed dynamically by the software program. This scheme 
may be useful in applications such as strategic military or dynamic 
financial marketing where the strategy changes with various 
battles or investment environments; but, the software design 
may be very complex. However, both schemes can be accom- 
plished by using sequencer devices such as the PMS14R21. 
This powerful device can serve as a local distributed processor 
within the Bus Interface Unit. 


Direct Memory Access (DMA) Subsection 


The Direct Memory Access concept exists because the ordinary 
memory transferring speed often isn’t fast enough. The transfer 
speed of a microprocessor is limited by the software overhead 
involved in transferring successive words: several instructions 
must be executed in order to transfer a word or a chain of words. 
In such cases where the ordinary memory transferring speeds 
may not be sufficient, as in a floppy disk or a frame buffer, DMA 
is required. The DMA Controller implements the transfer algo- 
rithm in hardware and automates the word or block transfers 
directly between the memory and the I/O device. 


Two techniques are used: 

1. The processor is halted or suspended by the DMA Controller 
(DMAC). The DMA dominates the bus until the end of the 
transfer. 

2. The DMAC steals a memory cycle from the processor and 
inserts a data transfer between the memory and the I/O 


device. This is called cycle-stealing DMA. 


These methods can also be used in a combined fashion. 






INTERRUPTER 2 


ARBITRATION BUS 
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Figure 2. Bus Interrupt Example 





2-326 


cl Monolithic rari Memories re | 


Bus Interface 





Ce a TT | 


DMAC design can involve complex handshake sequences be- 
tween the processor and an |/O device. Itcontributes significantly 
to the system’s cost and complexity and thus it is not normally 
used in small systems. However, with the availability of PAL 
devices, DMA applications are becoming more popular. The 
handshake sequence and complex discrete logic design can be 
replaced by asynchronous PAL devices such as the PAL16RA8/ 
20RA10 and programmable sequencers such as the 
PMS14R21/A. This can significantly decrease the cost, real 
estate and debugging time. 


Arbiter Subsection 


Bus Arbiter designs are becoming more critical and complex as 
microprocessor costs decrease. This is because it is becoming 
more cost-effective to design systems with multiple processors 
sharing global resources. Inthe bus interface, the arbiter acts as 
a judge to decide who should own the proprietorship of the bus 
and for how long. The arbiter decides which Requester should be 
granted control of the Bus when several requests happen simul- 
taneously. 


The function of the arbiter consists of: 
1. Granting the use of the bus to one of the bus masters. 


2. Scheduling requests from multiple bus masters for optimum 
bus use. 


3. Preventing simultaneous use of the bus by two bus masters. 


In other words, a successful arbiter design can prevent bus 
hangup and increase bus bandwidth. 


Four major types of arbitration are: 


1. Prioritized Arbiter—This priority scheme provide a pre-as- 
signed number to each bus master to arbitrate the execution 
order. 


2. Round-robin Arbiter—This scheme assigns the bus master 
on a rotating priority basis. 


3. Single-Level Arbiter—This scheme provides only single level 
arbitration. However, it relies on a bus grant daisy-chain to 
arbitrate the request. 


4, Fairness Arbiter—This scheme prohibits the winning module 
from being serviced again until all pending requests have 
been serviced. 


An example of using the PAL22V10 to perform the Prioritized 
Arbiter is described on the application note on page 2-347. The 
arbitration process and PAL device equations are also provided. 


An example of an arbiter used for resolving a bus contention 
between two bus masters fighting for control is shown in 
Figure 3. 


When both master 1 (M,) and master 2 (M,) issue a Bus Request 
(BR), the Arbiter will grant the Bus Grant (BG) to the one with 
higher priority. Assuming it is M,, M, will then drive Bus Busy (B 
Busy) active and release the BR signal. After M, finishes its op- 
eration, the Arbiter detects that the BR signal is still low and 
transfers the bus control to M,. 


Bae aT : 
$l a 
ee 2 


B BUSY 
B CLR 






REQUESTER M2 
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Figure 3. Arbitration Example: Bus Contention 
Utility Section 


The Utility section provides support to each of the previous 
sections. It links the entire bus interface between modules. The 
Utility Section consists of the System Clock Distribution, Power 
Monitor and System Diagnostics. The Power Monitor detects 
power failures and signals the system to start an orderly shut- 
down. When poweris then reapplied to the system, it ensures that 
each module is initialized accordingly. For example, the power 
failure and power up sequences forthe VME bus are 4ms and 200 
ms, respectively. This can help the processor to react to these 
external interrupts according to the system specification. A 
system without diagnostics is an incomplete system. System 
Diagnostics can be implemented by a Programmable Sequencer 
such as the PMS14R21/A (page 3-123). 


Future System Expansion 


With the rapid rate of improvement of technology, the useful life 
of a new system is increasingly getting shorter. Therefore, the 
future system expansion consideration can be a major design 
feature to system designers as well as a selection guideline to a 
buyer. The key to system expansion in a general computer 
system is the bus through which the CPU communicates with the 
present and future system components. Design flexibility, high 
throughput, multiprocessor support, and low power consumption 
soon become critical design considerations for bus interface 
techniques. PLDs can be the solution for bus interface, not only 
because of their flexibility in terms of possible hardware moditica- 
tion, but also because of their real estate saving, which in turn 
lowers the manufacturing cost and power consumption. 
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Functional Description 


One of the more established computer families is the Digital 
Equipment Corporation’s PDP-11 series. This family of compu- 
ters uses the DEC unibus to communicate between cards. A 
specific protocol is required to interface a card to the unibus. 
This protocol is described in the available DEC literature. 


Since the unibus is an asynchronous bus, much of the interface 
circuitry consists of combinational logic to generate specific 
signals and flip-flops which are set and reset as flags. This tends 
to use a lot of SSI and MSI logic packages. Using Monolithic 
Memories’ PAL devices, much of this logic can be condensed 
into a few packages. Figure 2 is the schematic diagram for an 
interrupt controller to be used on the unibus. 


Many cards communicate over the bus by taking control of the 
unibus with an interrupt request, and then do whatever they 
require before releasing control. As can be seen, this interrupt 
controller takes six special interface ICs, (380 and 8881 bus 


AN-131C 


drivers and receivers) eight MSI, SSI ICs, (7400, 7402 and 7474s) 
along with some transistors and discrete parts. This parts count 
can be considerably reduced by using a PAL20RA10 and a 
PAL20L10. 


Figure 1 shows how the circuit with the PAL devices would look. 
The two PAL devices allow almost all of the 7400, 7402 and 7474 
packages to be removed. (Almost a four-to-one saving in chip 
count.) In addition, the preload pin (PRLD) on the 20RA10 
allows the flip-flops to be easily set to a known state on power 
up, or when reinitializing. So the PAL devices reduce the logic 
package count from eight chips to three. 


In the schematic shown, there are three VLSI devices, three 
MSls and two SSls. Using a PAL20RA(10, it is possible to replace 
three MSlIs and one SSI device, thereby reducing the chip count 
by a factor of two. The ICs inside the enclosed loop were 
replaced. 
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TITLE INTERRUPT LOGIC 

PATTERN INTRPO1 

REVISION 01 

AUTHOR DAN KINSELLA 

COMPANY MONOLITHIC MEMORIES, INC. 
DATE 11/06/87 


CHIP INT LOGIC PAL20L10 


INTRAH INTRAHEN MCLEARAH INTRBH INTRBHEN MCLEARBH BGINBH 
BGINAH BUSSSYNL BUSBBSYL STARTINTRAL GND 
STARTINTRBL FF3RESET ENINTRB INTRDONEBH INTRDONEAH EN8881 
SSYN NBGINBH FFI1RESET ENINTRA NBGINAH VCC 


EQUATIONS 
/NBGINAH = BGINAH ;FF1 CLK CONTROL 
; BLOCK A 
/FFIRESET = MCLEARAH+ENINTRA ;SET FF1 CONTROL 
; BLOCK A 
/ENINTRA = INTRAHEN* INTRAH ;ENABLE INTERRUPT A 
/NBGINBH = BGINBH :;FF3 CLOCK CONTROL 
; BLOCK B 
/SSYN = BUSSSYNL*BUSBBSYL ;SYNCHRONIZE FF2 & 
: FFA 
/EN8881 = STARTINTRAL*STARTINTRBL; INTERRUPT BUS 
/ INTRDONEAH = BUSSSYNL+STARTINTRAL ;SIGNAL INTERRUPT 
; DONE 
/ENINTRB = INTRBH* INTRBHEN ; ENABLE INTERRUPT B 
/FF3RESET = MCLEARBH+ENINTRB ;SET FF3 CONTROL 
; BLOCK B 
/ INTRDONEBH = BUSSSYNL+STARTINTRBL ;SIGNAL INTERRUPT 
; DONE 
7 DESCRIPTION 


;COMBINATORIAL LOGIC FOR PAL20RA10 INTERRUPT CONTROLLER 
;(1ST PART OF THE TWO PAL SOLUTION: PAL20L10 & PAL20RA10) 


; SIMULATION NOT INCLUDED 
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Figure 2. 
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PAL Design Specification Simulation Results 
Title DEC PDP-11 unibus interrupt controller Page : 1 
Pattern Control.pds ggg 
Revision A FF1RESET LHHH 
Author Dan Kinsella FF3RESET LHHH 
Company Monolithic Memories Inc., Santa Clara, CA AINTR HLLL 
Date 3/1/85 BINTR HLLL 
SSYN XXXL 
CHIP INTR_CONTROL PAL20RA10 ABGIN XHHH 
BBGIN XHHH 
PL AINTR NC ABGIN FF1RESET SSYN BINTR NC FF3RESET BBGIN FF1 LLLL 
NC GND FF3 LLLL 
NC OUT4 OUT3 OUT2 OUT1 FF3 NFF4 FF4 NFF2 FF2 FF1 vcc NFF2 XLLL 
NFF4 XLLL 
EQUATIONS ouT1 XHHH 
OUT2 XHHH 
/FF1 t= /FFL*FF2 ?Master control OUT3 LHHH 
FF1.SETF = /FFLRESET tblock A OUT4 LHHH 
FF1.CLKF = /ABGIN 
FF2 := FFL ;Bus Busy Signal 
FF2.SETF = /AINTR 
FF2.CLKF = ABGIN*FF2*/SSYN 
/NFF2. t= FFL ;Bus sack signal 
NFF2.SETF = /AINTR 
NFF2.CLKF = ABGIN*NFF2*/SSYN 
/FF3 t= /FFI*FF4 ?Master control 
FF3.SETF = /FF3RESET tblock B 
FF3.CLKF = /BBGIN 
FF4 t= FF4 7Bus busy signal 
FF4.SETF = /BINTR 
FF4.CLKF = BBGIN*FF4*/SSYN 
/NFF4 := FF3 7Bus sack signal 
NFF4.SETF = /BINTR 
NFF4.CLKF = BBGIN*NFF4*/SSYN 
/OUT1 = FF1+FF2 ;Bus request signal 
sblock A 
/OUT2 = FF4+FF3 ;Bus request signal 
tblock B 
/OUT3 = AINTR sIntr. signal for 
sbus req. block A 
/OUT4 = BINTR ;Intr. signal for 


sbus req. block B 
SIMULATION 


TRACE_ON FF1RESET FF3RESET AINTR BINTR SSYN ABGIN BBGIN 
FFl FF3 NFF2 NFF4 OUT1 OUT2 OUT3 OUT4 


SETF /FF1RESET /FF3RESET AINTR BINTR 7Reset all regs 
SETF FF1RESET FF3RESET /AINTR /BINTR #Clock FF1 and FF3 
ABGIN BBGIN ;regs 
SETF /SSYN 7Clock NFF and NFF3 
regs 
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A Muitibus Arbiter Design 
for 10 MHz Processors 


This application note describes the implementa- 
tion of a bus arbiter using a PAL16Ré4 to interface 
the 10 MHz 80186 to the MULTIBUS multi-master 
environment. The PAL equations for bus ex- 
change were developed based on functional and 
timing requirements specified in the Intel 
MULTIBUS Specification. The interface shown 
addresses the bus master case in regards to bus 
exchange only. No attempt has been made to 
include the requirements for bus slave operation, 
interrupt handling or byte operations. This note 
assumes a knowledge of MULTIBUS operations 
and signal names. 


Overview 


The system environment is assumed to be a 
collection of loosely coupled independent 
processors which communicate through a com- 
mon memory on the system bus. Each processor 
in this system possesses its own local resources. 


A block diagram showing the interface of one such 
processor, an 80186, to the system bus, 
MULTIBUS, is shown in Figure 1. A block of the 
80186’s address space is allocated to system 
memory. Access into this portion of the address 
map generates a request for the system bus viaa 
programmable memory chip-select pin. Once a 
request is generated by the CPU, the PAL16R4 
arbiter and the 8288 bus controller combine to 
perform a bus exchange and transfer cycle. 


In this design the arbiter maintains control of the 
system bus and therefore must be forced off either 
by a higher priority master (BPRN goes HIGH) ora 
Common Bus Request (CBRQ). 





Functional Requirements 


On the MULTIBUS, the bus exchange process 
begins when the Bus Request (BREQ) goes LOW 
(parallel priority resolution scheme), or when 
BPRO goes HIGH to succeeding masters (Serial 
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priority resolution scheme). If the requesting 
master is granted priority (BPRN enabled) and the 
bus is not under control of another master (BUSY 
disabled), then the master may take control of the 
bus by forcing BUSY LOW. 








An optional request mechanism, Common Bus 
Request, used in conjunction with either parallel or 
serial priority has been implemented in this design. 


It reduces bus acquisition overhead by allowing 
bus retention across transfer cycles, unless a 
request is pending (CBRQ is LOW). For example, 
in parallel priority the requesting master asserts 
both BREQ and CBRQ to force the current master 
off the bus. Once acquisition is complete CBRQ is 
released for use by another master. 








Timing Considerations 


Bus exchange on MULTIBUS is a synchronous 
process. BREQ, BPRO, BUSY and CBRQ are all 

synchronized with the trailing edge (HIGH-to- 
LOW transition) of BCLK. BCLK is a MULTIBUS 
signal having a duty cycle of approximately 50% 
and a maximum frequency of 10 MHz. There is no 
requirement for synchronization between BCLK 
and any other clock in a MULTIBUS system. 
Synchronizer circuits must be used on signals 
that cross clock boundaries, because it is possi- 
ble (as in this design) for the processor clock to be 
asynchronous to BCLK. 


Bus priority on MULTIBUS resolution takes place 
in one BCLK cycle. If the serial priority scheme is 
implemented, this requirement will place an upper 
limit on the number of masters allowed for a given 
BCLK rate. The parallel priority scheme allows 
the relationship between BCLK and the number of 
masters to be independent. Whichever is chosen, 
the MULTIBUS specified minimum setup time of 
22 ns (resolution to clock), plus the desire to keep 
the BPRN to BPRO propagation delay to a mini- 
mum, points to the use of an “A’ speed PAL 
device (Figure 2). 
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A Multibus Arbiter Design for 10 MHz Processors 


The MULTIBUS bus exchange timing requires a 
BUSY setup time before clock of 25 ns (see Fig- 
ure 2). Release by one master and acquisi- 
tion by another takes place on two successive 
HIGH to LOW transitions of BCLK. 


PAL Design Description 


The PAL design specification with supporting 
timing diagrams are shown in Figures 3 and 4. The 
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following remarks are in addition to the comments 
found on the PAL design specification. 


The term "BRQP*AEN" found in the equation for 
"BREQ" is required to keep the arbiter from 
recapturing the bus when it is the releasing master. 
This can occur when a CBRQ forced the 
surrender, and the processor cycle which follows 
the just completed cycle accesses the system 
memory. 








A80186-10 





Figure 1. Single Multimaster Bus Interface 
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A Multibus Arbiter Design for 10 MHz Processors 


Using BREQ to disable BPRO results in a 40 ns 
delay worst case from clock (tCO + tPD). Taking 
into account the 22 ns setup (see Fig. 2) and 
clock skew (3 ns max) leaves 35 ns of cycle (10 
MHz BCLKk), if the serial priority resolution 
scheme is being employed. Lower priority mas- 
ters using this design would exhibit a 25 ns worst 
case delay BPRN to BPRO. Use an external gate 
to implement the equation ‘‘BPRO = 
BPRN*BREQ’ if the serial priority is required when 
using more than two masters. 





Note the inversion of BLCK for use by the PAL 
device. PAL device registers load data on the 
LOW to HIGH clock transition of pin 1. 


MULTIBUS address setup before Command (50 
ns min.) is determined by the 8288 bus controller. 
The 8288 specification guarantees 115 ns 
minimum between AEN going LOW and 
Commands going active. 





MULTIBUS address hold after Command (50 ns 
min.) is satisfied by using the ALE signal from the 
8288 instead of the 80186. The 8288 Command 
lines go inactive on the HIGH to LOW transition of 
CLKOUT at the beginning of T4 (30 ns max delay). 
The 8288 ALE signal goes active on the HIGH to 
LOW transition of CLKOUT at the beginning of T1 
(1/2 clock cycle later than ALE from the 80186). 
The minimum hold time is therefore 100 ns - 35 ns 
= 65ns. 
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A Multibus Arbiter Design for 10 MHz Processors 





TITLE A MULTIBUS ARBITER 
PATTERN AMP3-318 

REVISION 005 

AUTHOR JOE ENGINEER 

COMPANY ADVANCED MICRO DEVICES 
DATE 11/15/87 


CHIP MULTIB PAL16R4 


BCLK /RD /WR /SREQ /RESET /BPRN NC NC NC GND 
/E /CBRQ /BUSY /SYNC /HREQ /AEN /OEN /BREQ /BPRO VCC 


EQUATIONS 


BPRO = /HREQ*BPRN 


BREQ = HREQ+tAEN 
OEN := /RESET*SREQ*AEN 
AEN := /RESET*AEN*HREQ*RD ;HOLD BUS UNTIL CURRENT 
+ /RESET*AEN*HREQ*AWR ;CYCLE IS COMPLETE 
+ /RESET*HREQ*BPRN* /BUSY ; INITIAL BUS ACQUISITION 
+ /RESET* AEN* BPRNé4 /CBRQ ;HOLD BUS IF NO OTHER REQUEST 


HREQ := /RESET*SYNC 


SYNC := /RESET*SREQ*RD 
+ /RESET*SREQ*WR 


BUSY = AEN 
BUSY.TRST = AEN 


CBRQ = /AEN*HREQ 
CBRQ.TRST = /AEN*HREQ 


;THIS MODIFICATION ALLOWS THE ARBITER TO RESIDE ANYWHERE IN 
;THE ARBITRATION DAISY CHAIN 


Figure 3. PAL16R4 Arbiter Design Specification 
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Multibus to Am9516 Interface 





This interface shows the Am9516 connected to the MULTIBUS 
(Figure 1). This is accomplished by two PAL device designs. 
The equations for the PAL devices are shown in Figures 2 and 3. 
The first designated Am9516MBC does the MULTIBUS arbitra- 
tion as defined by the MULTIBUS specification. Common bus 
request (CBRQ) was not implemented in this design. Addition- 
ally this design holds the bus as long as BREQ is active. If the 
user wishes to release the bus after each transaction the 
Am9516 should be programmed for CPU interleave. 


The second PAL device in this interface designated 9516MBC, 
converts the Am9516 signals into MULTIBUS control signals. It 
also generates RD and WR for the 8530 so that flyby transfers 





can be done. When not doing flyby, this part of the PAL device 
should be changed. There are several considerations when 
using the SCC with DMA not addressed here. These are cov- 
ered in a separate application. This example will work for 
moderate serial data rates. To operate the SCC at maximum 
speed, a local RAM should be used as bus arbitration overhead 
could cause problems. The main purpose here was to illustrate 
the versatility of PAL devices and how easy it is to interface 
apparently incompatible devices to the MULTIBUS. 


The two PAL devices shown are similar in function to the 8289 
and 8288 shown with the 8086 CPU. A similar design could be 
done for processors such as the 68000 or other bus masters 
such as the 8052 CRT controller which has its own DMA. 
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Multibus to Am9516 Interface 





TITLE MULTIBUS CONTROLLER FOR AM9516 
PATTERN AMP3-189 

REVISION 01 

AUTHOR JOE BRCICH 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/15/87 

CHIP MULTI1 PAL16L8 


BACK MIO NC NC /DACK NC NC NC /CEN GND 
NC /RD /IORC /DS /MWTC /MRDC /IOWC /RW /WR VCC 


EQUATIONS 

IORC = /MIO*DS*/RW*CEN 
IORC.TRST = BACK 

IOWC = /MIO*DS*RW*CEN 
IOWC.TRST = BACK 

MRDC = MIO*DS*/RW*CEN 
MRDC.TRST = BACK 

MWTC = MIO*DS*RW*CEN 


MWTC.TRST = BACK 


RD = DACK*RW*BACK 
+ TIORC*/BACK 

WR = DACK*/RW*BACK 
+ IOWC*/BACK 

DS = IORC 
+ IOWC 


DS.TRST = /BACK 
RW = IOWC 
RW.TRST = /BACK 
; DESCRIPTION 


;This PAL device converts multibus signals into 9516 compatible 
;Signals and vice-versa. It also supports the 8530 in flyby mode 


Figure 2. Source Listing for MULTIBUS to Am9516 Interface 
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TITLE MULTIBUS ARBITER FOR AM9516 
PATTERN AMP3-190 

REVISION 01 

AUTHOR JOE BRCICH 

COMPANY ADVANCED MICRO DEVICES 

DATE 11/15/87 


CHIP MULTI2 PAL16R4 


/BCLK /XACK BRQ /BSY /BPRN /DS NC  /IORC /CS GND 
/OE /RBEN /TBEN BACK /CEN /BREQ /BUSY /BPRO /WAIT VCC 


EQUATIONS 
TBEN = IORC*CS 


TBEN.TRST = /BACK 





RBEN = /IORC*CS 


RBEN.TRST = /BACK 


WAIT =/XACK*BACK 

BREQ := BRQ 

BPRO = /BRQ*BPRN 

/BACK := /BUSY 

BUSY := BREQ*BPRN*/BSY*/BUSY 
+ BREQ*BUSY*BPRN 
+ DS*BUSY 

CEN := BACK 

; DESCRIPTION 


;/CEN delays the commands to meet the multibus requirements that 
;address and data be valid at least 50 ns prior to control active. 
;If we do not allow preemption; remove BPRN from the second 
;expression in the BUSY equation and eliminate the third expression 


Figure 3. Source Listing for MULTIBUS to Am9516 Interface 
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Z-Bus and 8088/8086 Interface 





This application note describes how replacing two 8086 support 
chips with a Z8000 support chip and a PAL16R8A allows the 
8086 CPU to interface directly to the Z-BUS. Since the timing of 
the signals used is the same for the 8088 CPU, this circuit will 
work equally well in those applications. 


Interfacing the 8086 CPU to the Z-BUS allows 8086 users to 
take advantage of the very powerful Z8000 peripheral and 
memory support circuits that are available. The Z8000 periph- 
eral circuits in particular offer the user higher throughput rates, 
simpler contro! software and less system overhead require- 
ments than any previous generation peripheral family for 
any CPU. 


Design Requirements 


The 8086 CPU can operate in two different modes. In minimum 
mode, it generates all the bus control and timing signals for the 
8086 (8085, 8088) buses directly on-chip. In maximum mode, 
the CPU puts out status information early in each bus cycle and 
relies on an external bus controller chip, the 8288, to generate 
timing and control signals. This implementation uses the CPU in 
maximum mode and replaces the 8288 with a PAL16R8A that 
generates the Z-BUS timing and control signals from the status 


56pF 





signals provided by the CPU. It also makes use of the AmZ8127 
clock generator to allow precise timing resolution by providing 
an oscillator signal at 3 times the CPU clock frequency. The 
AmZ8127 provides all the clock generation functions of an 
8284A as well as several additional functions. Either clock chip 
will work in this system. 


The bus controller provides the following functions: 


—Generates AS, DS, INTACK and R/W with proper timing rela- 
tive to address and data. 


—Provides simultaneous assertion of AS and DS during reset. 
—Automatic insertion of 1 wait state for all 1/O cycles. 


—Synthesizes a single Z-BUS interrupt acknowledge cycle from 
the 8086 IACK cycles. 


Figure 1 shows the circuit interconnection diagram. The system 
uses a high-speed PAL16R8A to generate AS, DS, R/W and 
INTACK of the Z-BUS, RDY for wait state generation, and three 
internal state variables. The registers are clocked with the 15- 
MHz OSC signal from the 8127. The five input signals to the PAL 
device are 5-MHz CPU Clock Signal (CLK), System Reset 
(RESET), and the three CPU Status States (Sp, S,, Sz). 





AmPAL16R8A 


03862A-103 


Figure 1. Circuit Interconnection Diagram 
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The CPU indicates the start of a bus cycle by bringing at least one 
of the status lines low from the idle high state (see Figure 2). This 
starts an internal timing sequence within the PAL device which 
corresponds Closely to the various T states of a bus cycle. AS is 
asserted during the time CLK is LOW during T,. DS is asserted 
at the start of T3. If itis an 1/O cycle, then RDY would be disabled 
for one CLK period straddling Tz and T3 causing the 8127 to 
request 1 wait state after Ts. In either case, DS remains as- 
serted until after the first 1/3 of T4, which is identified by the 
Status lines returning to the idle state during the previous cycle. 


2Z-Bus and 8088/8086 Interface 


R/W is generated by sampling Sp and S, during AS. 
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Figure 2. Memory and I/O Timing Diagram 
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It is in the realm of interrupts where the Z8000 peripherals shine 
over other peripherals. Each peripheral can identify many differ 
ent exception conditions during its operation. The occurrence of 
one or more of these conditions causes activation of a single 
interrupt request line. The peripheral wants the CPU to respond 
with a single interrupt acknowledge cycle, during which the pe- 
ripheral resolves priority and provides the CPU with enough 
status and vector information to allow it to respond to the excep- 
tion without any further interrogation of the peripheral. This al- 
lows interrupt driven systems to achieve very high data 
throughput rates. 
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The 8086 CPU responds to an interrupt request with a sequence 
of two interrupt acknowledge cycles, and only in the second is 
any data read off the bus. As stated before, the Z-BUS peripher- 
als require only one acknowledge cycle. The timing of this has to 
be such that there is enough delay between AS going HIGH and 
DS going LOW to allow any prioritizing daisy chains to settle, 
and DS has to be wide enough to allow the peripheral time to 
place vector or status information on the bus. Figure 3 shows 
how these two requirements are accomplished by turning the 
two acknowledge cycles into one. The first cycle allows only AS 
and the second asserts only DS and does so for the complete 
cycle. This appears to the peripheral as one very long bus cycle 
which is identified as an interrupt acknowledge cycle by the 
assertion of INTACK. 


1 | T2 | 13 | T1 | 


Design Approach 


To implement this design in a PAL16R8A requires recognizing 
the Z-BUS timing characteristics in Figures 2 and 3. The major 
characteristic to consider is counting the phases of a bus cycle. 
Internal state variables Pp and P, are the result (see Figure 4). 
An additional internal state variable (Iz) is necessary to count the 
second bus cycle of an interrupt acknowledge sequence. As 
shown in Figure 5, !2 in conjunction with INTACK allows AS tobe 
asserted only in the first interrupt acknowledge cycle and DS 
only in the second. The RESET input is used to initialize the 
internal variables and assert AS and DS. Note also that Sp and 
S, are included in the DS equation to prevent DS from being 
asserted during a halt cycle. 
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Figure 3. Interrupt Acknowledge Timing Diagram 
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The fact that PAL devices are user programmable allows a great 
deal of flexibility for the designer. Minor timing changes are 
easily implemented by simply adding or changing a term in the 
logic equations and reprogramming the device. In this system, 
we have timing resolution to 67 ns. This same configuration can 
be used with a 24-MHz crystal for 8-MHz CPU chips. The logic 
equations would change because the OSC period would be 
42 ns. The only hardware change would be the crystal. 


An additional PAL device could also perform chip select decod- 
ing based on both address and status signals. 


Conclusion 


We have seen how a properly programmed PAL device can be 
used to replace a specialized bus controller chip and allow an 
8086 CPU to interface directly to Z-BUS peripheral(s) and/or 
memory systems. This brings all the advantages of the superior 
Z8000 peripheral family in terms of both throughput and ease of 
use to 8086 users with no increase in chip count while still 
allowing a wide range of design flexibilty. The PALASM software 
equations for the PAL16R8A 8086 to Z-BUS interface chip are 
shown in Figure 6. 
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Figure 4. 
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Figure 5. 
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TITLE 8086 TO Z-BUS INTERFACE 
PATTERN AMP3-195 

REVISION 01 

AUTHOR NICK ZWICK 

COMPANY ADVANCED MICRO DEVICES 
DATE 11/15/87 


CHIP ZBUS1 PAL16R8 


CK RESET CLK /SO /S1 /S2 NC NC NC GND 
/E /PO /P1l /RW /DS /AS /INTACK RDY /I2 VCC 
EQUATIONS 


; INTERNAL STATE VARIABLES 


ae] 
) 


/RESET* 
/RESET* 
/RESET* 


SO*/P0*/P1*/CLK 
S1*/P0*/P1*/CLK 
S2*/P0*/P1*/CLK 


/RESET* 
/RESET* 
/RESET* 
/RESET* 


PO*/CLK 

Pi* CLK* SO 
Pl* CLK* Sl 
Pl* CLK* $2 


++teteeti 


ue} 
~ 


/RESET* 
/RESET* 
/RESET* 
/RESET* 
/RESET* 


PO*/P1* CLK 
P1*/CLK 

Pl* CLK* SO 
P1l* CLK* Sl 
P1* CLK* $2 


+e teil 


H 
iwe) 
ee 


/RESET* 
/RESET* 
/RESET* 
/RESET* 


INTACK*/I2* CLK* PO*, Pl 
12*/P1 

12*/P0 

I2*/CLK* PO* Pl 


+++ 


; Z-BUS OUTPUT SIGNALS 
AS := RESET 
+ /PO*/P1* CLK*/12 
+ AS*/P0*/I2*/DS 


oO 
1p) 


RESET 

/INTACK*/PO* Pl* 

/INTACK*/PO* Pls 
I2* SO* S1* §2 
DS* PO* Pl 


CLK*SO 
CLK*S1 


++ 441 


RW 


AS* S0*/S1 
RW*/AS 


+ 


INTACK:= /RESET*AS*S0*S1*S2 
+ /RESET*INTACK* /I2*P1 
+ /RESET*I2 


/RDY := /RESET* S0*/S1* S2*/PO* Pl 


+ /RESET*/SO* S1* S2*/PO* Pl 


Figure 6. Source Listing for the Z-Bus to 8088/8086 Interface 
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A High-Performance Bus 


The VMEbus is a high-performance asynchronous bus capable 
of supporting multiple 16- and 32-bit processors. Its asynchro- 
nous nature permits easy implementation of hardware control- 
lers that reduce the time required for bus control tasks such as 
bus arbitration. By offloading these tasks to hardware, bus ar 
bitration time is minimized which consequently maximizes the 
overall bus transfer rate. 


This article describes PLD implementations of two such control- 
lers in a typical VMEbus system: the bus arbiter and the inter- 
rupter that comply with the VMEbus protocol. These state 
machines used to require multi-chip designs consisting of se- 
quencers, microprogrammed control stores, and other MSI/LSI 
chips. By using the PAL22V10 programmable array logic device, 
you can reduce substantially the number of chips needed for 
these state-machine implementations. 


Functional Modules 


A typical VMEbus-based computer system contains a bus ar 
biter and an interrupt handler board. A processor/master in- 
itiates a data transfer to a slave by requesting control of the data 
bus from the bus arbiter. Once bus control has been granted to a 
master, control signals to and from the master and slave are 
exchanged which follow predefined protocols. These protocols 
guarantee an orderly transfer of data between the communicat- 
ing modules. Interrupt servicing ‘capability is provided by the 
interrupt handler board (see Figure 1). 


PROCESSOR/ 
MASTER 2 


PROCESSOR/ 
MASTER 1 





VMEbus 


Steps in Designing the 
Bus Controllers 


VMEbus protocols outline the steps to perform any bus-related 
operation, such as transferring data over the bus between two 
modules. These protocols are described using flow diagrams 
with interactions between the communicating modules shown 
through various interface signals. For example, the priority bus 
arbiter flow diagram (Figure 2) shows how bus requests be- 
tween two modules using the same request line are resolved. 


The flow diagrams are analyzed to pick out the functions that 
can be incorporated into PLDs, and then state machines are 
created for these functions. These state machines can be 
described logically, or with flowchart symbols such as rectan- 
gular blocks (to symbolize events or control signals) and deci- 
sion diamonds (to determine control program flow). Logic 
equations are then written for these state machines. 


This methodical process is used in designing the two PLD- 
based VMEbus controllers: the bus arbiter and interrupt handler 
controller. 


A PLD Bus Arbiter 


Before any module can perform a data transfer, it must request 
control of the data transfer bus from the bus arbiter. This design 
uses a priority option bus arbiter implemented on a single 
PAL22V10 that monitors the four bus request lines BRO-3(L) 
with BR3(L) assigned the highest priority. 


CACHE/SLAVE BUS 
ARBITER 


ADDRESS 
DECODER 
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Figure 1. Intermodule Communication Through Am29PL141 (FPCs) and PLDs in a VMEbus System 
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LOCATED IN SLOT 2 LOCATED IN SLOT 1 
MASTER B REQUESTER B MASTER A REQUESTERA ARBITER 
DRIVE (DEVICE DRIVE (DEVICE 
WANTS BUS) HIGH 2 ooo oe WANTS BUS) HIGH 
; ' ARBITRATION ! 
DETECT (OEVICE WANTS 'IN PROGRESS | DETECT (DEVICE WANTS 
BUS) DRIVEN HIGH BUS) DRIVEN HIGH 
DRIVE BRT LOW DRIVE BRT Low 


EEE 


DETECT BAI LOW 
DRIVE BGIIN LOW 


DETECT BGIIN LOW 


DRIVE “BESY LOW 
RELEASE BAT 
DRIVE (DEVICE Tae 
GRANTED Bus) HicH DETECT. BESY Low 
DETECT (DEVICE GRANTED DETECT BGIIN HIGH 
BUS) DRIVEN HIGH 
peter a see SS - i 
' MASTER A HAS CONTROL! 
! OF DATA TRANSFER BUS ! PERFORM DATA TRANSFERS 


ona eee we : | 


DRIVE (DEVICE WANTS 


Bus) TO LOW 
DETECT (DEVICE 
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' TRANSFER BUS 
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Figure 2. Bus Arbitration Flow Diagram 
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Based on the flow diagram (Figure 2), the arbiter grants the bus 
to the requesting module using the highest active request line, in 
this case BR1(L). A bus grant signal is daisy-chained to all the 
devices using this request line to resolve simultaneous bus re- 
quests from two or more modules. This dictates that the arbiter 
be in the first slot of the VMEbus system, and that the module 
physically closest to the arbiter through the daisy-chain will get 
the bus grant. 


All bus requests are processed in parallel by the AND-OR array 
of the PLD. This allows priority arbitration to occur in a single 
clock cycle, which minimizes arbitration time and maximizes the 
data transfer rate on the VMEbus. 


When the bus is free, the module using the highest active bus 
request line is granted the bus. This is described with Boolean 
logic notation where registered outputs are defined with ':=’: 


IF (/BBSY*(BRO+BR1+BR2 _ if bus not busy anda 
+BR3)) THEN BEGIN request line is active 
IF (BR3) THEN BGSIN := 1; if BR3 is active, grant 

bus to device on BR3 


IF (/BR3*BR2) THEN activate bus grant daisy 


BG2IN := 1; chain 2 
IF (/BR3*/BR2*BR1) THEN _ if BR1 is active and BR3 
BGI1IN := 1; and BR2 are not, then 


grant bus to device 

using request line 1 
BRO-3 and BGOIN to 

BGSIN are active low 


IF (/BR3*/BR2*/BR1*BRO) 
THEN BGOIN := 1; 
END ; 


_ The PAL22V10 can be programmed with normal or inverted 
outputs and the logic software can support active high or low 
input polarities. 


BGTtIN(L) is asserted until the requesting module asserts 
BBSY(L). This is defined in logic equation form as: 


IF (/BBSY*BG1IN) THEN 
BG1IN := 1; 


continue asserting 
BG1IN until BBSY 
becomes active 


Priority Arbitration Options 


It may be necessary for the arbiter to force the current bus 
master to relinquish bus control for certain conditions, such as if 
a higher priority operation must be attended to. This is done by 
asserting the bus clear signal BCLR(L) based on the priority of 
the current bus master and the bus clear conditions. 


The PLD arbiter keeps track of the current bus master’s priority 
by recording which bus request line was used to gain control of 
the bus. For example, two output registers called BUS_MAS- 
TER will be set to the binary value “2” if the current bus master 
used BR2(L) to gain control of the bus. 


In this design, BCLR(L) wil be activated under two conditions: 

1. If the BUS_MASTER is 2,1 or 0 and the active bus request 
line is 3 or 2 

2. If BUS_MASTER is 0, and any bus request line is active. 


When BUS_MASTER is 3, then the arbiter will not honor any 
bus requests until the bus busy line BBSY(L) is low. The above 
conditions are expressed logically as: 


IF (BBSY*(BRO+BR1+BR2+BR3)) THEN 
BEGIN IF (BR3*(/MASTER|[1:0] = 3) ) ) THEN 
BCLR := 1; “assert BCLR if MASTER < > 3” 
IF (/BR3*BR2*(/MASTER[1:0] = 3) ) ) THEN 


BCLR := 1; 

IF (MASTER[1:0] = 0) THEN 
BCLR := 1; 

END; 


Once the BCLR(L) line is active, then it should be held until the 
current bus master releases BBSY(L). This is logically ex- 
pressed as BCLR = BCLR*BBSY, or in a high-level syntax, 


IF (BCLR*BBSY) THEN 
BCLR := 1; 


The BCLR(L) signal is defined such that uninterruptible devices 
use BR3(L). Devices that can be temporarily suspended to ac- 
commodate interrupts and higher-priority operations are as- 
signed to bus request line 0. The BCLR conditions will vary with 
your application, but any modifications will only require redefini- 
tions of the high-level logic expressions. 


The PAL22V10 is ideal as the bus arbiter because of the input/ 
output signal requirements and the large number of product 
terms needed to logically define the bus grant and bus clear 
signals. 


Processing Interrupts 


When real-time or urgent response is needed by a processor, it 
generates an interrupt request signal and waits for the interrupt 
acknowledge signal IACK(L) and IACKIN(L) daisy-chain signal. 
A 3-bit value is then read from the VMEbus when the data 
strobes are active; this value indicates which interrupt request 
line was acknowledged. These three-bits are decoded to deter- 
mine if it matches the processor's request level. 


if the interrupt is acknowledged, then the interrupting processor 
puts its status or [ID byte on the data bus for the interrupt hand- 
ler. This data is used by the interrupt handler as an interrupt 
vector. This processor can then wait in a loop until the IACK(L) 
signal from the interrupt handler is driven HIGH to signify inter- 
rupt service completion. 


Interrupt Handling: The Interrupt 
Handler Preprocessore (IHP) 


To handle external I/O or special system events (e.g., timeout, 
overflow), interrupts are supported on the VMEbus through an 
interrupt handler (IH) module. 


The logic complexity of the IH is reduced by offloading some of the 
initial interrupt recognition tasks to a PLD. APAL22V10 can be 
programmed as an IHP to preprocess interrupt requests, obtain 
the bus and handle data transfers (such as interrupt acknowledge 
signals). Controlis passed to the interrupt handler only when the 
IHP latches the interrupt vector. 


Interrupt request processing, bus acquisition, and the interrupt 
vector transfer phase are all specified using logic equations 
which are written using a high-level Boolean notation. The 
PAL22V10 monitors seven interrupt request lines and five other 
control inputs, and generates ten registered outputs to send 
control signals to the interrupt handler and the VMEbus drivers. 
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Interrupt Logic 
All interrupt request lines are monitored according to the follow- 
ing logic equation: 


IF (IR1 + IR2 + IR3 + IR4 + IR5 + IR6 + IR7) THEN 
BR3 := 1; this interrupt handler uses 
the BR3 request line 


If any of the interrupt lines are active, then BR3(L) is asserted. 
This initiates the bus acquisition phase. 


The next step is to wait for the bus grant in signal. Only when 
BGIN3(L) is active will BBSY(L) be active. This is expressed 
logically as: 


IF (BR3*/BG3IN) THEN _ 


BR3 := 1; —_  ————__ [A] 
IF (BR3*BG3IN + BBSY*/SERVICE_DONE) 
THEN —m—\— [B] 
BBSY := 1; 


Equation [A] continually asserts BR3(L) as long as BG3IN(L) is 
not active, while [B] asserts BBSY(L) only when request line 
BR3(L) and BGSIN(L) are active, or if service has not been 
completed once the IHP asserts BBSY(L). 


When the IHP is the bus master, it puts the 3-bit interrupt-line 
acknowledge value on the bus and applies the data strobes. 
When the DTACK(L) signal is received from the interrupter, the 
IHP then strobes the status byte from the bus into a register on 
the interrupt handler card. The IHP informs the interrupt handler 
that a status/ID byte is ready and to commence interrupt 
servicing. 


The IHP takes only a single clock cycle to resolve interrupt 
priorities because all interrupt/input signal lines are processed 
in parallel by the PLD. This is expressed in a logic description 
language as follows: 


IF (BBSY) THEN 


BEGIN 
(1) IF (IR7) THEN IR7 was active 
INTR[2:0] := 7; 3-bit value acknowledging 


interrupt line 7 


IF (/IR7*IR6) THEN IR6 active, IR7 inactive 


INTR[2:0] := 6; acknowledge interrupt line 6 
IF (/IR7*(IR6*IR5) THEN IR5 highest priority 
interrupt line active 


INTR(2:0] := 5; 


(2) IF (/IR7*/IR6*/IR5*/IR4*/IR3*/IR2*1IR1) THEN 
INTR[2:0} := 1; acknowledge interrupt line 1 
END; 


As noted above, if both the IR7(L) and BBSY(L) signal are ac- 
tive in (1), then regardless of the value of the other interrupt 
lines, the three-bit value generated will be “111” or 7. In (2), if 
IR1(L) and BBSY(L) are active and the other six control signals 
are inactive, then the three-bit output value will be “001” or 1. In 
(2), IR1(L) is the highest priority line active. 


The remaining preprocessing involves completing the interrupt 
vector byte transfer. This is logically expressed as: 


IF (BBSY) THEN 
BEGIN 
IACK := 1; begin interrupt acknowledge 
(X) IF (DTACK) THEN daisy chain; if device sends 
LATCH_STATUS := 1; data transfer acknowledge 


END; (DTACK) signal, then latch 
IF ((ACK) THEN the status 
AS := 1; assert the address strobe signal to inform 


the interrupter that the interrupt 
acknowledge level is ready 


Once the 8-bit status of ID byte is transferred successfully via 
the DTACK(L) signal (X), the IHP PAL device informs the inter- 
rupt handler module to begin the interrupt service routine: 


IF (BBSY*LATCH_STATUS +BBSY*START_SERVICE* 
/SERVICE_DONE) THEN 
START_SERVICE := 1; 


In this design, the START_SERVICE signal is constantly as- 
serted until the interrupt handler generates a SERVICE_DONE 
signal, at which point START_SERVICE is brought LOW. The 
above equations will generate the timing diagram in Figure 3. 
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Figure 3. Timing Diagram 





Controller Design Simplified only analyze the bus protocols, convert these into state ma- 
chines and then write assembly language programs or high- 


with Development Tools level logic equations to describe these state machines. The 
The task of programming FPCs and PLDs as VMEbus control- assembler and logic software will then process these programs 
lers is simplified with the development tools. The designer need and equations to fit into the FPC or PLD. 





on Monolithic ari Memories ol 2-351 


oSE-S 


DEVICE ARBITER (AMPAL22V10) " VMEbus Priority Arbiter Option IF (BG3IN*/BBSY) THEN 
For this application, if the bus is busy, then bus requests will be BGSIN s= 13 
deferred until the BBSY(L) (bus busy line active LOW) goes HIGH, IF (BG2IN*/BBSY) THEN 
unless the bus request is of a higher priority. In this case, the BG2IN := 1; 
bus clear signal BCLR(L) is asserted and the current bus master IF (BGIIN*/BBSY) THEN 
relinquishes control. If the bus is free and 2 or more masters BGIIN := 1; 
request service, then the bus is granted to the master with the IF (BGOIN*/BBSY) THEN 
higher priority request line. " BGOIN := 1; 
PIN CLK =1 IF (BG3IN*BBSY) THEN '" when requester responds with BBSY active, then " 
/BBSY =2 MASTER{1:0] := 3 ;  "™ MASTER is set to the line currently controlling" 
/BRO =3 IF (BG2IN*BBSY) THEN " the bus; this is used internally for future by 
/8R1 =4 MASTER(1:0}] := 2; " bus priority resolution ? 
/BR2 25 IF (BGIIN*BBSY) THEN 
es /BR3 =6 MASTER([1:0] := 1; 
IF (BGOIN*BBSY) THEN 
= /BCLR = 23 MASTER[1:0) := 0 ; 
S /BGOIN = 22 
o /BG1IN = 21 IF (BBSY) THEN ' when bus busy, then remember current bus master line " 
= /8G21N = 20 MASTER[1:0] := MASTER(1:0} ; 
= 
S /BG3IN = 19 
MASTER[1:0] = 18,17; "if a higher priority bus request occurs and the bus is busy, then begin 
By bus resolution (by using bus clear BCLR) under the following conditions : 
BEGIN - if BR3 and present bus master is not using bus line 3; i.e., once a device 
= "if a bus request occured and bus = free, then activate the bus grant in uses BR3, then no one can force it to relinquish the bus or 
3 Line corresponding to the highest priority bus request line " - if BR2 and present bus master is not 3; i.e., BR2 can assert bus clear 
r=) except when the presetn bus master obtained the bus using bus line 3 or 
3 IF (/BBSY*(BRO+BR1+BR2+BR3)) THEN - if BR1 and the bus master obtained the bus using bus request line 0, then 
® BEGIN assert BCLR 
ry IF (BR3) THEN 
BG3IN := 1; ==> if BRO is activated when the bus is busy, then do not assert BCLR; BRO 
IF (/BR3*BR2) THEN devices cannot force anyone off the bus. This is one possible priority 
BG2IN := 1; implementation. 
IF (/BR3*/BR2*BR1) THEN ” 
BGIIN := 1 ; IF (BBSY*(BRO+BR1+BR2+BR3)) THEN "if a bus request occured and bus = busy" 
IF (/8R3*/BR2*/BR1*BRO) THEN BEGIN 
BGOIN := 1; IF (BR3*(/MASTER(1)+/MASTER{0})) THEN "if BR3 and present master is not " 
END ; BEGIN 
BCLR := 1; "using line 3, then assert BCLR " 
“ while the bus grant in line is active and the bus is still busy, IF (BCLR*BBSY) THEN 
then latch the bus grant signals until BBSY{L) becomes active" BCLR := 1; 
END; 


Figure 4, PLPL Specifications for the Example of Figure 1 
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IF (/BR3*BR2*(/MASTER [1] +/MASTER LO] )) THEN 
BEGIN 
BCLR := 1; 
IF (BCLR*BBSY) THEN 
BCLR := 1; 
END; 
IF (/BR3*/BR2*BR1*/MASTER[1]*/MASTER (0) ) THEN 
BEGIN 
BCLR := 1; 
IF (BCLR*BBSY) THEN 
BCLR := 1; 
END; 
END ; 
END. 
“Test vectors used for simulation and testing" 
TEST_VECTORS 
IN CLK , /BBSY , /BRO , /BR1 , /BR2 , /BR3 ; 
OUT /BCLR , /BGOIN , /BGIIN , /BG2IN , /BG3IN , MASTER[1:0] ; 
BEGIN 
u“ / 


tee 
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Figure 4. PLPL Specifications for the Example of Figure 4 (Cont’d.) 
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DEVICE INTR_HND_CTRLR CAMPAL22V10) " VMEbus interrupt handler preprocessor 


This PAL serves as an interface between the interrupt servicing logic in the 
interrupt handler and the bus arbiter: when an interrupt is detected, the 
PAL requests the bus from the arbiter. When the bus is granted, the bus 
busy BBSY(L) is asserted. The 3 bit code corresponding to the highest 
priority interrupt request is put on the bus, and both IACK(L) and AS(L) 
are also asserted. The PAL waits for DTACK(L) LOW and then latches the 
status/ID byte. The START_SERVICE signal informs the interrupt handler 
to begin the interrupt service sequence with the status/ID byte latched in. 
Note : this interrupt handler interface PAL handles all interrupts using 

bus request line BR3; this ts application-dependent " 


PIN 

CLK = 1 

/IR1 = 2 “interrupt request lines 1 to highest priority 7" 
/tR2 0 = 3 

/1R3 = 4 

JRE 5 

/IRS = 6 

JRE = 7 

“IR? = 8B 

/DTACK = 9 “data transfer ack; used by interrupter to indicate 

status/ID data byte ready on bus " 
/BG3IN = 10 “bus grant in signal from arbiter" 
SERVICE_DONE = 13 “signal from interrupt handler indicating 
service complete" 

LATCH_STATUS = 23 "Latches data from bus for interrupt service routine" 
/BBSY = 22 "bus busy" 

/IACK = 21 "interrupt acknowledge" 

JAS = 20 “address strobe" 

/BR3 = 19 “bus request signal to bus arbiter" 

GET_BUS = 18 “used internally by PAL " 

INTR (2:0) = 15:17 "3-bit level code put on bus " 


START_SERVICE = 14 ; "sent from PAL to intr handler " 


BEGIN 


“if any interrupt request line is active, then try getting the bus. All the 
interrupts handled by this PAL will use the bus request line BR3. A 
request can occur only when no other interrupt is being serviced (i.e., 
bus should not be busy and bus grant in should not be active) " 


IF (/BBSY*/BG3IN*{IR1 + IR2 + IR3 + IR4 + IR5 + IR6 + IR7)) THEN 
BEGIN 
GET BUS := 1; 

=; Ngubject to application" 


“jf interrupt handler wants the bus, the bus grant line is still inactive, 

and the interrupt request is still active, then continue requesting the bus " 
IF (GET_BUS*BR3*/BG3IN*(IR1 + IR2 + IR3 + IRS + IRS + IR6 + IR7)) THEN 

BEGIN 

BR3 := 1; 

GET_BUS := GET_BUS ; 

END; 


"if bus grant in line is active, then assert the bus busy signal" 
IF (GET_BUS*BG3IN + BBSY*/SERVICE DONE) THEN 
SBSY := 1; 


“if interrupt handler received bus, then acknowledge the interrupt" 
IF (BBSY) THEN 
BEGIN 
IF (/START_SERVICE) THEN 
TACK := 1; 
IF (DTACK) THEN "when the interrupter responds with DTACK(L), then " 
LATCH_STATUS := 1; "latch the status/ID byte into the interrupt handler" 


IF.CIR7) THEN 
INTR(2:0] 2:= 7; "IR7 line is highest priority" 
IF C/IR7*IR6) THEN 
INTR(2:0] := 6; 
IF (/IR7*/IR6*IR5) THEN 
INTR{2:0] := 5 ; 
IF C/IR7*/IR6*/IRS*IR4) THEN 
INTR(2:0] s= 4; 
IF C/TR7*/TR6*/IRS*/IR4*IR3) THEN 
INTR(2:0] := 3 ; 
IF C/IR7*/1RE*/IRS*/1TR4*/1R3*IR2) THEN 
INTR[2:0] := 2; 
TF (/IR7*/IR6*/IRS*/IR4*/IR3*/IR2*1R1) THEN 
INTR(2:0] = 1; "IR1 line is lowest priority" 
END; 


Figure 5. PLPL Specifications for the Example of Figure 1 
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IF CLACK*/AS + IACK*/DTACK) THEN “hold the interrupt acknowledge level until” 
INTR{2:0] := INTR(2:0] ; “strobed by address strobe signal or until 
the interrupter acknowledges the 3-bit 
level with OTACK . " 


"if the IACK line is active and interrupt servicing has not yet begun 
activate the address strobe line AS " 
IF CIACK*/START_ SERVICE) THEN 
AS := 1; 


“gend the start service signal until service is done" 
IF (BBSY*LATCH STATUS + BBSY*START_SERVICE*/SERVICE_ DONE) THEN 
START_SERVICE := 1; 


END. 


TEST_VECTORS 

IN CLK, /IR1, /IR2, /IR3, /IR6 , SIRS , /IR6, /IR7, 
/DTACK , /BG3IN , SERVICE_DONE; 

OUT START_SERVICE , GET BUS , INTR(2:0] , LATCH_STATUS , 
/AS , /TACK , /BBSY , /BR3 ; 

BEGIN 
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Figure 5. PLPL Specifications for the Example of Figure 1 (Cont’d.) 
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Introduction to Communication 
Systems 


Modern communications has developed from two major sources: 
telephones and computers. Within telephony, what was once 
straightforward engineering in the analog telephone system has 
grown over 100 years into the sophisticated discipline of telecom. 
Within computers, communication needs have expanded rapidly 
over the last 20 years from simple asynchronous terminals 
clustered around a host to also include networking of PCs, work- 
stations and minicomputers across rooms and continents. Both 
areas can be expected to merge into complex information net- 
works which handle all aspects of communications over the next 
10-20 years. A graphic portrayal of a hypothetical future system 
is shown in Figure 1. 


Historically, the system architecture, engineering and compo- 
nents of these two converging areas have been dissimilar. As 
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telephony systems become digital-based and computer systems 
use more of the telephony system for interaction, this dissimilarity 
is eroding. In particular, digital design techniques developed in 
computer engineering are used increasingly in telecom. The 
original application was in the trunk links between exchanges 
where digital time division multiplexing allowed a much higher 
density of circuits over a given pair of wires. This spread into the 
switch, line card and other areas of the exchange until only the 
subscriber line to the exchange remains largely analog. Develop- 
ment of the Integrated Services Digital Network (ISDN) interna- 
tional standard will make this digital as well, enabling data and 
video as well as voice to be handled in a uniform and flexible 
manner. Future telecom engineering will therefore have much in 
common with the digital design techniques used in computers 
and the distinction between them will become minimal. 
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Figure 1. Communications in the Nineties 
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Along with the strong trend in equipment design towards digital 
systems, there is a parallel move towards a more global market 
for equipment. The development costs for major pieces of tele- 
com systems, such as central office switches, require a large 
market in order to amortize the costs. This implies a requirement 
for standards beyond national boundaries, permitting sales into 
foreign markets with minimum engineering changes. This in- 
creased market size on top of increased complexity of services is 
realizable only through widespread standardization. It also 
makes such a universal system more complex and therefore 
more elusive. 


The computer area has evolved around a number of proprietary 
computer families whose architectures and operating systems 
were incompatible to a large degree. The need to interconnect 
multiple computer systems with high-speed links led to a polari- 
zation around proprietary standards as local area network (LAN) 
interconnect standards were evolved by each major manufac- 
turer. Recent moves towards decentralized computing, based on 
personal workstations of varying power, has crystallized users 
around a few LAN standards, but the dream of universal LAN 
interconnectivity among equipment is still far off. 


Communications therefore requires widespread standards for 
any meaningful inter-vendor or international operation. Teleph- 
ony is the more advanced in this area. Standards efforts by the 
Bell system in the US and the local PTT (the governmental Post, 
Telephone and Telegraphy authority) in other industrial countries 
has ensured that any equipment which connects to the public 
telephone network operates fully with any other piece of equip- 
ment. 


The main international body which regulates that is the CCITT 
(Consultative Committee for Telephony and Telegraphy) through 
a series of “recommendations”, which are effectively standards. 
In the USA, before deregulation, AT&T published technical stan- 
dards or “Bell Pubs” which specified the operating characteristics 
for equipment in a manner similar to and sometimes based onthe 
CCITT recommendations. Since deregulation, AT&T continues 
to issue such documents as it pertains to their equipment and the 
RBOCs (Regional Bell Operating Companies) now issue their 
own either directly or through their joint research organization, 
BeliCoRe (Bell Communications Research). 


Despite strenuous efforts among governing bodies, problems of 
compatibility have existed. Deregulation, the increase in the 
number of suppliers, international traffic and equipment sales, 
and most of all, the massive increase in equipment complexity 
have led to a growing demand for flexibility within telecom 
systems to handle compatibility issues. Fully compatible systems 
on aglobal scale remain an unrealistic goal before the billenium. 
Most systems are variants on a few widespread standards. The 
small deviations among manufacturers or countries can fre- 
quently be handled by a minimal amount of flexibility in the 
system design, such as that provided by PLDs. 


The situation on the computing side remains even more chaotic. 
Other than IBM, no vendor has managed to establish a wide- 
spread standard based on their own proprietary network. All 
others have turned to variants of the newer LAN standards, such 
as Ethernet, but even here, compatibility at the hardware levels 
does not necessarily mean compatibility at the upper or software 
levels. 


Features of Programmable Logic 
in Communications 


Programmable logic has already demonstrated its usefulness in 
computer systems in a variety of roles, including address decod- 
ing, state machines and “garbage collection”. All these applica- 
tions exist in communications, but others particular to this market 
also exist. 


The problems of interconnecting dissimilar systems will remain in 
communications for a number of years. Equipment manufactur- 
ers are therefore faced with a dilemma of how to combine the 
economies of scale associated with volume production with the 
need to interface with multiple vendor equipment, some of which 
deviates from the standards. PLDs offer a unique method of 
customizing logic to conform to deviations while maintaining 
identical hardware layout and parts. 


Power and space requirements are considerations in most 
computer systems, but normally are relegated to being secon- 
dary issues when compared to performance. The opposite is true 
of telephony, where these factors assume paramount importance 
over speed. In order to satisfy requirements from both areas, 
components must therefore be available in both high-speed and 
low-power versions. 


In telephony, certain high-volume functions such as line card 
interfaces justify VLSI devices. But a large number of other lower- 
volume functions differ among equipment vendors and applica- 
tions. Such design problems cannot be easily solved with dedi- 
cated VLSI components. With the introduction of CMOS PLDs, an 
opportunity now exists to combine higher integration with low 
power and the ability to modify designs without hardware compo- 
nent or board alteration. 


In computers, the proliferation of decentralized computing power, 
particularly in the form of PCs and workstations, has brought the 
power issue into focus as engineers attempt to minimize power 
consumption and the associated heat generation. While minis 
and mainframes continue to strive for increased speed, the bulk 
of applications now have a need for both lower-power compo- 
nents and a flexibility which permits each unit to communicate 
with as many of the office automation interconnection schemes 
as possible. Again, the advent of CMOS PLDs offers a solution to 
the system design problems associated with this. 


Basic Telephony 


Basic telephony is loosely defined as the classic telephone 
system. It includes such major components as the voice tele- 
phone with twisted pair interface to the exchange, as well as the 
Private Branch Exchange (PBX), the Central Office (CO) and the 
trunk lines which interconnect them. The voice telephone has little 
scope for PLDs due to high volume and extremely low cost 
requirements. From an architecture point of view, both the PBX 
and the CO can be considered similar and divided into several 
main components: 


* The line cards which interface to subscriber lines 

¢ The central switching facility which routes calls through the 
exchange 

¢ The trunk interface which connects to other exchanges 
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A simplified structure of a PBX is shown in Figure 2. 
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Figure 2. PBX Architecture Example 
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Line cards are the most cost- and power-sensitive components of 
the basic telephony system. They are usually fabricated in 
volume and incorporate VLSI devices available from several 
semiconductor houses. The scope for PLDs is limited, due to 
these constraints and the limited customization needs of the 
simpler functions of the voice telephone. 


The central switch in modern PBX and CO exchanges is a digital 
memory-based system controlled by a minicomputer. As such, it 
contains many familiar opportunities for PLD applications from 
the computer industry, but a limited number that are distinctly 
communication-oriented. 


The trunk interface is an area that received little attention until 
recently as the interface was buried in the heart of the telephony 
system. Its moderate unit volume attracted little interest in dedi- 
cated silicon development. Deregulation of the Bell system in the 
US has resulted in a proliferation of equipment complying with the 
widespread T1 standard developed by AT&T for Multiplexers and 
other Channel Service Units (CSUs) which use trunk links. 


From its beginnings as a means of allowing several voice circuits 
to share the same cable, T1 has developed in application to the 
stage where it is being used in many cases where up to 24 x 64 
Kb/sec channels are required in a point-to-point configuration 
both within and between isolated customer sites. This application 
allows data and video, as well as the original voice connections, 
to be multiplexed over existing standardized links provided by 
both AT&T and the RBOCs. It is one of the fastest growing areas 








TRANSMIT 
DATA 







of modern telecom and is spawning related equipment suchas T1 
switching nodes and T1 testers. 


The basic interface to the T1 connection can be engineered ina 
number of ways but basically consists of a transmit and receive 
section, as shown in Figure 3. 


The transmit section of a T1 mux must buffer and rate-adapt the 
bit stream of incoming channels which are to be multiplexed, 
perform a 24-to-1 multiplex function and then encode the result- 
ing bits with appropriate framing onto the line using a bipolar line 
driver. The receive section of amux contains acorresponding line 
receiver and decoder. The recovered NRZI data is then fed into 
a deframer which detects frame synchronization and allows 
channel demultiplexing and buffering. 


Several of these functions can be efficiently implemented using 
programmable logic. The encoder and decoder for both the North 
American T1 B8ZS and the European HDB3 coding schemes can 
be implemented in such devices (pages 2-362 and 2-384, respec- 
tively). The deframing problem for the most common current T1 
framing scheme (D4 Superframes) can also be solved with a few 
PAL devices (page 2-404). In the newer T1 Extended Super- 
Frame (ESF) standard, an error detection scheme is superim- 
posed on the framing bit pattern using a CRC-6 scheme. The 
CRC generator for this fits nicely within a single PAL device (page 
2-431). 


Two applications are also shown for the LCA (Logic Cell Array) 
device. Page 2-435 shows a 32:1 mux for a T1 link, and page 
2-444 shows a format converter for a PBX. 
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Data Communications 


Unlike telephony, data communications is a relatively new disci- 
pline and equipment designed to interconnect data processing 
equipment evolved in parallel with mainframe computers. Such 
equipmenttends to be high-speed and less concerned with power 
requirements than telephony equipment. Its main function is to 
transmit binary datain a reliable manner across a room or across 
a continent between computers compatible enough to exchange 
data. Awide variety of methods exist to do this at speeds ranging 
from 300 bits per second to 100,000,000 bits per second. As 
distance increases, the speed options available become limited. 
Generally, the options can be considered in a number of catego- 
ries, in order of increasing speed, cost and sophistication: 


Asynchronous terminals and connections using existing 
analog phone lines 

Synchronous terminals and dedicated data links 
High-speed point-to-point links, including microwave and 
satellite 

Multipoint, medium-speed local! area networks 
Multipoint backbone, high-speed local area networks 


The slowest and most widely available category includes most 
“dumb” terminals, which are normally linked to a host system 
either directly via an RS-232C/V.24 serial link or over a standard 
telephone line via a voiceband modem. Maximum data rates 
available are 9600 bits/sec and the typical modem rates are 
currently 300 and 1200 baud. Devices used in building either type 
of link are typically LSI low-cost NMOS devices with most new 
devices being VLSI CMOS. Where PAL devices offer the greatest 
advantage is in higher-performance applications for which VLSI 
solutions do not yet exist. 


There are, however, a number of additional contro! functions 
required around standard UART/USARTs which can be optimally 
implemented in a single PAL device, such as providing address 
latching and an asynchronous divider around a Motorola 6850 
Asynchronous Communications Interface Adapter (page 2-453). 


In the case of modems, however, several highly sophisticated 
techniques beyond that found in the standard modem “box” can 
be used to improve throughput on the link. One technique used 
to increase the bit rate across the link both in the slow, voiceband 
modems and in the higher-speed satellite modems is Quadrature 
Amplitude Modulation (QAM). Another technique uses a convo- 
lutional encoding scheme to increase the reliability of data trans- 
mitted across a noisy link. This cuts down the requirement for 
retransmission of lost data and effectively increases the through- 
put of the link. 


Both schemes can be implemented efficiently in PAL devices as 
described in the application notes on pages 2-456 and 2-463. 


Synchronous terminal links usually comply with one of a number 
of standards for which low-cost LSI devices exist. These provide 
for data rates above that of async links and generally can perform 
up to 1 Mbit/sec and beyond, but only over a limited distance, such 
as within a building. Data rates above 2400 baud over long 
distances are currently expensive and limited in popularity. 


One scheme which is expected to overcome this and gain wide 
application over the coming years is the provision of 64Kbit/sec 
data across the new digital telephone system which will be 
available on the international Integrated Services Digital Network 
(ISDN) “S" and “U’ interfaces to the phone line. This is a standard 
to which virtually every telephony authority and a large number of 
major computer manufacturers have given their support. Among 
the difficulties which will be encountered is the problem of 
adapting existing terminal devices to communicate with the wave 
of VLSI CMOS devices which are being released for use on the 
ISDN “S” and “U” interface link. 


The “S” interface is being addressed by a number of VLSI 
solutions. The recently standardized “U” interface has no such 
devices and has an additional encoding/decoding requirement 
similar to trunk lines, due to the longer distances involved to the 
central office. A4B3T coding scheme is used, which can be im- 
plemented using PLDs, as shown in the application note on page 
2-475. 


Further applications in data communications can be found in 
LANs. While several VLSI chip sets exist for standards such as 
Ethernet, a number of proprietary systems are in use and their 
numbers are increasing steadily. Some of the more basic func- 
tions required in the LAN interface, such as the Manchester 
encoding function of Ethernet, can be provided in a single PAL 
device (page 2-499). 


Conclusion 


The opportunities for the use of PLDs in communications is 
widespread andis becoming increasingly important for the follow- 
ing reasons: 


¢ Power and space limitations. 
¢ Increasing adherence to multiple standards, yet the need 
for... 
¢ Standardized hardware to permit low-cost 
volume production. 
¢ Need to reconfigure equipment in the field 
at the minimum overhead. 


Where speed is an issue, the broad range of fast bipolar PAL 
devices provides arange of solutions. Butinthe more recent trend 
towards low-power circuitry, the need for which is most apparent 
in equipment such as in fanless or portable personal computers, 
The family of low-power CMOS devices provides the same func- 
tionality at a fraction of the power budget. 
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Bipolar with eight zero substitution (B8ZS) coding provides 
clear channel capability, which is required in an Integrated 
Service Digital Network (ISDN) or an open network. A (B8ZS) 
encoder and decoder can be implemented in CMOS PAL 
devices. The encoder and decoder convert between non- 
return-to-zero (NRZ) data and B8ZS while providing a flexible 
interface to framing chips. 


Principles of Coding 


Communication lines require a data stream with imbedded 
clocking information. T1 digital communication lines transmit 
Alternate Mark Inversion (AMI) coded voice and data. For AMI, 
logic “1's” are encoded as alternating positive and negative 
pulses, while logic “O’s” are encoded as zero voltage. This 
coding scheme ensures that consecutive pulses alternate in 
polarity. The pulse edges provide the clocking. A pair of 
consecutive pulses with the same polarity is called a bipolar 
violation. 


Since a string of zeros in AMI coding does not contain any 
transitions, a clock cannot be derived. If this string of O's is 
too long, the system will lose synchronization. Therefore, the 
AT&T T1 standard limits the maximum number of consecutive 
0's to fifteen. For voice transmission this restriction is 
acceptable; however, for data it is not. A variation of AMI could 
provide clocking information while not restricting user data. 


B8ZS coding is such an improvement on the AMI code; logic 
“1's” are encoded as alternating positive and negative pulses, 
while logic “O’s” are encoded as zero voltage. Additionally, 
B8ZS substitutes a string of eight O's with a code word as 
shown in Table 1. The polarity of the preceding pulse 
determines which B8ZS code word is substituted. The B8ZS 
code words contain bipolar violations in the fourth and seventh 
bit positions. 


Since B8ZS coding allows strings of 0's for any length, it 
supplies clear channel capability. Clear channel capability is 
required for an open network and ISDN. With B8ZS coding, the 
maximum length of consecutive zeros is seven bits. 


PRECEDING PULSE B8ZS CODE WORDS 


a < e 


Table 1. B8ZS Coding 








B8ZS coding is used in the following applications: 
e T1 multiplexers 

e T1 repeaters 

e T1 channel banks 

e T1 network equipment 

« Video CODECs 


Functional Description 


Implementing B8ZS coding in such equipment requires 
converting between NRZ and B8ZS format. In this design, 
CMOS PAL devices encode and decode data while providing 
flexibility. Current single chip solutions do not interface to all 
framing devices. Using the PAL devices, B8ZS line coding can 
be added to any framing technique. This design performs two 
operations: encoding and decoding. The encoding operation 
converts NRZ data into positive and negative pulses with B8ZS 
code words. Decoding converts the positive and negative 
pulses into NRZ data and substitutes eight 0's for the B8ZS 
code words shown in Table 1. 


B8ZS Encoder 


The B8ZS encoder converts NRZ data to B8ZS data. Each 
string of eight consecutive O's is replaced with a B8ZS code 
word. 

Two PAL devices can be configurated to encode the data 
stream. Figure 1 shows the PAL devices, designated, PAL_A 
and PAL_B. Signals on Figure 1 are explained in Table 2. 
PAL_A, a PALC16R8Z, implements the 3-bit counter, which 
searches for strings of eight 0’s. An 8-bit string of O's is 
detected by C2-CO and NRZ_IN. PAL_A also incorporates a 
5-stage shift register, which delays the NRZ_IN data stream to 
ensure proper alignment with the counter’s output. The 
delayed data, NRZ_DELAY, is encoded by PAL_B. 


NRZ_CK 
NRZ_IN 
C(2-0) 
NRZ_DELAY 
cS Current state 


DESCRIPTION 
Active low master reset 
External NRZ clock 
NRZ input data stream 
Counter output 


Delay NRZ data stream 


Positive pulse output 


Negative pulse output 


Table 2. B8ZS Encoder Pin Description 
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Figure 1. B8ZS Encoder 


PAL_B, a PALC16R6Z, encodes the data using two state 
machines; one generates the B8ZS sequence, and the other 
generates the positive and negative pulse outputs. Figure 2a 
shows the B8ZS Sequence State Machine, which generates 
the B8ZS sequence. If an 8-bit string of O's is detected, the 
B8ZS Sequence State Machine forces the B8ZS code word to 


C2+C1+CO+N 


NRZ_IN 






be encoded. Otherwise, B8ZS Encoder State Machine 
performs normal AMI-type encoding. The B8ZS Encoder State 
Machine encodes NRZ_DELAY into positive and negative 
pulses based on the B8ZS Sequence states, Q2-Q0 (see 
Figure 2b). The positive and negative pulses (PPO and NPO) 
occur six clock cycles after the input NRZ_IN. 
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B FORCE 
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Figure 2a. BBZS Sequence State Machine 
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Figure 2b. B8ZS Encoding State Machine 
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B8ZS Decoder 


Decoding is performed by a PALC16R6Z and a shift register DESCRIPTION 
a 
code words. When a B8ZS code word is detected, the PAL 


(see Figure 3). The PAL device searches for incoming B8ZS 


device substitutes an 8-bit string of 0's. The output, NRZ_OUT, 


External T1 clock 


is delayed eight clock cycles from the positive and negative Positive pulse input 

pulse inputs (PPI and NPI). The device flags the B8ZS code 

words and any bipolar violations. If two bipolar violations occur Negative pulse input 
NRZ data stream 

bipolar violation. The B8ZS Decoding State Machine shown in 

Figure 4 searches for the two B8ZS code words. The state NRZ_OUT NRZ output data stream 

machine has two groups of states; SN(0-7) searches for B8ZS flag 

000-+0+- and SP(0-7) searches for 000+-0-+. The decoder's 


state machine shows reduced equations from the B8ZS and BPV (Bipolar Violation) flag 


in a non-B8ZS 8-bit data string, the device will only flag a single 





BPV (Bipolar Violation) outputs. Note: PP! and NPI are never 


both HIGH. Table 3. B8ZS Decoder Pin Description 












PALC16R6Z 


3| 


R3 
R2 
Ri 
RO 


B8&ZS 





T1_CK =e CLR P 

SHLD C166 GH 

SER sh 
ACDEFGH © 


NRZ_ OUT 


Figure 3. B8ZS Decoder 
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PPI*NPI PPi*NPI PPI*NPI PPrNPI 







PPI*NP} 





B8ZS = 1 B8ZS = 1 





PPINPI PPI*NPI 













SP4 
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SP5 
BPV=PPI + NPI 
B8ZS = 1 
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Figure 4. B8ZS Decoder State Machine 
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TITLE PAL_A 
PATTERN B8ZS ENCODER PAL A 
REVISION 1.02 

AUTHOR THERESA SHAFER 
COMPANY MMI 

DATE 6/6/87 


CHIP PAL_A PAL16R8 


;PINS 
31 2 3 4 
NRZ_CK /RST NRZ IN NC 
711 12 13 14 
/OE co C1 C2 
; INPUTS: NRZ CK 

: /OE ACT 
; /RST ACT 
; NRZ_IN SERIA 
: AN 
; OUTPUTS: NRZ_ DELAY 

; c2 = co 

; 

EQUATIONS 


e 
e 


c2 * Cl * CO 
/C2 * /Cl 
/C2 * /CO 
RST 

NRZ_IN 


/C2 


t+++4ll 


=e Se 


/cL /Cl * /CO 
Cl * CO 
RST 


NRZ_IN 


+++ 


=e te 


/cO co 
RST 


NRZ_IN 


+ + Il 


~e we 


° 
‘ 


5-STAGE PIPELINE DELAY 


/NRZ_DELAY := /R3 + RST 


/R3 := /R2 + RST 
/R2 := /R1 + RST 
/R1L := /RO + RST 
/RO := /NRZ_IN + RST 


5 6 7 8 9 

NC NC NC NC NC GND 
15 16 17 18 19 20 
NRZ_DELAY R3 R2 R1 RO vee 


EXTERNAL CLOCK 


IVE LOW OUTPUT ENABLE SIGNAL 

IVE LOW MASTER RESET SIGNAL 

L NRZ DATA STREAM WHICH IS ENCODED 
D TRANSMITTED 


DELAY NRZ DATA WHICH IS INPUT FOR PAL B 
COUNTER OUTPUTS (111) INDICATES 
AN 8-BIT STREAM OF ALL ZEROS 


3-BIT COUNTER WITH SYNCHRONOUS RESET AND ENABLE 


MASTER RESET 


CLEAR COUNTER WHEN NRZ_IN = 1 
MASTER RESET 
CLEAR COUNTER WHEN NRZ_IN = 1 
MASTER RESET 
CLEAR COUNTER WHEN NRZ_IN = 1 
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SIMULATION 


TRACE _ON NRZ_CK /OE /RST NRZ_IN 
C2 C1 CO NRZ_DELAY R3 R2 R1 RO 


SETF OE ; ENABLE OUTPUT 
RST ; RESET 
CLOCKF NRZ_CK 


INITIALIZE 


=e 


SETF NRZ_IN 
/RST 
CLOCKF NRZ_CK 


PERFORM ZERO SUBSTITUTION 


=e 


SETF /NRZ_IN 
FOR J:= 0 TO 8 DO 
BEGIN 
CLOCKF NRZ_CK 
END 





ALTERNATE POSITIVE AND NEGITIVE PULSES 


=e 


SETF NRZ_IN 
CLOCKF NRZ_CK 
SETF NR2Z_IN 
CLOCKF NRZ_CK 
SETF /NRZ_IN 
CLOCKF NRZ_CK 
SETF NRZ_IN 
CLOCKF NRZ_CK 
SETF /NRZ_IN PERFORM ZERO SUBSTITUTION 
FOR J:= 0 TO 6 DO 

BEGIN 

CLOCKF NRZ_CK 

END 


=e 


SETF NRZ_IN 
CLOCKF NRZ_CK 


TRACE_OFF 





Pa Monolithic rar Memories ra 2-367 


B8ZS Coding 


(Li-19-66) 


1936 


SE 


Fs 
INC. 


T RELE 


MEMORIES 


- MARKEE 


eyes 
Pera 


Ve 


FPALASM SIMULATION. 


MONOLITHIC 


GFYRIGHT 


C 


tC) 


= LISTING 


CTIVE TRACE 


SEL 


FALASM STIMULATION 


SHAFER 


THERESA 


PPT 


Auth cae 


A 


Title 


FAL 


Company 
Date 


ENCODER FAL A 


REZS 


Fattern 


6/6/87 


4 
on 
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TITLE PAL_B 


PATTERN B8ZS ENCODER PAL B 


REVISION 1.04 

AUTHOR THERESA SHAFER 
COMPANY MMI 

DATE 6/6/87 


CHIP PAL_B PAL16R6 


8-BIT STREAM OF ALL ZERO WHEN 


; PINS 

rl 2 3 4 5 6 7 8 9 
NRZ_CK /RST NRZ_IN C2 cl co NRZ_DELAY NC NC 

711 12 13 14 15 16 17 18 19 

/OE Nc cs NPO PPO Qo Qi Q2 NC 

; INPUTS: NRZ_CK EXTERNAL CLOCK 

; /OE ACTIVE LOW OUTPUT ENABLE SIGNAL 

; /RST ACTIVE LOW MASTER RESET SIGNAL 

; NRZ_DELAY SERIAL NRZ DATA STREAM WHICH WAS DELAYED 

; BY PAL A 

; c2 - Co COUNTER OUTPUTS 

; NRZ_IN NRZ DATA STREAM INDICATES AN 


OUTPUTS: CS 


Zz 
8 


EQUATIONS 


~e te Me Me TO TO WO 


P_ZERO = [0,0,0] 
NEGATIVE = [1,0,1] 
N_ZERO = [1,0,0] 
POSITIVE = [0,1,0] 


COUNTER OUTPUTS ARE 111 AND NRZ_IN = 0 


CURRENT STATE 
POSITIVE B8ZS CODE 
NEGATIVE B8ZS CODE 


B8ZS SEQUENCE STATE MACHINE STATES 


B8ZS ENCODING STATE MACHINE EQUATIONS 
STATE VARIATES = [CS,PPO,NPO] 
WITH THE FOLLOWING STATE ASSIGNMENT 


/CS := /CS * /NPO * /Q1 * QO 


+ CS * /PPO * Qi * 


/Q0 


+ /CS * /NPO * /NRZ DELAY * /Q1 
+ /CS * /NPO * Q2 * /Q1 
+ CS * NRZ DELAY * /PPO * /Q2 * /QO0 


+ RST 


/PPO := CS * /PPO * QO * /Ql 


+ /CS * /NPO * /QO 


+ CS * /NRZ_DELAY * /PPO * /Q1 
+ CS * /PPO * Q2 * /Q1 


+ RST 


GND 


vec 
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/NPO := /CS * /NPO * /Q1 * QO 


+ /CS * /NPO * /NRZ_DELAY * /Q1 
+ /CS * /NPO * Q2 * /Q1 
+ CS * /PPO * /Q0 


+ RST 


STATE VARIATES [(Q2,Q1 


[0,0,0] 
[0,0,1) 
[0,1,0] 
[1,0,0} 
(1,0,1] 
(1,1,0] 


AMO Ww YS 
il 


++ Il 


/Q2 /Q2 * /Q1 
Q2 * Ql 


RST 


/Q0 
Q2 * Ql 
RST 


/Q2 * /Ql 
/Q2 * /Q1 
/Q2 * /Ql 
/Q2 * /Ql 


++ Il 


/Ql 


t++eeest il 


/Q0 


+ +e 
2 ~ 
Le} 

° 


~e we 


SIMULATION 


B8ZS SEQUENCE STATE MACHINE EQUATIONS 


,Q0] 


WITH THE FOLLOWING STATE ASSIGNMENTS 


* /C2 
* /Cl 


* NRZ_IN 


e@oeoeeeaeneeeeoeeeseeeeeeoeveeeeeeeeeeeenveo8 


eeoeeevoeeevneeeeveeeseeeeeveeeeeeeeeeeeeeeeee 


TRACE_ON NR2Z_CK /OE /RST NRZ_DELAY NRZ_IN C2 Cl CO 
CS PPO NPO Q2 Q1 Qo 


SETF OE 
RST 
CLOCKF NRZ_CK 


SETF NRZ_DELAY 

/RST 

NRZ_IN /C2 /Cl /CO 
CLOCKF NRZ_CK 


SETF /NRZ_DELAY 


; ENABLE OUTPUT 
+ RESET 


INITIALIZE 


=e 


PERFORM NORMAL ENCODING 


/NRZ_IN /C2 /Cl1 /CO 


CLOCKF NRZ_CK 
SETF NRZ_DELAY 
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/C2 /C1 CO 
CLOCKF NRZ_CK 
SETF NRZ_DELAY 

/c2 C1 /CO 
CLOCKF NR2Z_CK 
SETF NRZ_DELAY 

/C2 C1 co 
CLOCKF NRZ_CK 
SETF /NRZ_DELAY 

C2 /C1 /CO 
CLOCKF NR2Z_CK 
SETF /NRZ_DELAY 

C2 /C1 co 
CLOCKF NRZ_CK 
SETF /NRZ_DELAY 

C2 Cl /CO 
CLOCKF NRZ_CK 
SETF /NRZ_DELAY 

C2 C1 /CO 
CLOCKF NRZ_CK 
SETF C2 Cl CO 
CLOCKF NRZ_CK 
SETF /C2 /C1 /CO 
CLOCKF NRZ_CK 
FOR I:=0 TO 5 DO 

BEGIN 

CLOCKF NRZ_CK 

END 


PERFORM B8ZS SUBSTITUTION 
FORCE VIOLATION 


SETF /C2 /Cl /CO 
NRZ_DELAY 

CLOCKF NRZ_CK 

SETF /NRZ_DELAY 


=e tO 


CLOCKF NRZ_CK ; FORCE ONE 
SETF /NRZ_DELAY 
CLOCKF NRZ_CK ; FORCE ZERO 


SETF /NRZ_DELAY 
CLOCKF NRZ_CK 
SETF C2 Cl CO 
/NRZ_DELAY 
CLOCKF NRZ_CK 
SETF /C2 /C1 /CO 
/NRZ_DELAY 
CLOCKF NRZ_CK 
FOR I:=0 TO 5 DO 
BEGIN 
CLOCKF NRZ_CK 
END 


FORCE VIOLATION 


=e 


FORCE ONE 


=e 


PERFORM NORMAL ENCODING 


=e 


SETF /NRZ_DELAY 
NRZ_IN C2 Cl /CO 

CLOCKF NRZ_CK 

SETF C2 C1 CO 

CLOCKF NRZ_CK 

SETF /C2 /C1 /CO 
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CLOCKF NRZ_CK 
FOR I:=0 TO 5 DO 
BEGIN 
CLOCKF NRZ_CK ; PERFORM B8ZS SUBSTITUTION 
END 


TRACE_OFF 
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TITLE B8ZS_DECODER 
PATTERN B8ZS DECODER PAL 
REVISION 1.05 

AUTHOR THERESA SHAFER 
COMPANY MMI 

DATE 6/6/87 


CHIP B8ZS_DECODER PAL16R6 


; PINS 

71 2 3 4 5 6 7 8 9 
T1_CK /RST PPI NPI NC NC NC NC NC 
711 12 13 14 15 16 17 18 19 
/OE NC Qo Ql Q2 Q3 /BPV /B8ZS DATA 


INPUTS: T1_CK EXTERNAL CLOCK 


ZERO SUBSTITUTION HAS OCCURRED 


; /OE ACTIVE LOW OUTPUT ENABLE SIGNAL 

; /RST ACTIVE LOW MASTER RESET SIGNAL 

; PPI SERIAL POSITIVE PUSLE DATA STREAM 

; TO BE DECODED 

; NPI SERIAL NEGATIVE PUSLE DATA STREAM 

; TO BE DECODED 

; OUTPUTS: DATA DATA WHICH IS LOADED INTO THE SHIFT 
; REGISTER 

; Q3 =- QO STATE MACHINE OUTPUTS 

; /BPV LOW ACTIVE SIGNAL WHICH INDICATES THAT 
; BIPOLAR VIOLATION HAS OCCURRED 

; /B8ZS LOW ACTIVE SIGNAL WHICH INDICATES THAT 
‘ 


EQUATIONS 
; DATA OUTPUT 
/DATA = /PPI * /NPI; 


; /BPV AND /B8ZS FLAGS 


BPV := PPI * /Q2 * Q1 
+ PPI * /Q3 * Q2 * /QO 
+ NPI * Q2 * Ql 
+ NPI * /Q3 * Ql * /Q0 
+ /PPI * /Q2 * /Q1 * QO 
+ /PPI * Q3 * /Q1 * QO 
+ /NPI * /Q3 * Q2 * /Q1 
+ /NPI * Q2 * /Ql * /Q0; 


B8ZS := /PPI * NPI * /Q3 * Q2 * /Q1 * QO 
+ PPI * /NPI * Q3 * Q2 * /Ql * QO; 


STATE MACHINE WHICH DETECTS B8ZS SEQUENCE 
STATE VARIABLES [(Q3,Q2,Q1,Q0] 
WITH THE FOLLOWING STATE ASSIGNMENTS 


~e we VO 


GND 
20 
vce 
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; SPO = [0,0,1,1] 
; SP1 = [1,0,1,1] 
; SP2 = [1,0,1,0] 
; SP3 = [0,0,0,0] 
; SP4 = [0,1,0,0] 
; SP5 = [0,0,1,0] 
; SP6 = [1,1,0,0] 
; SP7 = [1,1,0,1] 
; SNO = [1,1,1,1] 
; SN1 = [0,1,1,1] 
; SN2 = [1,1,1,0] 
; SN3 = [1,0,0,0] 
: SN4 = [1,0,0,1] 
; SN5 = [0,1,1,0] 
; SN6 = [0,0,0,1] 
; SN7 = [0,1,0,1] 
/Q3 := PPI * /RST 
+ /NPI * /Q3 * /Q2 * /Q1l * /QO * /RST 
+ /NPI * Q3 * /Q2 * Ql * /QO * /RST 
+ /NPI * /Q3 * Q2 * Ql * /QO * /RST 
+ /NPI * Q3 * /Q1 * QO * /RST 
+ /NPI * Q3 * Q2 * QO * /RST 
+ NPI * /Q3 * Q2 */Q1 * /QO * /RST; 
/Q2 := PPI * Ql * /RST 
+ PPI * Q2 * /RST 
+ /NPI * /Q3 * Q2 * /QO * /RST 
+ /NPI * Q3 * Ql * /QO * /RST 
+ /PPI * /NPI * /Q3 * /Ql * /RST 
+ /NPI * /Q2 * Ql * QO * /RST 
+ Q3 * /Q2 * /Ql * /QO * /RST 
+ /Q3* Q2 * /Ql * /QO * /RST; 
/QL := PPI * /Q3 * /Q2 * /Q1 * /RST 
+ /PPI * /NPI * /Q2 * /QO * /RST 
+ /PPI * /NPI * Ql * /QO * /RST 
+ /PPI * NPI * Q3 * /Q1 * /QO * /RST; 
/QO := PPI * /Q3 * /Q2 * /Ql * /QO * /RST 


+ PPI * Q3 * /Q2 * /Q1 * QO * /RST 

+ /PPI * /NPI * /Q2 * /QO * /RST 

+ /PPI * /NPI * Q3 * /QO * /RST 

+ /PPI * /NPI * /Q3 * /Q2 * /Ql * /RST 

+ /PPI * /NPI * Q3 * /Q2 * Ql * /RST 

+ /PPI * /NPI * /Q3 * Q2 * Ql * QO * /RST 
+ /PPI * NPI * /Q3 * Q2 * /Q1 * /QO * /RST; 


me te 


eoeoeovneeveevoeoeveeceoeoeeevneeeeoesevoeneeeveseeeeeevneseesvneeseeeevesnevnenese 
@eeeeseoeneeaovoevosvseeeeeeeoeeevoeeweeseeeveeeevsseeeveeveeveeeveeseseoeeeveesveeneeee e @ 
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SIMULATION 


TRACE _ON T1_CK /OE /RST PPI NPI 
Q3 Q2 Q1 QO /BPV /B8ZS 


SETF OE ; ENABLE OUTPUT 
RST ; RESET 
/PPI /NPI 

CLOCKF T1_CK 

SETF RST 


CLOCKF T1_CK 
SETF /PPI /NPI 
/RST 

CLOCKF T1_CK 


; NORMAL DECODING 
SETF PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF PPI /NPI 
CLOCKF T1_CK 
SETF /PPI NPI 





CLOCKF T1_CK 
SETF PPI /NPI 
CLOCKF T1_CK 


; DETECT B8ZS SEQUENCE 
; + SEQUENCE 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF PPI /NPI 
CLOCKF T1_CK 
SETF /PPI NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI NPI 
CLOCKF T1_CK 
SETF PPI /NPI 
CLOCKF T1_CK 
; SET-UP FOR NEXT SEQUENCE 
SETF /PPI NPI 
CLOCKF T1_CK 
; - SEQUENCE 
SETF /PPI /NPI 
CLOCKF T1_CK 
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SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI NPI 
CLOCKF T1_CK 
SETF PPI /NPI 


CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF PPI /NPI 
CLOCKF T1_CK 
SETF /PPI NPI 
CLOCKF T1_CK 


SETF /PPI /NPI 
CLOCKF T1_CK 
SETF PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF PPI /NPI 
CLOCKF T1_CK 
SETF /PPI NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI NPI 
CLOCKF T1_CK 
SETF PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF /PPI /NPI 
CLOCKF T1_CK 
SETF PPI /NPI 
CLOCKF T1_CK 


TRACE_OFF 
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Introduction 


A digital communication trunk system consists essentially of two 
channel banks and several repeaters on the transmission lines 
(Figure 1). A channel bank is terminal equipment which multi- 
plexes more than one channel onto a single transmission line 
using Time Division Multiplexing (TDM) technique. This TDM 
technique assigns an equal individual time slot for each channel 
to allow several channels to share a transmission line sequen- 
tially. 


CHANNEL REPEATERS ON CHANNEL 


BANK TRANSMISSION LINES BANK 





Figure 1. Digital Communication Trunk System 


High Density Bipolar (HDB3) code is a modified bipolar code. A 
bipolar code uses alternating polarities for successive logic 
ones. The signal edges provide the timing information. An error 
occurs when two contiguous logic ones have the same polarity; 
this is defined as a "bipolar violation." HDB3 code permits a legal 
bipolar violation to prevent four or more contiguous zeros in the 
data stream. Three is the maximum number of contiguous zeros 
allowed before a violation is inserted to maintain timing informa- 
tion for a long string of logic zeros. 


HDB3 Code 


Code conversion rules for trunk lines are defined in the CCITT 
G703 recommendation, which includes two standards. One 
standard is T-carrier standard for North America and Japan; the 
other standard is CEPT standard for Europe and elsewhere. 
HDB3 code is recommended by CCITT for the CEPT standard, 
at the 2,048, 8,448, or 34,368 kbps rates. 


Violation patterns used for four-zeros-substitution in HDB3 are 
shown in Figure 2. Four possible HDB3 substitution codes are: 
+00+, -00-, 000+, or 000-. The "+" indicates the positive voltage, 


"." indicates the negative voltage, and "0" indicates the zero volt- 
age. The polarity of the preceding pulse and the number of bipo- 
lar pulses after the previous bipolar violation select the substitu- 
tion code. 














POLARITY 
PRECEDING 
PULSE 


Positive 


NUMBER OF BIPOLAR PULSE 


000+ -00- 
(+) 


(-) 


Figure 2. HDB3 Substitution Table 








HDB3 Coding Example 


An example of converting from NRZ (Non-Return-Zero) data to 
HDBS3 code is shown in Figure 3. There are four sets of four suc- 
cessive zeros in the data stream. In the first group the previous 
signal is positive and the number of logic ones is odd, so the 
"000+" pattern is generated. In the second group the previous 
signal is positive and the number of logic ones is even, so the "- 
00-" pattern is generated. In the third group the previous signal is 
negative and the number of logic ones is even, so the "+00+" 
pattern is generated . A"000-" pattern is generated in the fourth 
group because the previous signal is negative and the number 
of logic ones is odd. 


HDB3 substitution codes force the fourth bit to be a bipolar viola- 
tion. The violation is chosen by the following scheme, based on 
the number of bits since the previous violation: 


(1) When the number is even, HDB3 substitution code forces the 
first zero to be a bipolar code (B) and the fourth zero to be a bi- 
polar violation (V). Therefore, four successive zeros is converted 
to a bipolar signal, following two continuous zeros, and a bipolar 
violation with the first bipolar signal (BOOV). The polarity of the 
bipolar violation (V) is the same as the polarity of the first bipolar 
signal (B). 


NRZ SIGNAL STRING ———+001-0000-11-0000-11-0000-1-0000 


POLARITY ——— 00+ + +: . 
THE PREVIOUS 
SIGNAL NUMBER ————> 1 2 2 1 


HDB3 SUBSTITUTION CODE ————> 000+ -00- +004 o00- 
HDB3 CODE ———-00+ 000+ -00- +00+ o00- 
GROUP 1 GROUP 2 GROUP 3 GROUP4 


Figure 3. An Example of HDB3 Coding 
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(2) When the number is odd, HDB3 substitution code forces the 
fourth zero to be a bipolar violation (V). Therefore, HDB3 is con- 
verted as three continuous zeros followed by bipolar violation 
with the previous bipolar signal (000V). The polarity of the bipo- 
lar violation is the same as the polarity of the previous bipolar 
signal. 


HDB3 Encoder Implementation 


Figure 4 shows the block diagram of an HDB3 encoder, which 
includes three state machines and one 4-bit shift register. A DE- 
TECT_4_ ZEROS state machine detects four successive zeros. 
An ODD_EVEN state machine checks the number of logic ones 
after the last bipolar violation. An HDB3 encoding state machine 


NRZIN 


4-ZEROS 
DETECT 4 ZEROS 





SHIFT REGISTER 


generates the HDB3 code and a RESET signal which clears the 
ODD_EVEN state machine. A 4-bit shift register delays the in- 
coming NRZ data in order to synchronize with the 4-ZEROS sig- 
nal from the DETECT_4 ZEROS state machine. 


The HDB3 encoding state machine generates HDB3 code, de- 
pending on the DSIN, 4-ZEROS, ODD_EVEN, and POS_NEG 
signals. When DSIN is false and 4-ZEROS is false, the output is 
a zero. When DSIN is true and 4-ZEROS is false, the output 
generates a signal that alternates between the positive and neg- 
ative signals. When DSIN is true and 4-ZEROS is true, an error 
flag is raised. HDB3 substitution code is generated when DSIN 
is false and 4-ZEROS is true. Four possible HDB3 substitution 
codes are generated, depending on POS_NEG and 
ODD_EVEN signals. These are summarized in Figure 5. 


RESET 





ODD _ EVEN 
ODD _ EVEN 


(+) 











GENERATE PHDB 
HDB3 
CODING 
STATE 
MACHINE NHDB 






(-) 


Figure 4. Block Diagram of an HDB3 Encoder 


DSIN 4-ZEROS POS_NEG ODD_EVEN CHARACTERISTICS 


i 


1 
Zero output 








Alternating 
PHDB (+) and NHDB (-) 





Error 


Figure 5. HDB3 Encoding State Table 
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The three Programmable Array Logic (PAL) devices used to im- 
plement the entire HDB3 encoder are a 16RA8, a 20RS8, anda 
16R4 (see Figure 6). 


The 16RA8 includes two state machines: ODD_EVEN and DE- 
TECT_4_ZEROS. The ODD_EVEN's output signal is fed into 
the HDB3 decoding state machine. If the ODD_EVEN state ma- 
chine is triggered by the same clock edge as the decoding state 
machine, the wrong result is generated because the encoding 
state machine always gets the ODD_EVEN's output signal one 
clock early. Therefore, the ODD_EVEN state machine must be 
triggered by the falling edge of the clock to compensate for this 
timing problem. However, the DETECT_4_ ZEROS state ma- 
chine is triggered by the rising edge of the clock. Both rising and 
falling edges are needed for these two state machines. The 
16RA8 device offers the programmable clock feature because 
one of the product terms in each cell is the clock input. The clock 
flexibility of the 16RA8 allows the falling-edge clock (CLK) for 
the ODD_EVEN state machine and the rising- edge clock (CLK) 
for the DETECT_4 ZEROS state machine in a single 20-pin 
PAL device. 


NRZIN 


The HDB3 encoding state machine diagram is shown in Figure 
7. Four registers are used for fifteen states. Two of the four reg- 
isters have ten product terms, one register has eight product 
terms, and the final register has seven product terms. Both the 
20RS4 and 20RS8 have the product-sharing feature. The 
20RS8 is used to implement this HDB3 encoding state machine 
because the product sharing feature is sixteen product terms 
shared by two registers exclusively, and in this case at least sev- 
enteen product terms are required per register pair. The design 
fits in the 20RS8 by using the product terms from two registers 
for one output, and leaving the other output as a no-connect. In 
addition to the HDB3 encoding state machine, a reset signal and 
an error message are generated in this 20RS8. The reset signal 
clears the ODD_EVEN state machine. The error message oc- 
curs when DSIN and 4_ZEROS both are high simultaneously. 


The 16R4 has four registered outputs and four combinatorial 
outputs. A 4-bit shift register is implemented using four regis- 
tered outputs. The positive HDB3 signal (PHDB) and negative 
HDB3 signal (NHDB) are generated using two combinatorial 
outputs. 





Figure 6. HDB3 Encoder Implemented by Three PAL Devices 
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$4 (0100) 
PHDB =1 
NHDB =0 


RESET = 1 
ERROR =0 





$5 (1100) 
PHDB =0 
NHDB =0 
RESET =0 
ERROR =0 





S6 (1101) 
PHDB =0 
NHDB =0 
RESET = 0 


ERROR =0 


S7 (1111) 
PHDB = 1 
NHDB =0 
RESET = 1 
ERROR =0 
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/DSIN* ZEROS 





















/DSIN* 
ZEROS* 
/ODDEVEN $13 (0101) 
PHDB =0 
NHDB = 0 
RESET =0 
/DSIN* /DSIN* sab ile 
ZEROS* ZEROS* 
/ODDEVEN /ODDEVEN 
/DSIN* 
ZEROS* 
ODDEVEN 
/DSIN* 










/ZEROS* 









$0 (0000) 
PHDB =0 
NHDB =0 
RESET =0 
ERROR =0 


$3 (1110) 
PHDB =0 
NHDB =1 
RESET =0 
ERROR = 0 































DSIN* 
{ZEROS 
DSIN* /ZEROS 
$1 (1000) $2 (1010) 
PHDB = 1 PHDB = 0 
NHDB =0 NHDB =0 
RESET =0 RESET =0 
ERROR =0 /DSIN* ERROR =0 
ZEROS* 
/ODDEVEN 
/DSIN* ALL DSIN* ZEROS 
ZEROS* 
ODDEVEN 
$8 (0111) $14 (1001) 
PHDB = 0 PHDB =0 
NHDB =0 NHDB =0 
RESET =0 RESET =0 
/DSIN* ERROR = 0 ERROR =1 
/ZEROS* 





/DSIN* ZEROS 


Notes: 
1. Inputs: DSIN, ZEROS, and ODDEVEN. 
2. State Outputs: PHDB, NHDB, RESET and ERROR. 


Figure 7. HDB3 Encoding State Machine 
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/OSIN* 
/ZEROS* 


DSIN* 
/ZEROS 


/DSIN* 











$12 (0001) 
PHDB =0 
NHDB = 1 


RESET =1 
ERROR =0 


$11 (0011) 
PHDB =0 
NHDB =0 
RESET =0 

ERROR =0 





$10 (0010) 
PHDB =0 
NHDB =0 
RESET =0 
ERROR = 0 


ZEROS* 
/ODDEVEN 





$9 (0110) 
PHDB = 1 
NHDB =0 


RESET =1 
ERROR =0 
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HDB3 Decoder Implementation 


The block diagram of the HDB3 decoder is shown in Figure 8. A 
decoding state machine detects the HDB3 code, then either a 
Data Serial OUT (DSOUT) signal or a RESET signal is generat- 
ed. These two signals are fed into the 4-bit shift register. Two 
error flags are generated: VERROR (Violation ERROR) and ER- 
ROR. 









DETECT 
HDB3 CODING 
STATE 
MACHINE 


<«—— PHDB 





RESET 
wor leo 
DS 
4-BIT OUR 
ERROR 
VERROR 


SHIFT REGISTER ~««— NHDB 


Figure 8. Block Diagram of an HDB3 Decoder 


CLK | 1 | CLK 


MR 5 (MR 


PAL20RS4 


}10 | NC VERROR 
1 ERROR 
12] GND 


A single 24-pin PAL device (20RS4) and a 4-bit shift register 
(HCT194) are used to implement the HDB3 decoder (see Figure 
9). The decoding state machine diagram is shown in Figure 10. 
Four registers are used for thirteen states. Two of the four reg- 
isters have eight product terms, one register has nine product 
terms, and the final register has five product terms. Because 
the total product terms of two registers can be less than six- 
teen product terms, these four registers can be implemented 
using a 20RS4 device rather than a 20RS8. 


When an HDB3 substitution code is detected, a reset signal is 
generated to load four zeros to the HCT194. Otherwise, 
DSOUT is generated to the shift right serial input (SR SER) of 
the HCT194. When any two sequential signals of the same po- 
larity are detected except the HDB3 substitution code, the 
bipolar violation signal is generated (VERROR). The ERROR 
signal is generated when both positive HDB3 signal and nega- 
tive HDB3 signal are high simultaneously. 


Figure 9. HDB3 Decoder Implemented by a PAL Device and a Shift Register 
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/PHDB* 


$0 (0000) 
PHDB* RESET = 0 
/NHDB VERROR = 0 
ERROR = 
PHDB* PHDB* 
/NHDB NHDB 
$11(1110) 
RESET = 0 
VERROR = 1 
ERROR =0 
/PHDB* PHDB* 
/NHDB INHDB 
PHDB* PHDB* 
/NHDB 511000) /NHDB 
RESET =0 
VERROR = 1 
euDEe ERROR =0 
/NHDB 
/PHDB* 
/NHDB 
PHDB" $2 (1001) /PHDB* 
/NHDB RESET =0 NHDB 
VERROR = 1 
ERROR =0 
/PHDB* /PHDB* 
/NHDB /NHDB 


PHDB* 


$3 (1011) 
RESET = 
PHDB*/NHDB 
VERROR = 0 
ERROR =0 









S4 (1010) 
RESET = 
PHDB*/NHDB 
VERROR = 0 
ERROR =0 





S5 (0010) 

RESET =0 
VERROR = 1 
ERROR =0 


/NHDB 


/PHDB* 
NHDB 


/PHDB* 
NHDB 


/PHDB* 
/NHDB 


/PHDB* 
NHDB 


Notes: 
1. Inputs: PHDB and NHDB. 









ALL 
PHDB* 
NHOB /PHDB* 
NHDB 
/PHDB* 
NHDB 
$12 (1101) 
RESET =0 
VERROR = 1 
ERROR =0 
/PHDB* 
/NHDB 
/PHDB* 
S6 (0110) NHDB 
RESET =0 
VERROR = 1 
HupEY ERROR =0 
/NHDB 
/PHDB* 
/NHDB 
PHDB* $7 (0100) /PHDB* 















RESET =0 NHDB 
VERROR = 1 


ERROR = 0 


S8 (0101) 
RESET = 
/PHDB* NHDB 
VERROR = 0 
ERROR = 0 





/PHDB* 
/NHDB 


S9 (0111) 
RESET = 
/PHDB* NHDB 
VERROR = 0 
ERROR = 0 


/PHDB* 
/NHDB 





$10 (1111) 
RESET =0 
VERROR = 1 
ERROR =0 
/PHDB* 
NHDB 


2. State Outputs: RESET, VERROR and ERROR. 
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Title HDB3 encoder logic 

Pattern HDB3EN1.pds 

Revision A 

Author Cindy Lee 

Company Monothic Memories Inc., Santa Clara, Ca 
Date 3/4/87 


CHIP HDB3EN1 PAL16RA8 


;PIN1 PIN2 PIN3 PIN4 PIN5 PIN6 PIN7 PIN8 PINS PIN10 
/PL CLK PHDB NHDB NRZIN RESET MR NC NC GND 


*;PIN11 PIN12 PIN13 PIN14 PIN15 PIN16 PIN17 PIN18 PIN19 PIN20 
/OE ZQO Z2Q1 ZQ2 ZEROS NC 0Q0 0Q1 ODDEVEN VCC 


INPUT SIGNALS 


’ 

; /PL : ASSERTIVE LOW PRELOAD 

; CLK : EXTERNAL CLOCK 

; PHDB : POSITIVE HDB3 SIGNAL 

; NHDB : NEGATIVE HDB3 SIGNAL 

; NRZIN : NOR-RETURN-ZERO INPUT SIGNAL 

; RESET : RESTE SIGNAL TO RESET THE ODD_EVEN STATE MACHINEG 
; MR : MASTER REST 

; /OE : ASSERTIVE LOW OUTPUT ENABLE 


OUTPUT SIGNALS 

ZQ0, ZQ1, ZQ2 : DETECT_4 ZEROS STATE VARIABLES 
ZEROS : DETECT 4 ZEROS STATE OUTPUT 

0Q0, OQ1 : ODD_EVEN STATE VARIABLES 

ODDEVEN : ODD_EVEN STATE OUTPUT 


me me we MO NO 


EQUATIONS 


716RA8 PROVIDES TWO STATE MACHINES 


;(1) THE ODD_EVEN STATE MACHINE CHECKS THE NUMBER OF BIPOLAR PULSES 
; AFTER THE PREVIOUS BIPOLAR VIOLATION IS ODD OR /EVEN 


ODDEVEN = 0Q0*/0Q1 ; ODD_EVEN STATE OUTPUT 
OQ1 := NHDB*0Q0*/0Q1*/RESET ; MSB STATE VARIABLE 
+ PHDB*0Q0*/0Q1*/RESET 
+ /NHDB*/PHDB*0Q0*0Q1*/RESET 
OQlL.SETF = MR ; MASTER RESET 
0Q1.CLKF = /CLK ; THE FALLING EDGE OF 
; THE CLOCK 
0Q0 := NHDB*/0Q1*/RESET ; LSB STATE VARIABLE 
+ PHDB*/0Q1*/RESET 
+ OQ0*/RESET 
0Q0.SETF = MR 
0Q0.CLKF = /CLK 





2-390 ON monotithic HH] memories &\ 


HDB3 Line Coding Using PAL Devices 





;(2) THE DETECT_4 ZEROS STATE MACHINE DETECTS FOUR CONTINUOUS ZEROS 


ZEROS = /ZQ0 * /ZQ1 * ZQ2 ; STATE OUTPUT 

ZQ2 := /NRZIN * /ZQO0 * 2Q1 * /ZQ2 ; MSB STATE VARIABLE 

ZQ2.SETF = MR ; MASTER RESET 

2Q2.CLKF = CLK ; EXTERNAL CLOCK 

Z2Q1 := /NRZIN * ZQO0 * /2Q2 

ZQ1.SETF = MR 

Z2Q1.CLKF = CLK 

ZQ0 := /NRZIN * /ZQ0 * /ZQ1 ; LSB STATE VARIABLE 
+ /NRZIN * /ZQ1 * /ZQ2 

ZQ0.SETF = MR 

ZQ0.CLKF = CLK 

SIMULATION 





TRACE_ON CLK MR NRZIN PHDB NHDB RESET ODDEVEN ZEROS 


SETF /CLK MR /NRZIN /PHDB /NHDB /RESET OE /PL ;RESET CONDITION 


SETF CLK /MR ;/DSIN, /ZEROS, /ODDEVEN 
SETF /CLK /MR /NRZIN /PHDB /NHDB /RESET ;VECTOR 1, STATE 0 

SETF CLK ;/DSIN, /ZEROS, /ODDEVEN 
SETF /CLK /NRZIN /PHDB /NHDB /RESET ;VECTOR 2, STATE 0 

SETF CLK ;/DSIN, /ZEROS, /ODDEVEN 
SETF /CLK /NRZIN /PHDB /NHDB /RESET ;VECTOR 3, STATE 0 

SETF CLK ;/DSIN, /ZEROS, /ODDEVEN 
SETF /CLK /NRZIN /PHDB /NHDB /RESET ;VECTOR 4, STATE 0 

SETF CLK ;/DSIN, ZEROS, /ODDEVEN 
SETF /CLK NRZIN PHDB /NHDB RESET ;VECTOR 5, STATE 4 

SETF CLK ;/DSIN, /ZEROS, /ODDEVEN 
SETF /CLK NRZIN /PHDB /NHDB /RESET ;VECTOR 6, STATE 5 

SETF CLK ;/DSIN, /ZEROS, /ODDEVEN 
SETF /CLK NRZIN /PHDB /NHDB /RESET ;VECTOR 7, STATE 6 

SETF CLK ;/DSIN, /ZEROS, /ODDEVEN 
SETF /CLK /NRZIN PHDB /NHDB RESET ;VECTOR 8, STATE 7 

SETF CLK ;DSIN, /ZEROS, /ODDEVEN 
SETF /CLK NRZIN /PHDB NHDB /RESET ;VECTOR 9, STATE 3 

SETF CLK ;DSIN, /ZEROS, ODDEVEN 
SETF /CLK /NRZIN PHDB /NHDB /RESET ;VECTOR 10, STATE 1 
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SETF CLK 


SETF /CLK NRZIN /PHDB 


NHDB /RESET 


;DSIN, /ZEROS, /ODDEVEN 


;VECTOR 11, STATE 3 





SETF CLK ;/DSIN, /ZEROS, ODDEVEN 
SETF /CLK /NRZIN /PHDB /NHDB /RESET *;VECTOR 12, STATE O 

SETF CLK ;DSIN, /ZEROS, ODDEVEN 
SETF /CLK /NRZIN PHDB /NHDB RESET VECTOR 13, STATE 1 

SETF CLK ;/DSIN, / ZEROS, /ODDEVEN 
SETF /CLK /NRZIN /PHDB /NHDB /RESET ;VECTOR 14, STATE 2 

SETF CLK ;DSIN, /ZEROS, /ODDEVEN 
SETF /CLK /NRZIN /PHDB NHDB /RESET ;VECTOR 15, STATE 3 

SETF CLK ;/DSIN, ZEROS, ODDEVEN 
SETF /CLK NRZIN /PHDB /NHDB /RESET 7;VECTOR 16, STATE 13 
SETF CLK :/DSIN, /ZEROS, ODDEVEN 
SETF /CLK NRZIN /PHDB /NHDB /RESET 7;VECTOR 17, STATE 10 
SETF CLK ;/DSIN, /ZEROS, ODDEVEN 
SETF /CLK /NRZIN /PHDB /NHDB /RESET - ;VECTOR 18, STATE 11 
SETF CLK ;/DSIN, /ZEROS, ODDEVEN 
SETF /CLK /NRZIN /PHDB NHDB RESET ;VECTOR 19, STATE 12 
SETF | CLK ;DSIN, /ZEROS, /ODDEVEN 
SETF /CLK /NRZIN PHDB /NHDB /RESET ;VECTOR 20, STATE 1 

SETF CLK ;DSIN, /ZEROS, ODDEVEN 
SETF /CLK /NRZIN /PHDB NHDB /RESET ;VECTOR 21, STATE 3 

SETF CLK :/DSIN, ZEROS, /ODDEVEN 
SETF /CLK NRZIN PHDB /NHDB RESET ;VECTOR 22, STATE 4 

SETF CLK ;/DSIN, /ZEROS, /ODDEVEN 
SETF /CLK /NRZIN /PHDB /NHDB / RESET ;VECTOR 23, STATE 5 

SETF CLK ;/DSIN, /ZEROS, /ODDEVEN 
SETF /CLK NRZIN /PHDB /NHDB /RESET ;VECTOR 24, STATE 6 

SETF CLK ;/DSIN, /ZEROS, /ODDEVEN 
SETF /CLK /NRZIN PHDB /NHDB' RESET ;VECTOR 25, STATE 7 

SETF CLK ;DSIN, /ZEROS, /ODDEVEN 
SETF /CLK NRZIN /PHDB NHDB / RESET ;VECTOR 26, STATE 3 

SETF CLK * :;/DSIN, /ZEROS, ODDEVEN 
SETF /CLK NRZIN /PHDB /NHDB / RESET ;VECTOR 27, STATE O 

SETF CLK ;DSIN, /ZEROS, ODDEVEN 
SETF /CLK /NRZIN PHDB /NHDB /RESET ;VECTOR 28, STATE 1 

SETF CLK ;/DSIN, /ZEROS, /ODDEVEN 
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SETF /CLK /NRZIN /PHDB /NHDB /RESET ;VECTOR 29, STATE 2 
SETF CLK ;DSIN, /ZEROS, /ODDEVEN 
SETF /CLK /NRZIN /PHDB NHDB /RESET ;VECTOR 30, STATE 3 
SETF CLK ;DSIN, /ZEROS, ODDEVEN 
SETF /CLK /NRZIN PHDB /NHDB /RESET ;VECTOR 31, STATE 1 
SETF CLK ;/DSIN, ZEROS, /ODDEVEN 
TRACE_OFF 
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Title HDB3 encoder 

Pattern HDB3EN2.pds 

Revision A 

Author Cindy Lee 

Company Monolithic Memories Inc., Santa Clara, Ca 
Date 3/9/87 


CHIP HDB3EN2 PAL20RS8 


*;PINI PIN2 PIN3 PIN4 PINS PIN6 
CLK DSIN ZEROS ODDEVEN NC MR 


*PIN7 PINS PINS PIN10 PIN11 PIN12 
NC NC NC NC NC GND 


*PIN13 PIN14 PIN15 PIN16 PIN17 PIN18 
/OE ERROR NC Q3 Nc Q2 


?PIN19 PIN20 PIN21 PIN22 PIN23 PIN24 
NC Ql NC QO RESET Vcc 


INPUT SIGNALS 


, 

; CLK : EXTERNAL CLOCK 

; DSIN : SERIAL DATA INPUT SIGNAL 

; ZEROS : FOUR ZEROS DETECTION FLAG INPUT 
; ODDEVEN : ODD OR /EVEN SIGNAL 

; MR : MASTER RESET 

; /OE : ASSERTIVE LOW OUTPUT ENABLE 


; OUTPUT SIGNALS 

; ERROR : ERROR FLAG 

* Q3, Q2, Ql, QO : HDB3 DECODING STATE MACHINE STATE VARIABLES 
; RESET : RESET SIGNAL 


EQUATIONS 

7;20RS8 GENERATES TWO SIGNALS AND ONE STATE MACHINE 

7(1) A RESET SIGNAL CLEARS THE ODD_EVEN STATE MACHINE 

RESET = /Q0 * Q2 * /Q3 STATE 4 AND STATE 9 


+ QO * /Ql * /Q2 * /Q3 ; STATE 12 
+ Q0 * Q1 * Q2 * Q3 ; STATE 7 


7(2) A ERROR SIGNAL IS RAISED WHEN DSIN AND 4-ZEROS BOTH ARE 
; HIGH AT THE SAME TIME 


ERROR = QO * /Ql * /Q2 * Q3 ; STATE 14 
7(3) HDB3 ENCODING STATES MACHINE 
Q3 2= /QO * /Ql * Q2 * /MR ? MSB STATE VARIABLE 
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+ /Q0 * /Q2 * Q3 * /ZEROS * /MR 
DSIN * /QO0 * ZEROS * /MR 

DSIN * Q2 * Q3 * /MR 

DSIN * /Ql * /Q2 * /Q3 * /MR 
DSIN * /Q1 * ZEROS * /MR 

/Ql * Q2 * Q3 * /MR 

DSIN * /Q3 * ZEROS * /MR 

QO * Ql * Q2 * /Q3 * /MR 

QO * Q2 * Q3 * /ZEROS * /MR 


t++tteees+ 


/DSIN * /QO * /Ql * Q2 * /MR 

/DSIN * QO * Ql * Q2 * /Q3 * /MR 

7Q0 * /Ql * Q2 * /ZEROS * /MR 

/Ql * Q2 * Q3 * /ZEROS * /MR 

QO * Ql * Q2 * /Q3 * /ZEROS * /MR 
DSIN * /QO * /Q2 * Q3 * /ZEROS * /MR 
DSIN * QO * Q2 * Q3 * /ZEROS * /MR 
/DSIN * /QO * Q3 * ZEROS * /MR 

/DSIN * /Q1l * /Q2 * /Q3 * ZEROS * /MR 
/DSIN * Q2 * Q3 * ZEROS * /MR 


Q2 


++etttetets i 





/DSIN * /QO * Ql * /Q3 * /MR 
/DSIN * QO * /Q1 * Q2 * /MR 
/Q0 * Ql * /Q3 * /ZEROS * /MR 
QO * /Ql * Q2 * /ZEROS * /MR 
/Q0 * /Q2 * Q3 * /ZEROS * /MR 
QO * Q2 * Q3 * /ZEROS * /MR 
/DSIN * /QO * /Q2 * Q3 * /MR 
/DSIN * QO * Q2 * Q3 * /MR 


Ql 


teteeetstil 


ttteest i 


DSIN * /QO * ZEROS * /MR ; LSB STATE VARIABLE 
DSIN * /Q1l * ZEROS * /MR 

/Ql * Q2 * Q3 * /MR 

Ql * /Q2 * /Q3 * /MR 

DSIN * Q2 * ZEROS * /MR 

ODDEVEN * /Q0 * Q3 * ZEROS * /MR 

ODDEVEN * /QO * /Q2 * ZEROS * /MR 


Qo 


SIMULATION 
TRACE _ ON MR CLK DSIN ZEROS ODDEVEN Q3 Q2 Q1 QO RESET ERROR 


SETF OE MR DSIN ZEROS ODDEVEN 7;RESET CONDITION 


CLOCKF CLK 
SETF /MR 

SETF DSIN ZEROS /ODDEVEN ;VECTOR 1, STATE 14, 1001 
CLOCKF CLK ; ERROR 

SETF /DSIN /ZEROS /ODDEVEN ;VECTOR 2, STATE 0, 0000 
CLOCKF CLK 

SETF DSIN /ZEROS /ODDEVEN ;VECTOR 3, STATE 1, 1000 
CLOCKF CLK ;PHDB 
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SETF /DSIN 
CLOCKF CLK 


SETF /DSIN 
CLOCKF CLK 


SETF /DSIN 
CLOCKF CLK 


SETF /DSIN 
CLOCKF CLK 


SETF DSIN 
CLOCKF CLK 


SETF DSIN 
CLOCKF CLK 


SETF /DSIN 
CLOCKF CLK 


SETF /DSIN 
CLOCKF CLK 


SETF /DSIN 
CLOCKF CLK 


SETF /DSIN 
CLOCKF CLK 


SETF DSIN 
CLOCKF CLK 


SETF DSIN 
CLOCKF CLK 


SETF /DSIN 
CLOCKF CLK 


SETF /DSIN 
CLOCKF CLK 


SETF /DSIN 
CLOCKF CLK 


SETF /DSIN 
CLOCKF CLK 


SETF DSIN 
CLOCKF CLK 


SETF /DSIN 
CLOCKF CLK 


SETF /DSIN 
CLOCKF CLK 
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/ ZEROS 


/ ZEROS 


/ ZEROS 


/ ZEROS 


/ ZEROS 


ZEROS 


/ ZEROS 


/ ZEROS 


/ ZEROS 


/ ZEROS 


/ ZEROS 


ZEROS 


/ ZEROS 


/ ZEROS 


/ ZEROS 


/ ZEROS 


ZEROS 


/ ZEROS 
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ODDEVEN 


ODDEVEN 


ODDEVEN 


ODDEVEN 


/ODDEVEN 


ODDEVEN 


/ ODDEVEN 


/ODDEVEN 


/ODDEVEN 


/ODDEVEN 


/ODDEVEN 


ODDEVEN 


/ODDEVEN 


/ODDEVEN 


/ ODDEVEN 


/ODDEVEN 


/ ODDEVEN 


ODDEVEN 


ODDEVEN 


;VECTOR 4, STATE 8, 0111 
;VECTOR 5, STATE 5, 1100 
;VECTOR 6, STATE 6, 1101 
;VECTOR 7, STATE 7, 1111 
;PHDB, RESET 

;VECTOR 8, STATE 3, 1110 
;NHDB 

;VECTOR 9, STATE 1, 1000 
;PHDB 

;VECTOR 10, STATE 9, 0110 
;NHDB, RESET 

;VECTOR 11, STATE 10, 0010 
;VECTOR 12, STATE 11, 0011 
;VECTOR 13, STATE 12, 0001 
;NHDB, RESET 

;VECTOR 14, STATE 1, 1000 
; PHDB 

;VECTOR 15, STATE 3, 1110 
;NHDB 

;VECTOR 16, STATE 4, 0100 
;PHDB, RESET 

;VECTOR 17, STATE 5, 1100 
;VECTOR 18, STATE 6, 1101 
;VECTOR 19, STATE 7, 1111 
;PHDB, RESET 

;VECTOR 20, STATE 3, 1110 
;NHDB 

;VECTOR 21, STATE 13, 0101 
;VECTOR 22, STATE 10, 0010 
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SETF /DSIN 
CLOCKF CLK 


SETF /DSIN 
CLOCKF CLK 


TRACE_OFF 


/ ZEROS 


/ ZEROS 


HDB3 Line Coding Using PAL Devices 


ODDEVEN 7;VECTOR 23, STATE 11, 


ODDEVEN *VECTOR 24, STATE 12, 
*;NHDB, RESET 


PALASM SIMULATION, V2.22 - MARKET RELEASE (11-19-86) 
(C) ~- COPYRIGHT MONOLITHIC MEMORIES INC, 1986 
PALASM SIMULATION SELECTIVE TRACE LISTING 
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Title HDB3 shift register 

Pattern HDB3EN3.pds 

Revision A 

Author Cindy Lee 

Company Monothic Memories Inc., Santa Clara, Ca 
Date 2/24/87 


CHIP HDB3EN3 PAL16R4 


;PIN1 PIN2 PIN3 PIN4 PINS PIN6 PIN7 PIN8 PINS PIN1O 
CLK NRZIN QINO QIN1 QIN2 QIN3 MR NC NC GND 


*;PIN11 PIN12 PIN13 PIN14 PIN15 PIN16 PIN17 PIN18 PIN1i9 PIN20 
/OE NC NC DSIN Q2 Ql Q0 NHDB PHDB VCC 


INPUT SIGNALS 
CLK : EXTERNAL CLOCK 
NRZIN : NON-RETURN-ZERO INPUT SIGNAL 
QINO, QIN1, QIN2, QIN3 : FOUR STATE VARIABLE INPUTS 
MR : MASTER RESET 
/OE : ASSERTIVE LOW OUTPUT ENABLE 


=e *%e eae “ee se VO 


OUTPUT SIGNALS 

DSIN, Q2, Q1, QO : 4-BIT SHIFT REGISTER OUTPUT SIGNALS 
PHDB : POSITIVE HDB3 SIGNAL 

NHDB : NEGATIVE HDB3 SIGNAL 


=e ™s te Ne 


EQUATIONS 
716R4 GENERATES TWO HDB SIGNALS AND PROVIDES A 4-BIT SHIFT REGISTER 


?(1) A POSITIVE HDB3 AND A NEGATIVE HDB3 SIGNALS ARE GENERATED 


/PHDB = QINO * /QIN1 ; THE PHDB IS GENERATED AT 
+ /QINO * QIN2 * QIN3 ; STATE 1, STATE 4, OR STATE 7 
+ QIN] * QIN2 * /QIN3 ; THIS EQUATION IS THE INVERTING 
+ QIN1 * /QIN2 ; LOGIC OF THE PHDB SIGNAL 
+ /QIN2 * /QIN3 
/NHDB /QINO * /QIN1 THE NHDB IS GENERATED AT 


QIN1 * /QIN2 
/QIN2 * QIN3 
QINO * QIN2 


STATE 3, STATE 9, OR ATATE 12 
THIS EQUATION IS THE INVERTING 
LOGIC OF THE NHDB SIGNAL 


+++ il 


=e %e Se SO 


7(2) THIS IS A 4-BIT SHIFT REGISTER WITH MASTER RESET 


/Q0 := /NRZIN ; LSB OF SHIFT REGISTER 
+ MR ; MASTER RESET 

/QU 2= /Q0 
+ MR 

/Q2 2= /Q1 
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+ MR 

/DSIN := /Q2 * MSB OF SHIFT REGISTER 
+ MR 

SIMULATION 


TRACE _ON MR CLK QINO QIN1 QIN2 QIN3 PHDB NHDB NRZIN DSIN 





SETF OE MR QINO /QIN1 /QIN2 QIN3 NRZIN ;RESET CONDITION 
CLOCKF CLK 

SETF /MR QINO /QIN1 /QIN2 QIN3 NRZIN ;VECTOR 1 
CLOCKF CLK 

SETF /MR QINO /QIN1 /QIN2 QIN3 NRZIN ;VECTOR 2 
CLOCKF CLK 

SETF /MR /QINO /QIN1 /QIN2 /QIN3 /NRZIN ;VECTOR 3 
CLOCKF CLK 

SETF /MR /QINO /QIN1 /QIN2 QIN3 NRZIN ;VECTOR 4, PHDB 
CLOCKF CLK ;DSIN STARTS 
SETF /MR /QINO QIN1 /QIN2 QIN3 /NRZIN ;VECTOR 5 
CLOCKF CLK ;DSIN 

SETF /MR /QINO /QIN1 QIN2 QIN3 /NRZIN ;VECTOR 6 
CLOCKF CLK ;/DSIN 

SETF /MR QINO /QIN1 QIN2 QIN3 NRZIN ;VECTOR 7 
CLOCKF CLK ;DSIN 

SETF /MR QINO QIN1 QIN2 QIN3 NRZIN ;VECTOR 8, PHDB 
CLOCKF CLK ;/DSIN 

SETF /MR QINO /QIN1 QIN2 /QIN3 /NRZIN ;VECTOR 9 
CLOCKF CLK ;/DSIN 

SETF /MR /QINO QIN1 /QIN2 /QIN3 NRZIN ;VECTOR 10 
CLOCKF CLK ;DSIN 

SETF /MR QINO QIN1 /QIN2 /QIN3 /NRZIN ;VECTOR 11 
CLOCKF CLK ;DSIN 

SETF /MR QINO /QIN1 /QIN2 /QIN3 NRZIN ;VECTOR 12, NHDB 
CLOCKF CLK ;/DSIN 
TRACE_OFF 
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Title HDB3 decoder 

Pattern HDB3DE.pds 

Revision A 

Author Cindy Lee 

Company Monolithic Memories Inc., Santa Clara, Ca 
Date 3/12/87 


CHIP HDB3DE PAL20RS4 


;PIN1 PIN2 PIN3 PIN4 PINS PIN6 
CLK NC PHDB NHDB MR NC 


;PIN7 PIN8 PINS PIN10 PIN11 PIN12 
NC NC NC NC NC GND 


;PIN13 PIN14 PIN15 PIN16 PIN17 PIN18 
/OE ERROR VERROR NC Q3 Q2 


;PIN19 PIN20 PIN21 PIN22 PIN23 PIN24 
Ql Q0d NC DSOUT RESET VCC 


INPUT SIGNALS 


/OE ASSERTIVE LOW OUTPUT ENABLE 


U 

; CLK : EXTERNAL CLOCK 

; PHDB : POSITIVE HDB3 SIGNAL 
; NHDB : NEGATIVE HDB3 SIGNAL 
; MR : MASTER RESET 

t e 


; OUTPUT SIGNALS 

; ERROR : ERROR FLAG 

; VERROR : BIPOLAR VIOLATION ERRIR 

; Q3, Q2, Ql, QO : HDB3 DECODING STATE VARIABLE OUTPUTS 
; DSOUT : DATA SERIAL OUTPUT 

; RESET : RESET SIGNAL 


EQUATIONS 


?THE 20RS4 GENERATES ONE SIGNAL, TWO ERROR FLAGS 
;AND A HDB3 DECODING STATE MACHINE 


;(1) A RESET SIGNAL IS GENERATED TO CLEAR 
; THE 4-BIT SHIFT REGISTER 


STATE 8 OR STATE 9 
WITH /PHDB * NHDB 
STATE 3 OR STATE 4 
WITH PHDB * /NHDB 


RESET = QO * Q2 * /Q3 * /PHDB * NHDB 


+ Q1 * /Q2 * Q3 * PHDB * /NHDB 


me Me Me Me 


;(2) A VERROR SIGNAL IS GENERATED TO INDICATE THE VIOLATION ERROR 


VERROR = /Q0 * Q1* Q2 * Q3 ; STATE 11 
+ Q0 * /Q1i * Q2 * Q3 ; STATE 12 
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CLOCKF CLK 


SETF /MR PHDB NHDB 
CLOCKF CLK 


SETF /MR /PHDB /NHDB 
CLOCKF CLK 


SETF /MR /PHDB /NHDB 
CLOCKF CLK 


SETF /MR PHDB /NHDB 
CLOCKF CLK 


SETF /MR /PHDB /NHDB 
CLOCKF CLK 


SETF /MR /PHDB /NHDB 
CLOCKF CLK 


SETF /MR /PHDB /NHDB 
CLOCKF CLK 


SETF /MR PHDB /NHDB 
CLOCKF CLK 


SETF /MR /PHDB NHDB 
CLOCKF CLK 


SETF /MR PHDB /NHDB 
CLOCKF CLK 


SETF /MR /PHDB NHDB 
CLOCKF CLK 


SETF /MR /PHDB /NHDB 
CLOCKF CLK 


SETF /MR /PHDB /NHDB 
CLOCKF CLK 


SETF /MR /PHDB NHDB 
CLOCKF CLK 


SETF /MR PHDB /NHDB 
CLOCKF CLK 


SETF /MR /PHDB NHDB 
CLOCKF CLK 


SETF /MR PHDB /NHDB 
CLOCKF CLK 


SETF /MR /PHDB /NHDB 
CLOCKF CLK 


; VECTOR 


; ERROR 


; VECTOR 


; VECTOR 


7 VECTOR 


; VECTOR 


; VECTOR 


; VECTOR 


; VECTOR 


; VECTOR 


; VECTOR 


7 VECTOR 


7 VECTOR 


; VECTOR 


7 VECTOR 


; VECTOR 


; VECTOR 


; VECTOR 


; VECTOR 


1, 


8, 


9, 


STATE 0, 


STATE 0, 


STATE 0, 


STATE 1, 


STATE 2, 


STATE 3, 


STATE 4, 


STATE 5, 


7RESET 


STATE 6, 


0000 


0000 


0000 


1000 


1001 


1011 


1010 


0010 


0110 


10, STATE 1, 1000 


11, STATE 


12, 


13, 


14, 


15, STATE 


16, 


17, STATE 


18, 


STATE 7, 


STATE 8, 


6, 


0110 


0100 


0101 


STATE 10, 1111 


7RESET 


1, 1000 


STATE 6, 0110 


STATE 2, 


1, 


1000 


1001 
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?(3) A ERROR SIGNAL IS GENERATED WHEN PHDB AND NHDB BOTH ARE HIGH 
; AT THE SAME TIME, 

ERROR = NHDB * PHDB BOTH PHDB AND NHDB 
ARE HIGH 


=e ™e 


7(4) A DATA SERIAL OUTPUT SIGNAL IS GENERATED 


DSOUT = NHDB 
+ PHDB 


;(5) THE HDB3 DECODING STATE MACHINE 


Q3 := /NHDB * /PHDB * /Q2 * Q3 * /MR ; MSB STAE VARIABLE 
+ /NHDB * PHDB * /Q0 * /Q3 * /MR 
+ /NHDB * /QO * Ql * /Q2 * /Q3 * /MR 
+ /NHDB * /QO * Qi * Q2 * Q3 * /MR 
+ /NHDB * /Q1 * /Q2 * Q3 * /MR 
+ /NHDB * PHDB * QO * Q2 * /MR 
+ NHDB * /PHDB * Q2 * /Q3 * /MR 
+ NHDB * /PHDB * QO * Q2 * /MR 
Q2 := /NHDB * PHDB * /Q0 * Ql * /Q2 * /Q3 * /MR 
+ /NHDB * PHDB * /QO * Ql * Q2 * Q3 * /MR 
+ /NHDB * PHDB * /Q1l * /Q2 * Q3 * /MR 
+ NHDB * /PHDB * /QO * /Q3 * /MR 
+ NHDB * /PHDB * Ql * Q3 * /MR 
+ NHDB * /PHDB * /Q2 * Q3 * /MR 
+ /PHDB * Q2 * /Q3 * /MR 
+ /PHDB * QO * Q2 * /MR 
Ql := /NHDB * /PHDB * QO * Q3 * /MR 
+ /NHDB * PHDB * /Q2 * Q3 * /MR 
+ /NHDB * PHDB * /QO * Ql * /Q2 * /MR 
+ /NHDB * PHDB * /QO * Ql * Q3 * /MR 
+ NHDB * /PHDB * /QO * /Q2 * /MR 
+ NHDB * /PHDB * /QO * Ql * Q3 * /MR 
+ /PHDB * QO * /Q2 * Q3 * /MR 
+ /PHDB * Ql * /Q2 * Q3 * /MR 
+ /PHDB * QO * Q2 * /Q3 * /MR 
Qo := /NHDB * /PHDB * /Q1 * /Q2 * Q3 * /MR 
+ NHDB * /PHDB * Q2 * /Q3 * /MR ; LSB STATE VARIABLE 
+ /PHDB * /Q1l * Q2 * /Q3 * /MR 
+ /PHDB * QO * Q2 * /Q3 * /MR 
+ NHDB * /PHDB * QO * Q2 * /MR 
SIMULATION 


TRACE_ON MR CLK PHDB NHDB DSOUT Q3 Q2 Q1 QO RESET VERROR ERROR 


SETF OE MR PHDB NHDB ;RESET CONDITION 
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SETF /MR /PHDB /NHDB ;VECTOR 19, STATE 3, 1011 
CLOCKF CLK 

SETF /MR PHDB /NHDB *;VECTOR 20, STATE 5, 0010 
CLOCKF CLK 7RESET 

SETF /MR /PHDB NHDB ;VECTOR 21, STATE 6, 0110 
CLOCKF CLK 

SETF /MR /PHDB /NHDB ;VECTOR 22, STATE 7, 0100 
CLOCKF CLK 

SETF /MR /PHDB /NHDB ;VECTOR 23, STATE 8, 0101 
CLOCKF CLK 

SETF /MR /PHDB NHDB ;VECTOR 24, STATE 9, 0111 
CLOCKF CLK 

SETF /MR /PHDB NHDB ;VECTOR 25, STATE 10, 1111 
CLOCKF CLK *RESET 

TRACE OFF 
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With the rapid increase in popularity of T1 trunk links following 
the deregulation of AT&T, the transparent nature of the 24 data 
channels must be supported by handling of the framing bits. 
These must comply with certain standards, the most common 
of which is known as D4. 


T1-Carriers 


More than 100 million miles of T-carrier lines are installed in the 
U.S. and Canada. They are full-duplex digital communication 
links with a specific bit rate and format. T-carrier lines’ 
transmission quality and reliability exceeds that of analog 
lines. T-carrier lines are available in a range of transmission 
rates between 1.544 Mbps and 274 Mbps, designated T1, T1C, 
T2, T3, and T4, 


The most common standard, T1 can be considered a building 
block for the others. T1 transmits at 1.544 Mbps. AT&T 
specifications define D1, D1D, D2, D3, D4 and ESF framing 
structures for T1 carriers. These standards specify the 
framing-bit pattern and frame synchronization. D1 was 
established in 1969 and has been superseded by the later 
standards. D1D, D2, D3, and D4 framing formats group twelve 
frames into a superframe. Today, the D4 framing format is the 
most prevalent. The future format is ESF, also called Fe, which 
groups twenty-four frames into an extended superframe. It is 
presently under definition, and will be supported in the future. 


D4 Framing Format 


The T1 standard for D4 framing defines a superframe as a 
group of twelve frames. A frame consists of twenty-four 8-bit 
channels plus one framing bit for a total of 193 bits. The 8-bit 
channel consists of one polarity bit and seven magnitude bits; 
however, in frames six and twelve, the eighth bit in each 
channel carries signaling information. Since this bit cannot 
transmit user data (either voice or data) it is called robbed-bit 
signaling. The signaling bit transmits requests for services and 
calling information. At the 1.544 Mbps transmission rate, 8 
kbps are overhead for framing; so the user transmission rate is 
1.536 Mbps. Because the user transmission rate is divided 
into twenty-four channels, each channel transmits at 64 kbps. 


A frame bit should occur every 193rd bit in the data stream. 
Frame detection occurs when a sequence of sampled frame 
bits matches the frame pattern. However, occasionally data 
can produce the same pattern. False framing occurs when the 
frame synchronization circuitry mistakes data bits for the 
framing bits. 


A frame-bit error occurs when the incoming frame bit does not 
match the expected framing bit. If two out of four consecutive 
frame bits are in error, then frame synchronization is “lost.” 
Once frame synchronization is “lost,” a search for the frame 
pattern is started. AT&T T1 standards specify the maximum 
time to search and re-synchronize as 50 ms. This time period 
is called the reframe time. 


For the D4 frame format, the framing bits are divided into two 


groups: Ft and Fs. The terminal framing bit (Ft) identifies frame 
boundaries. The signaling frame bit (Fs) identifies the 
superframe boundary and the frames containing signaling bits. 
Ft and Fs are interleaved to create the 12-bit framing pattern 
shown in Table 1.This pattern is repeated every twelve frames. 

















SUPER- 
FRAMING 
SF 


TERMINAL | SIGNALING 
FRAMING FRAMING 
Ft Fs 











10 


yy 
_ 


= 
Le 


Table 1. Framing Pattern 


Hardware Implementation 


The frame synchronization logic was implemented in PAL 
devices because current single-chip solutions do not offer 
enough flexibility in line coding. Using PAL devices, this 
design has the flexibility to interface various line coding 
devices. 


The PAL devices implement the frame synchronization logic, 
which can be divided into three sections. The first section 
detects the D4 framing pattern. The second section searches 
for frame synchronization. The third section provides the su- 
perframe timing. These are implemented using four CMOS PAL 
devices, a shift register, and a counter as shown in Figure 1. 


The frame detection hardware samples every 193rd bit of the 
data stream. The data is sampled on the positive edge of 
T1_CK. Either a shift register or a state machine can be used 
to sample the framing bits. The data stream is shifted into a 
(193x23)-bit shift register with taps every 193 bits. The taps 
are fed into two identical Frame Pattern Detection (FPD) PAL 
devices. The more taps sampled, the less likely a data pattern 
will reproduce the framing pattern. In this design, twenty-four 
taps ensure reasonable prevention of false framing. 
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Each FPD device monitors twelve framing bits. If the framing 
pattern is found, a FP_DETB signal is generated. From these 
signals, the controller determines the correct framing pattern. 
The FPD’s FRM1B indicates the start of a superframe. 


The first FPD’s inputs are the first eleven taps from the shift 
register plus the data input stream. The second FDP monitors 
the remaining taps from the shift register as shown in Figure 1. 


The control section is a controller and a counter, which search 
for frame synchronization. The controller, SYNC PAL, uses the 
193-bit counter to determine if the frame detection signals are 
spaced every 193 bits. First the controller searches for the 
first framing pattern. During the search operation, SYNC PAL 







FRM _ BIT 


r 193x23 SHIFT REGISTER 


waits for the FPD’s frame detection signals. When a framing 
pattern is found, the counter is reset. If 193 bits later, the next 
expected framing bit is found, the controller is “in sync.” It 
continues checking the framing pattern every 193 bits. If the 
framing pattern is not found, it assumes that the incoming 
framing bit is in error. If the next four framing bits are correct, 
the controller remains “in sync.” If, during this time, a second 
frame-bit error occurs, the framing is “lost.” The controller, 
then, asserts the lost frame signal, NO_FRM. Once the framing 
is “lost,” the controller begins searching for the correct frame 
pattern. While the controller searches for the correct frame 
pattern, the previous frame synchronization is maintained until 
the controller re-synchronizes. 


12 










SRD(1-11) 
SRD(0) FPD PAL 


pit FPD1 





DATA 
NO_T1C 


SYNC PAL 
PALC20R4Z 











f= 193 COUNTER 













PALC16L8Z 


FPD2 


SRD(0-11) 
FPD PAL 
PALC16L8Z 






NO_FRM 
BYT_CK 
SUNK 
FRM6 
FRAME 
PAL 
PALC16R6Z FRM12 

SOF 


Figure 1. Frame Synchronization Logic 
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Figure 2 shows the sync. state machine for the controller. 
Signals FPD1B and FPD2B (from the FPDs) indicate a T1 frame 
pattern detection, while SOF (from the counter) indicates 193 
counts. Beginning in states F and G, the controller searches 
for the frame pattern. Once the framing has been found, the 
state machine enters state A. At state A SUNK is asserted, 
which indicates “in sync.” The state machine remains in state A 
until a single frame-bit error occurs. Upon the first frame-bit 
error, the state machine enters state B. If the next four framing 
bits are correct, the state machine sequences through states 
C, D, and E, which returns to state A. However, if during this 
time, a second frame bit error occurs, the state machine enters 
state F. Loss of T1 carrier (NO_T1CB) or reset (RSTB) forces 
the state machine into the searching state. 


SYNC PAL performs two other miscellaneous functions. When 
a single framing-bit error is encountered, the device corrects 
the bit. The corrected version of the input data stream, BIT, is 
fed into the shift register. The device also generates the 
BYT_CK. A single pulse is asserted for every channel for 
external serial-to-parallel (S/P) data conversion. For this imple- 
mentation BYT_CK is asserted when the counter’s three least 





significant bits are all zeros. Depending on the requirements of 
the S/P converter, any combination of the counter’s three least 
significant bits could generate BYT_CK for the correct 
alignment with the channel sample. 


SUPER FRAME PAL generates the superframe timing. It 
generates FRM_6 and FRM_12 by counting the frames. These 
signals indicate the frames containing the signaling bits. The 
CMOS PAL device implements a 4-bit counter. When the frame 
is “in sync,” SOF increments the counter, while FRM_1 clears 
it. If the controller is “out of frame sync,” the counter continues 
counting, based on the last known frame position. 


Design Recommendations 


While this design meets the D4 specification, one design 
improvement might be useful. A signal flagging single-framing 
bit errors could be added. Monitoring this signal indicates the 
amount of random noise on the line. This signal could warn that 
a line is noisy even though frame synchronization was never 
lost. 


-FPD1B * FPD2B * SOF + SOF 






FPD1B * SOF + FPD2B * SOF 





















































BIT = BITA* 

A LOB = SOF 
SUNK = 1 

NO_FRM =0 














FPD18 * FPD2B * SOF 




































ae BIT = DATA FPDiB * FPD2B * SOF BIT = DATA 
SOF = \DB=SOF SOE 
SUNK =1 F 
= NO_FRM=0 
FPD1B*SOF+ SOF 
‘ FPD1B * SOF + 
FPD2B * SOF FPD2B * SOF 
FPD1B * FPD2B * SOF 
FPD1B * SOF + 
FPD2B * SOF 
FPD1B * FPD2B * SOF 
BIT=DATA_ _ 
LDB = FPD1B * FPD2B + SOF 
SUNK =0 
NO_FRM=1 
FPD1B * SOF + FPD1B * SOF + 
FPD2B * SOF FPD2B * SOF 
= FPD1B + FPD2B 
SOF BIT = DATA BIT =DATA 
FPD1B * FPD2B * SOF p LDB=SOF ear 
SUNK = 1 
NO_FRM=0 
NOTE: “BITA = FPD1B * FPD2B * SOF * DATA + 
SOF * DATA+ 
FPD1B * SOF * DATA+ 
FPD2B * SOF * DATA 
Figure 2. Sync State Machine 
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TITLE FRAME_DETECTION_PAL 


PATTERN Tl FRAME DETECTION PAL FOR T1 INTERFACE 


REVISION P1.04 


AUTHOR STEVE PATTERSON AND THERESA SHAFER 


COMPANY 
DATE 6/4/87 


=e te we 


of Frame 1. 


CHIP FPD PAL16L8 


;PINS 
i1 2 
SRDO SRD1 
11 12 
SRDS FRM1B 
; INPUTS: 
; 

OUTPUTS: 


we Te te Ne Me Me 


3 4 5 6 7 
SRD2 SRD3 SRD4 SRD5 SRD6 
13 14 15 16 17 
SRD10 SRD11 NC FRM1_6B NC 
SRD(11-0) FRAMING BITS 
SOF LAST KNOWN START OF FRAME 
FP_DETB 

PATTERN DETECTED 
FRM1B 

FRAME 1 
FRM1_6B 


FRAMING PATTERNS 


we Ne Te Me TO Ne Te TS “se Ne Te WS Te Te “We VS We we 


EQUATIONS 


/FRM1B = SOF * SRD11 * /SRD10 * /SRD9 * /SRD8 * SRD7 * SRD6 * /SRD5 
* SRD4 * SRD3 * SRD2 * /SRD1 * /SRDO 


/FRM1_6B = SRD11 * /SRD10 * /SRD9 * /SRD8 * SRD7 * SRD6 * /SRDS5 
* SRD4 * SRD3 * SRD2 * /SRD1 * /SRDO 


COORPRFPORPPRPKHPOOPRPRPUWN 


~- = *s* = = & > 2&2 82& = S&S 
COPRPOPRPRPOORPOCOPFUWN 
--=— = &» 8s = selmUC OlmUmC UC UC TUUC DCU 
CP RPOPFRPOORPOO 


-~_— =. 8 se 2 %* & 2 F BW & 


ooao~ar eee 


onoWNn 
onOoWnn 
Nown 
AoOWNn 
nown 
Sonn 
VOWNn 
Nvnown 


SCRPOOORPFORPFRPO PUWN 


ba a i i i i i | 


This PAL device monitors 12 193rd bits in the incoming Tl NRZ data s 
It detects any valid Frame Pattern (start of any Frame) and the start 


ACTIVE LOW SIGNAL INDICATING FRAMING 
ACTIVE LOW SIGNAL INDICATING START OF 


ACTIVE LOW SIGNAL INDICATING FRAME 1 


TO 6 DETECTED 


rPOOOFPRPORPRPF OO oUuWwNn 


= - = ~ ~ ~ = ~ = = = = 
FPOPHHOOROCOORF 
~= ~ ~ = ~= ™= = ~ = = = ~-= 
OFPFPRPOOrFOOOFMPF 
= = = 7 ~ = = = -= = = 
KPRHOOFPOCOORRO 
= ~~ = = ~~ = ~= ~-= = ~ = ~= 
KFROORFOOOFPPOPR 
= = = = = 7” = = = = = = 
POORPOOOHKOPFP 
= ~= = = -~ ~ ~~ = 7” ~ = is 
COPCOOPPOFPEHP 
= bd = ”- -= ~~ ~ = = ~ ™ ~~ 
ed Led Receed Reed Bicead Meet Keel Gere feed femed eed Senne 
me ™e Me Se SO TO Ne Ve VO WS Ve VEO 


FPROrPRrRPRrRPOOrROOCO 


GND 


FP_DETB vcc 
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+ /SRD11 * SRD1O * /SRD9 * /SRD8 * /SRD7 * SRD6 * SRD5 


* /SRD4 * SRD3 * SRD2 * SRD1 * /SRDO 


+ /SRD11 * /SRD10 * SRD9 * /SRD8 * /SRD7 * /SRD6 * SRD5 


* SRD4 * /SRD3 * SRD2 * SRD1 * SRDO 
+ SRD11 * /SRD10 * /SRD9 * SRD8 * /SRD7 * /SRD6 * /SRD5 
* SRD4 * SRD3 * /SRD2 * SRD1 * SRDO 
+ SRD11 * SRD10 * /SRD9 * /SRD8 * SRD7 * /SRD6 * /SRD5 
* /SRD4 * SRD3 * SRD2 * /SRD1 * SRDO 
+ SRD11 * SRD10 * SRD9 * /SRD8 * /SRD7 * SRD6 * /SRD5 
* /SRD4 * /SRD3 * SRD2 * SRD1 * /SRDO 
/FP_DETB = /FRM1_6B 
+ /SRD11 * SRD10 * SRD9 * SRD8 * /SRD7 * /SRD6 * SRDS5 
* /SRD4 * /SRD3 * /SRD2 * SRD1 * SRDO 
+ SRD11 * /SRD10 * SRD9 * SRD8 * SRD7 * /SRD6 * /SRD5 
* SRD4 * /SRD3 * /SRD2 * /SRD1 * SRDO 
+ SRD11 * SRD10 * /SRD9 * SRD&8 * SRD7 * SRD6 * /SRD5 
* /SRD4 * SRD3 * /SRD2 * /SRD1 * /SRDO 
+ /SRD11 * SRD10 * SRD9 * /SRD8 * SRD7 * SRD6 * SRD5 
* /SRD4 * /SRD3 * SRD2 * /SRD1 * /SRDO 
+ /SRD11 * /SRD10 * SRD9 * SRD8 * /SRD7 * SRD6 * SRD5 
* SRD4 * /SRD3 * /SRD2 * SRD1 * /SRDO 
+ /SRD11 * /SRD10 * /SRD9 * SRD8 * SRD7 * /SRD6 * SRDS 
* SRD4 * SRD3 * /SRD2 * /SRD1 * SRDO 
Pel ge te Se ete ee ce fe et eh ee es Se eS See i eee cee Se we) ce 
; e e e ° e e e e e ° e e e e e ° e . ° e e e e e e ° e e . ° ° 
SIMULATION 
TRACE_ON 


SOF SRD11 SRD10 SRD9 SRD8 SRD7 SRD6 SRD5 SRD4 SRD3 SRD2 


FP_DETB FRM1B FRM1_6B 


SETF SOF 


SETF 


SRD1 SRDO 


c 


=e 


=e 


=e “oe 


=e =e =e 


=e 


FRM 2 


FRM 3 


FRM 4 


FRM 5 


FRM 6 


FRM 1-6 


FRM 7 


FRM 8 


FRM 9 


FRM 10 


FRM 11 


FRM 12 


FRAME 1 


SRD11 /SRD10 /SRD9 /SRD8 SRD7 SRD6 /SRD5 SRD4 SRD3 SRD2 /SRD1 /SRDO 


SETF 


° 
c 


FRAME 2 


/SRD11 SRD10 /SRDI /SRD8 /SRD7 SRD6 SRD5 /SRD4 SRD3 SRD2 SRD1 /SRDO 


SETF 


/SRD11 /SRD10. SRD9 /SRD8 /SRD7 /SRD6 SRD5 SRD4 /SRD3 SRD2 


SETF 


SRD11 /SRD10 /SRD9 SRD8 /SRD7 /SRD6 /SRDS5 SRD4 SRD3 /SRD2 


SETF 
SRD11 
SETF 
SRD11 
SETF 
/SRD11 
SETF 


° 
e 


FRAME 3 


SRD1 SRDO 


c 


FRAME 4 


SRD1 SRDO 


. 
c 


FRAME 5 


SRD10 /SRD9 /SRD8 SRD7 /SRD6 /SRD5 /SRD4 SRD3 SRD2 /SRD1 SRDO 


SRD10 SRD9 /SRD8 /SRD7 SRD6 /SRDS5 /SRD4 /SRD3 SRD2 


SRD10 SRD9 SRD8 /SRD7 /SRD6 SRDS /SRD4 /SRD3 /SRD2 


FRAME 6 


SRD1 /SRDO 


e 
’ 


FRAME 7 


SRD1 SRDO 


, 


FRAME 8 


SRD11 /SRD10 SRD9 SRD8& SRD7 /SRD6 /SRDS5 SRD4 /SRD3 /SRD2 /SRD1 SRDO 


SETF 
SRD11 

SETF 

/SRD11 


e 
’ 


FRAME 9 


SRD10 /SRD9 SRD8 SRD7 SRD6 /SRDS5 /SRD4 SRD3 /SRD2 /SRD1 /SRDO 


e 
‘ 


FRAME 10 


SRD10 SRD9 /SRD8 SRD7 SRD6 SRD5 /SRD4 /SRD3 SRD2 /SRD1 /SRDO- 
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SETF ; FRAME 11 
7SRD11 /SRD1O0 SRD9 SRD8 /SRD7 SRD6 SRDS SRD4 /SRD3 /SRD2 SRD1 /SRDO 
SETF ; FRAME 12 


/SRD11 /SRD10 /SRD9 SRD8 SRD7 /SRD6 SRDS SRD4 SRD3 /SRD2 /SRD1 SRDO 


SETF ; NOT FRAMING PATTERN 
/SRD11 /SRD10 /SRD9 /SRD8 /SRD7 /SRD6 /SRD5 /SRD4 /SRD3 /SRD2 /SRD1 /SRDO 
SETF ; NOT FRAMING PATTERN 
/SRD11 /SRD10 SRDS /SRD8 /SRD7 /SRD6 /SRD5 /SRD4 /SRD3 /SRD2 /SRD1 /SRDO 
SETF ; NOT FRAMING PATTERN 
/SRD11 /SRD10 /SRD9 /SRD8 /SRD7 /SRD6 /SRDS5 /SRD4 /SRD3 /SRD2 /SRD1 /SRDO 
SETF ; NOT FRAMING PATTERN 
/SRD11 SRD10 SRD9 /SRD8 SRD7 /SRD6 /SRD5 /SRD4 /SRD3 /SRD2 /SRD1 /SRDO 
SETF ; NOT FRAMING PATTERN 
SRD11 SRD10 SRD9 SRD8 /SRD7 SRD6 SRD5 SRD4 SRD3 SRD2 SRD1 SRDO 
SETF ; NOT FRAMING PATTERN 


SRD11 SRD10 SRD9 SRD&8 SRD7 SRD6 SRD5 SRD4 SRD3 SRD2 SRD1 SRDO 


TRACE_OFF 
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PALASM SIMULATION, V2.22 - MARKET RELEASE (11-19-86) 


(C) 


PALASM SIMULATION SELECTIVE TRACE LISTING 


Title 
Pattern 
Revision 


FPD 
Page : 


SOF 
SRD11 
SRD10 
SRD9 
SRD8 
SRD7 
SRD6 
SRD5 
SRD4 
SRD3 
SRD2 
SRD1 
SRDO 
FP_DETB 
FRM1B 


1 


; FRAME DETECTION _PAL 
: Tl FRAME DETECTION PAL FCompany 


P1.04 


gggg9g9g9999 
HHHHHHHHHH 


XHLLHHHLHH 
XLHLLHHHLH 
XLLHLLHHHL 
XLLLHLLHHH 
XHLLLHLLHH 
XHHLLLHLLH 
XLHHLLLHLL 
XHLHHLLLHL 
XHHLHHLLLH 
XHHHLHHLLL 
XLHHHLHHLL 
XLLHHHLHHL 
XLLLLLLLLL 
XLHHHHHHHH 


FRM1_6B XLLLLLLHHH 
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ggggg9999 
HHHHHHHHH 


LLLLLLLHH 
HLLLLLHHH 
HHLLHLHHH 
LHHLLLLHH 
HLHLLLHLH 
HHLLLLLHH 
HHHLLLLHH 
LHHLLLLHH 
LLHLLLLHH 
HLLLLLLHH 
LHLLLLLHH 
LLHLLLLHH 
LLLHHHHHH 
HHHHHHHHH 
HHHHHHHHH 


Author 


Date 


- COPYRIGHT MONOLITHIC MEMORIES INC, 1986 


STEVE PATTERSON AND THER 


6/4/87 


ON mononthto Mt] memories & 


D4 Frame Synchronization 


EE EE SL AE a a a a a OS TE 


TITLE SYNC_PAL 
PATTERN Tl FRAME SYNC PAL FOR Tl INTERFACE 
REVISION P1.06 
AUTHOR STEVE PATTERSON AND THERESA SHAFER 


COMPANY 


DATE 6/4/87 


-e ™e we 


provisional Sync, 
process. 


CHIP SYNC PAL20R4 


; PINS 

71 2 3 4 5 6 7 8 
T1_CKB RSTB NO_T1CB DATA FPD1B FPD2B BC2 BC1l 
79 10 11 12 

BCO SOF NC GND 

713 14 15 16 17 18 19 20 
OEB NC NO_FRM SUNK LDB Q0 Ql Q2 
721 22 23 24 

BYT_CK BIT NC vec 

; INPUTS: T1_CKB ACTIVE LOW EXTERNAL Tl CLOCK 

; RSTB ACTIVE LOW MASTER RESET 

; NO_T1CB NO Tl SIGNAL ACTIVE LOW INPUT 

; FPD(1-2)B ACTIVE LOW FRAME PATTERN DETECT INPUTS 
7 BC(2-0) 193 COUNTER 3 LSBs INPUTS USED FOR BYT_CK 
; SOF LAST KNOWN START OF FRAME 

: DATA INPUT DATA STREAM 

; OEB ACTIVE LOW OUTPUT ENABLE INPUT 
;OUTPUTS: BIT NEXT T1 BIT STREAM 

; LDB ACTIVE LOW LOAD COUNTER SIGNAL 

; SUNK IN SYNC STATE AND NO FRAME BIT ERROR 
; NO_FRM FRAME LOST, IN SEARCH STATE 

; Q(2~-0) STATE VARIABLES 

; BYT CK 8-BIT CHANNEL SAMPLE CLOCK 
EQUATIONS 


;STATE MACHINE OUTPUTS AND STATE VARIABLES 


/BIT 


/LDB 


/ SUNK 


= DATA * /Q2 * /Q1 * QO * FPD1B * SOF 
+ DATA * /Q2 * /Q1 * QO * FPD2B * SOF 
+ /DATA * /SOF 

* /FPD1B * /FPD2B 


+ /DATA 
+ /DATA 
+ /DATA 
+ /DATA 


oss Q2 * 


= /Q2 * 


* 
* 
* 


Q2 
Ql 
/Q0 


Ql * QO * /FPD1B * /FPD2B + SOF 


/Q1l * /QO + Ql * QO ; IN FRAME SYNC 


This PAL device decides whether the Tl Interface is in Frame Sync, 
or Out of Sync. It controls the Frame Sync 
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/NO_FRM = /Q2 + /Q1 + /QO ; SEARCHING FOR FRAME SYNC 
; USING THE FOLLOWING STATE ASSIGNMENTS 
; A = 001 IN SYNC STATE 
; B = 010 SINGLE FRAME BIT ERROR STATE 
; Cc = 110 
; D = 100 
; E = 101 
; F = 111 SEARCHING STATE 
; G = 011 SEARCHING STATE 
i H = 000 UNUSED STATE - GOES TO F 
/Q2 := /FPD2B * /FPD1B * Ql * QO * RSTB * NO _T1CB 


+ /Q2 * Ql * /SOF * RSTB * NO T1CB 
+ /Q2 * /Ql * QO * RSTB * NO T1CB 
+ /FPD2B * /FPD1B * QO * SOF * RSTB * NO T1CB 


/Q1 := /Ql * QO * /SOF * RSTB * NO T1CB 
+ Q2 * /Ql * /SOF * RSTB * NO T1CB 
+ /FPD2B * /FPD1B * Q2 * /QO * SOF * RSTB * NO _T1CB 
+ /FPD2B * /FPD1B * Q2 * /Q1 * RSTB * NO T1CB 
+ /FPD2B * /FPD1B * /Q2 * QO * SOF * RSTB * NO T1CB 


/Q0 := Ql * /QO * /SOF * RSTB * NO T1CB 
+ Q2 * /QO * /SOF * RSTB * NO T1CB 
+ FPD1B * /Q2 * /Ql * QO * SOF * RSTB * NO _T1CB 
+ FPD2B * /Q2 * /Q1 * QO * SOF * RSTB * NO_T1CB 
+ /FPD2B * /FPD1B * Ql * /QO * RSTB * NO_T1CB 


; BYTE CLOCK 


/BYT_CK = BC2 + BC1 + BCO 


e 
s eoeeeoeeewmeeevseeeveeeeeees eee eevee eeeeeeeeeeeeee eevee eee eee ee eevee 


c eeoecoevoveeoeeeeeweeeeeew eee eeeeeeewee eee eee seeeoereeeseeee eee eee ee wee eve 


SIMULATION 


TRACE_ON T1_CKB RSTB NO_T1CB DATA FPD1B FPD2B SOF 
Q2 Q1 QO BIT LDB SUNK NO_FRM 
BYT_CK BC2 BCl1 BCO 


ENABLE OUTPUT 
RESET REGISTERS 


SETF /OEB 
/RSTB /NO_T1CB 
CLOCKF T1_CKB 


=e Te 


SETF RSTB NO_T1CB 
DATA 
/BC2 /BC1 /BCO 
FPD1B FPD2B SOF 
CLOCKF T1_CKB 


RESET REGISTERS 


=e 


STATE MACHINE - GO TO STATE F 


=e 


SETF FPD1B /FPD2B SOF 


CLOCKF T1_CKB LOOPS AT STATE F 


=e 
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SETF /FPD1B FPD2B SOF 
CLOCKF T1_CKB 


SETF FPD1B /FPD2B /SOF 
CLOCKF T1_CKB 


SETF /FPD1B FPD2B /SOF 
CLOCKF T1_CKB 


SETF FPD1B FPD2B /SOF 
CLOCKF T1_CKB 


SETF /FPD1B /FPD2B SOF 
CLOCKF T1_CKB 


SETF FPD1B FPD2B SOF 
/BC2 /BC1 BCO 
CLOCKF T1_CKB 


SETF /FPD1B /FPD2B 
/BC2 BC1 /BCO 
CLOCKF T1_CKB 


SETF /SOF 
/BC2 BC1 BCO 
CLOCKF T1_CKB 
SETF FPD1B /FPD2B SOF 
BC2 /BC1 /BCO 
CLOCKF T1_CKB 


SETF /FPD1B /FPD2B /SOF 
BC2 /BC1 BCO 

CLOCKF T1_CKB 

SETF /FPD1B /FPD2B SOF 
BC2 BC1 /BCO 

CLOCKF T1_CKB 


SETF /SOF 
BC2 BC1 BCO 
CLOCKF T1_CKB 
SETF /FPD1B /FPD2B SOF 
/BC2 /BC1 /BCO 
CLOCKF T1_CKB 


SETF /FPD1B FPD2B SOF 

CLOCKF T1_CKB 

SETF FPD1B FPD2B /SOF 
/BC2 /BC1 BCO 

CLOCKF T1_CKB 

SETF /FPD1B /FPD2B SOF 

CLOCKF T1_CKB 


SETF /SOF 
CLOCKF T1_CKB 


~e 


’ 


e 
a 


e 
’ 


e 
a 


s 


’ 


=e 


=e 


=e 


LOOPS AT STATE F 


LOOPS AT STATE F 


LOOPS AT STATE F 


LOOPS AT STATE F 


; GO TO STATE G 


GO TO STATE F 


GO TO STATE G 


LOOP AT STATE G 


GO TO STATE F 


GO TO STATE G 


GO TO STATE A 


LOOP AT STATE A 


LOOP AT STATE A 


GO TO STATE B 


LOOP AT STATE B 


GO TO STATE C 


LOOP AT STATE C 


at Monolithic rata Memories cl 








2-413 











2-414 


D4 Frame Synchronization 


SETF /FPD1B /FPD2B SOF 
CLOCKF T1_CKB 


SETF /SOF 

CLOCKF T1_CKB 

SETF /FPD1B /FPD2B SOF 
CLOCKF T1_CKB 


SETF /SOF 
CLOCKF T1_CKB 
SETF /FPD1B /FPD2B SOF 
CLOCKF T1_CKB 


SETF FPD1B /FPD2B SOF 
CLOCKF T1_CKB 
SETF FPD1B /FPD2B SOF 
CLOCKF T1_CKB 


SETF /FPD1B /FPD2B /SOF 

CLOCKF T1_CKB 

SETF /FPD1B /FPD2B SOF 

CLOCKF T1_CKB 

SETF /FPD1B FPD2B SOF 

CLOCKF T1_CKB 

SETF /FPD1B /FPD2B SOF 

FOR I:=1 TO 1 DO BEGIN 
CLOCKF T1_CKB 

END 

SETF FPD1B SOF 

CLOCKF T1_CKB 


SETF /FPD1B /FPD2B /SOF 

CLOCKF T1_CKB 

SETF /FPD1B /FPD2B SOF 

CLOCKF T1_CKB 

SETF /FPD1B FPD2B SOF 

CLOCKF T1_CKB 

SETF /FPD1B /FPD2B SOF 

FOR I:=1 TO 2 DO BEGIN 
CLOCKF T1_CKB 

END 

SETF FPD1B /SOF 

CLOCKF T1_CKB 

SETF FPD1B SOF 

CLOCKF T1_CKB 


SETF /FPD1B /FPD2B /SOF 

CLOCKF T1_CKB 

SETF /FPD1B /FPD2B SOF 

CLOCKF T1_CKB 

SETF /FPD1B FPD2B SOF 

CLOCKF T1_CKB 

SETF /FPD1B /FPD2B SOF 

FOR I:=1 TO 3 DO BEGIN 
CLOCKF T1_CKB 


7 


=e 


me 


we 


bal 


=e 


=e 


=e 


~e 


=e 


=e 


=e 


=e 


me 


=e 


=e 


=e 


=e 


=e 


=e 


me 


GO TO STATE D. 


LOOP AT STATE D 


GO TO STATE E 


LOOP AT STATE E 


GO TO STATE A 


GO TO STATE B 


GO TO STATE F 


GO TO STATE G 
GO TO STATE A 


GO TO STATE B 


GO TO STATE C 


GO TO STATE F 


GO TO STATE G 
GO TO STATE A 


GO TO STATE B 


GO TO STATES C,D 


LOOP AT STATE D 


GO TO STATE F 


GO TO STATE G 
GO TO STATE A 


GO TO STATE B 


GO TO STATES C,D,E 
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END 
SETF FPD1B SOF 


CLOCKF T1_CKB GO TO STATE F 


=e 


SETF /DATA INVERT DATA 

SETF /FPD1B /FPD2B /SOF 

CLOCKF T1_CKB 

SETF /FPD1B /FPD2B SOF 

CLOCKF T1_CKB 

SETF /FPD1B FPD2B SOF 

CLOCKF T1_CKB 

SETF /FPD1B /FPD2B SOF 

FOR I:=1 TO 4 DO BEGIN 
CLOCKF T1_CKB 


=e 


GO TO STATE G 


=e 


GO TO STATE A 


=e 


GO TO STATE B 


=e 


GO TO STATES C,D,E,A 


=e 


END 
SETF FPD1B SOF 
CLOCKF T1_CKB 
TRACE_OFF 


GO TO STATE B 


=e 
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noe) 


a see 


Wor 
-9 CQ ¢ 


C 


NS 


Ti FRAME SYNC 
cq 


SYNC_FAL 


Fi. 
Q 


SIMULATION 


FBLASM STIMULATION, 


iC) 


Title 
Fattern 
Revision 


FALASM 
SYNC 


ign 


og ap hs tie 


, 
J 
—T" 


x | 





iy 





Ck 


K 


Ti_CKE 
NG Tick 
NO FRM 


FFDIE 


DATA 
SUME 
RYT 
HC1 
EEO 


RSTH 
FFD 
SOF 
SIT 
LDE 


QO 


(i 
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SY NIT 
Face 3: 
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‘ 
i 


' 
to 
hoa 
t 


aa 


‘ 
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i 
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RANA RSSAURSTAAEEASSOMSSEAEGGEANURESS 


2A lee re 
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TUTTO ee 


TTT] 


Q 


Fage : 


SYM(C 


ar 


CER 


Th, 


me Ei cae 


es Es 


Se cetera ate deat 


Cn 


NG TICE 


ROTE 
DATA 
FFDIE 
NO_UFRM 


ETT 
SUMNE 


LD 


C1 
GO 


~~ 


BCL 


— 


RTO 


td 
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SYNC 


Faqe : 


cq 


TTT Ey RC Cr eae ee 


ok 


ag 


Por 


sO Er a Se 


ae a} 


+ 
— 


T1_CEE 


tol 1a J tot too 
a a yp at ng a et 
1 to 
, sot 


43 


ta 
+ 
tos 


tt 
’ 


RiSsTh 


I 
tr 
pa i 
a 


FFDLE 





g IR 


wm 
a 


FFD 


SOF 


od 


QO 


BIT 


LDB 





SUNK 


NO_FRM 


-- - 


Zk 


RYT _ 


3 
mm 





BCO 
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SYNC 
Faqge 


cg 


cg 


cgqg 


TTT TTT 
Yee) 


cg 


cg 


ap 


—T 


— 


Pe 


i 


_CKE 


Tet 


1a 1 i 
a ee een np 
t 
i 


t 
aa 
te 
ul 


1 t 
sete Seem peeetemecteiam duteateninencess oem ante autiestatecerts paar ate amemeanieren oot coats 
' 
t 


RSOTE 


1 


3 

ap cena prenatal On ye ape ep en op mast eer nt ate Mar 
ret 
poe 


4 
i] 


NO_T1CE 
DATA 
FFDIE 


FPD2E 





oe 


NO_FRM 


SOF 
RIT 
DE 
SUNE 
RC 
BCi 
RCO 


Qt 
QO 
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SYNC 
Fage :° 6 


Ti CKE 
RSTH 


NO Tice 


t 
- 
! 
' 
J 
— 
' 
' 
5 
' 
' 


lemeatec r oe 


FFDIE 


FFD2H 


SOF 





EIT 
LDE 
SUNK: 
NG_FRM 
BYT_CE 
EC? 
EC 1 
BCO 


H 
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PALASM SIMULATION, V2.22 - MARKET RELEASE (11-19-86) 


(C) 


PALASM SIMULATION SELECTIVE TRACE LISTING 





Title : SYNC_PAL Author : STEVE PATTERSON AND THER 
Pattern : Tl FRAME SYNC PAL FOR Tl1Company =: 
Revision : P1.06 Date : 6/4/87 
SYNC 

Page : 1 

g cg cg cg cg cg cg cg cg cg cg c 

T1_CKB XHHLLHHLLH LLHLLHHLLH LLHLLHHLLH HLLHHLLHHL 
RSTB LLLLHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
NO_T1CB LLLLHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
DATA XXXXHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
FPD1B XXXXHHHHHH HLLLHHHHLL LHHHLLLLHH HHLLLLLLLL 
FPD2B XXXXHHHHLL LHHHLLLLHH HHHHLLLLHH HHLLLLLLLL 
SOF XXXXHHHHHH HHHHLLLLLL LLLLHHHHHH HHHHHHLLLL 
Q2 XXHHHHHHHH HHHHHHHHHH HHHHHHLLLL HHHHLLLLLL 
Ql XXHHHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
QO XXHHHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
BIT XXXXHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
LDB XXXXXXLLLL LLLLLLHHHH HHHHHHLLLL LLLLLLLLHH 
SUNK XXLLLLLLLL LLLLLLLLLL LLLLLLLLLL LLLLLLLLLL 
NO_FRM XXHHHHHHHH HHHHHHHHHH HHHHHHLLLL HHHHLLLLLL 
BYT_CK XXXXHHHHHH HHHHHHHHHH HHHHHHHHLL LLLLLLLLLL 
BC2 XXXXLLLLLL LLLLLLLLLL LLLLLLLLLL LLLLLLLLLL 
BCl1 XXXXLLLLLL LLLLLLLLLL LLLLLLLLLL LLHHHHHHHH 
BCO XXXXLLLLLL LLLLLLLLLL LLLLLLLLHH HHLLLLHHHH 
SYNC 

Page 

g cg cg cg cg cg cg cg cg cg c 

T1_CKB LHHLLHHLLH HLLHHLLHHL LHHLLHHLLH HLLHHLLHHL 
RSTB HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
NO_T1CB HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
DATA HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
FPD1B HHHHLLLLLL LLLLLLLLLL LLLLHHHHLL LLLLLLLLLL 
FPD2B LLLLLLLLLL LLLLLLLLLL HHHHHHHHLL LLLLLLLLLL 
SOF HHHHLLLLHH HHLLLLHHHH HHHHLLLLHH HHLLLLHHHH 
Q2 LLHHHHLLLL LLLLLLLLLL LLLLLLLLLL HHHHHHHHHH 
Ql HHHHHHHHHH LLLLLLLLLL LLHHHHHHHH HHHHHHHHLL 
QO HHHHHHHHHH HHHHHHHHHH HHLLLLLLLL LLLLLLLLLL 
BIT HHHHHHHHHH HHHHHHHHHH LLHHHHHHHH HHHHHHHHHH 
LDB HHLLLLLLLL LLLLHHHHLL LLLLLLHHHH LLLLHHHHLL 
SUNK LLLLLLLLLL HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
NO_FRM LULHHHHLLLL LLLLLLLLLL LLLLLLLLLL LLLLLLLLLL 
BYT_CK LLLLLLLLLL LLLLLLHHHH HHHHLLLLLL LLLLLLLLLL 
BC2 HHHHHHHHHH HHHHHHLLLL LLLLLLLLLL LLLLLLLLLL 
BC1l LLLLLLLLHH HHHHHHLLLL LLLLLLLLLL LLLLLLLLLL 
BCO LLLLHHHHLL LLHHHHLLLL LLLLHHKHHHH HHHHHHHHHH 
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SUNK 
NO_FRM 
BYT_CK 
BC2 
BC1 
BCO 


SYNC 
Page : 


T1_CKB 
RSTB 
NO_T1CB 
DATA 
FPD1B 
FPD2B 
SOF 

Q2 

Ql 


4 
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g cg cg 
LHHLLHHLLH 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLHHHHLL 
HHHHHHHHHH 
LLLLLLLLLL 
LLLLLLHHHH 
HHHHHHHHHH 
LLHHHHLLLL 
HHHHHHHHHH 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
HHHHHHHHHH 


cg cg 
HHLLHHLLHH 


cg cg c 
HLLHHLLHHL 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLHHHH 
LLLLLLLLLL 
LLHHHHHHHH 
HHHHLLLLLL 
LLLLLLLLHH 
HHHHHHHHLL 
HHHHHHLLHH 
HHHHLLLLLL 
HHHHHHHHHH 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
HHHHHHHHHH 


cg cg c 
LLHHLLHHLH 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
LHHHHLLLLL 
HHHHHHHHHH 
LLLLLLLHHH 
LLLHHHHHHH 
HHHLLLLLLL 
HLLHHHHHHH 
LLLLLLLLLL 
HHHHHHHHHH 
LLLLLLLLLL 
LLLULLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
HHHHHHHHHH 


cg cg 
HHLLHHLLHH 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
HHHLLLLLLL 
LLLLLLLLLL 
HHHLLLLHHH 
LHHHHLLLLL 
HHHHHHHHHL 
LHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
HLLLLLLLLH 
LHHHHLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 


LLLLLLLLLL 
HHLLLLHHHH 
HHHHHHHHHH 
LLLLLLLLHH 
LLLLLLLLHH 
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cg cg cg 
LLHHLLHHLL 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLH 
LHHHHLLLLL 


HHHLLLLLLL 
HLLHHHHHHH 
LLLLLLLLLL 


HHHHHHHHHH 


LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 


HHHHHHHHLL 
LLLLLLLLLL 
LLLLLLHHHH 
HHLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
HHHHHHHHHH 
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SYNC 
Page : 5 
cg cc 
T1_CKB HLLHHLHHLH 
RSTB HHHHHHHHHH 
NO_T1CB HHHHHHHHHH 
DATA HHHHHHHHHH 
FPD1B LLLLLLLLLL 
FPD2B HHLLLLLLLL 
SOF HHHHHHHHHH 
Q2 LLLLHHHHHH 
Ql HHHHHHHLLL 
Qo LLLLLLLLLL 
BIT HHHHHHHHHH 
LDB LLLLLLLLLL 
SUNK HHHHHHHHHH 
NO_FRM LLLLLLLLLL 
BYT CK LLLLLLLLLL 
BC2 LLLLLLLLLL 
Bcl LLLLLLLLLL 
BCO HHHHHHHHHH 
SYNC 
Page : 6 
cg c 

T1_CKB HLLHHL 
RSTB HHHHHH 
NO_T1CB HHHHHH 
DATA LLLLLL 
FPD1B LLHHHH 
FPD2B LLLLLL 

SOF HHHHHH 

Q2 LLLLLL 

Ql LLLLHH 

Qo HHHHLL 

BIT LLHHLL 

LDB LLLLLL 
SUNK HHHHHH 
NO_FRM LLLLLL 

BYT CK LLLLLL 

BC2 LLLLLL 

Bcl LLLLLL 

BCO HHHHHH 


cg cgg 
HLLHHLLLHH 
HHHHHHHHHH 
HHHHHHHHHH 
HHHHHHLLLL 
LLHHHHHLLL 
LLLLLLLLLL 
HHHHHHHLLL 


HHHHHHLLLL 
LLLLLLLLLL 
HHHHLLLLLL 
LLLLHHHHHL 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
HHHHHHHHHH 


cg cg cg 
LLHHLLHHLL 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLLHHHHL 
LHHHHHHHHH 
LLLLLLLLLL 
HHHLLLLHHH 
HHHHHHHLLL 
LLLLLHHLLL 
LLLLLLLLLL 
LLLHHHHHHH 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
HHHHHHHHHH 
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c 6c US 
HHLHHLHHLH 
HHHHHHHHHH 
HHHHHHHHHH 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
HHHHHHHHHH 
LHHHHHHHHH 
HHHHLLLLLL 
LLLLLLLHHH 
LLLLLLLLLL 
LLLLLLLLLL 
HHHHHHHHHH 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
HHHHHHHHHH 
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TITLE SUPER_FRAME_ PAL 

PATTERN SUPER FRAME PAL FOR Tl INTERFACE 
REVISION P1.03 

AUTHOR STEVE PATTERSON AND THERESA SHAFER 
COMPANY 
DATE 6/4/87 


This PAL device counts the Tl Frames and controls the Signal Bits 
extraction process, including Fly Wheeling. It also provides variou 
other signals which indicate the frames with signal bits. 

The counter is reset with either RSTB or when frame detection is SUNK 
and frame 1 occurs from two different sources (FRM1 & SOF). 


me “oe te TO MO 


CHIP SUPER_FRAME PAL16R6 


; PINS 
31 2 3 4 5 6 7 8 9 10 
T1_CKB RSTB FRM1B SUNK SOF NC NC NC NC GND 
311 12 13 14 15 16 17 18 19 20 
OEB NC Q3 Q2 Q1 Q0 FRM_6 FRM_12 NC vec 2 | 
; INPUTS :T1_CKB ACTIVE LOW EXTERNAL T1 CLOCK 
; RSTB ACTIVE LOW MASTER RESET 
; SOF ACTIVE HIGH INPUT SIGNAL INDICATING 
; LAST KNOWN START OF FRAME 
; FRM1B ACTIVE LOW INPUT SIGNAL INDICATING 
: START OF FRAME 1 
: SUNK ACTIVE HIGH INPUT SIGNAL INDICATING "IN FRAME SYNC" 
; OEB ACTIVE LOW OUTPUT ENABLE INPUT 
; OUTPUTS: Q(3-0) STATE VARIABLES 
; FRM_6 CLOCK SIGNAL WHICH INDICATES SIGNAL BIT A 
; FRM_12 CLOCK SIGNAL WHICH INDICATES SIGNAL BIT B 
EQUATIONS 
/Q3 t= /Q2 * Ql * QO 
+ /Q3 * /Q2 
+ /Q3 * /QL 
+ /Q3 * /Q0 
+ /FRM1B * SOF * SUNK 7; RESET WHEN SUNK AND FRAME 1 OCCURS 
+ /RSTB 3; MASTER RESET 
/Q2 := Q2 * Ql * QO 
+ /Q2 * Q3 
+ /Q2 * /Q1 
+ /Q2 * /Q0 
+ /FRM1B * SOF * SUNK 3; RESET WHEN SUNK AND FRAME 1 OCCURS 
+ /RSTB ; MASTER RESET 
+ /RSTB ; MASTER RESET 
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/Q1l := Ql * QO 
+ /Ql * /Q0 
+ /FRM1B * SOF * SUNK 


/Q0 := QO 
+ /FRM1B * SOF * SUNK 


+ /RSTB 


; SIGNAL BIT EXTRACTION OUTPUTS 


/FRM_6 := Q3 + /Q2 + Ql + QO 
/FRM_12 := /Q3 + Q2 + /Q1 + QO 
SIMULATION 


TRACE_ON T1_CKB RSTB FRM1B SOF SUNK 
Q3 Q2 Ql QO FRM_6 FRM_12 


SETF /OEB 
/RSTB 
CLOCKF T1_CKB 
SETF RSTB /SOF FRM1B SUNK 
CLOCKF T1_CKB 


FOR I:=1 TO 24 DO 
BEGIN 
CLOCKF T1_CKB 
END 


SETF /SUNK SOF /FRM1B 
CLOCKF T1_CKB 

SETF /SUNK /SOF /FRM1B 
CLOCKF T1_CKB 

SETF /SUNK SOF FRM1B 
CLOCKF T1_CKB 

SETF /SUNK /SOF FRM1B 
CLOCKF T1_CKB 

SETF SUNK SOF /FRM1B 
CLOCKF T1_CKB 

SETF SUNK /SOF /FRM1B 
CLOCKF T1_CKB 

SETF SUNK /SOF FRM1B 
CLOCKF T1_CKB 

SETF SUNK SOF FRM1B 
CLOCKF T1_CKB 


TRACE_OFF 


=e te 


=e 


RESET WHEN SUNK AND FRAME 1 OCCURS 


RESET WHEN SUNK AND FRAME 1 OCCURS 
MASTER RESET 


ENABLE OUTPUT 
RESET REGISTERS 


COUNT 


RESET CONDITIONS 
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FPALASM SIMULATION, WS.25 - MARERET RELEASE (li-itv-h4é) 
(0) -— COPYRIGHT MONGLITHTIC MEMORIES INC, 19846 
FALASM SIMULATION SELECTIVE TRACE LISTING 


Title 
Pattern 
Revisian 


SURFER FRAME FAL Author : STEVE FATTERSON AND THER 
SUFFER FRAME FAL FOR Ti ICampany  : 
Pi. Os Date 2 6/4/87 


SUPER FRAME 
Fas Qe 5 1 


GQ COs so 16-8 A GE SS 
I CE ST Ea OE as et 
rie fe LEU UW UU 


RSTE 





XX 


-- -€ -- 


FRMIE ; 
SOF “XXXK 
x 


SUNK XXX to 
ryt mo 

Os fee ee eee es eed Saree eee 
Mi 8 

a2 er a re 


(24 
Qo 


FRM_6 


x 
: 
: 
: 
. 


FRM_12 
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Page 3 
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Ti CKA 
RSTE 
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(C) - COPYRIGHT MONOLITHIC MEMORIES INC, 1986 
PALASM SIMULATION SELECTIVE TRACE LISTING 


Author : STEVE PATTERSON AND THER 


Title : SUPER_FRAME_PAL 

Pattern : SUPER FRAME PAL FOR T1 ICompany : 
Revision : P1.03 Date : 6/4/87 
SUPER_FRAME 

Page 1 

g cg c cc eS s&s e ©. e 6. ¢ 

T1_CKB XHHLLHHLHH LHHLHHLHHL HHLHHLHHLH HLHHLHHLHH 
RSTB LLLLHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
FRM1B XXXXHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
SOF XXXXLLLLLL LLLLLLLLLL LLLLLLLLLL LLLLLLLLLL 
SUNK XXXXHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
Q3 XXLLLLLLLL LLLLLLLLLL LLLLLLLHHH HHHHHHHHHL 
Q2 XXLLLLLLLL LLLLLHHHHH HHHHHHHLLL LLLLLLLLLL 
Ql XXLLLLLLLH HHHHHLLLLL LHHHHHHLLL LLLHHHHHHL 
Q0 XXLLLLHHHL LLHHHLLLHH HLLLHHHLLL HHHLLLHHHL 
FRM_6 XXXXXXLLLL LLLLLLLLHH HLLLLLLLLL LLLLLLLLLL 
FRM_12 XXXXXXLLLL LLLLLLLLLL LLLLLLLLLL LLLLLLHHHL 
SUPER_FRAME 

Page : 2 

cS, ¢..¢ ec ¢ -:e cc ec C.c8) 61S 

T1_CKB LHHLHHLHHL HHLHHLHHLH HLHHLHHLHH LHHLHHLHHL 
RSTB HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
FRM1B HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
SOF LLLLLLLLLL LLLLLLLLLL LLLLLLLLLL LLLLLLLLLL 
SUNK HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
Q3 LLLLLLLLLL LLLLLLLLLL LLLHHHHHHH HHHHHLLLLL 
Q2 LLLLLLLLLL LHHHHHHHHH HHHLLLLLLL LLLLLLLLLL 
Ql LLLLLHHHHH HLLLLLLHHH HHHLLLLLLH HHHHHLLLLL 
QO LLHHHLLLHH HLLLHHHLLL HHHLLLHHHL LLHHHLLLHH 
FRM_6 LLLLLLLLLL LLLLHHHLLL LLLLLLLLLL LLLLLLLLLL 
FRM_12 LLLLLLLLLL LLLLLLLLLL LLLLLLLLLL LLHHHLLLLL 
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SUPER_FRAME 
Page : 3 
Gg cg cg 
T1_CKB LHHLLHHLLH 
RSTB HHHHHHHHHH 
FRM1B LLLLLLLLHH 
SOF HHHHLLLLHH 
SUNK LLLLLLLLLL 
Q3 LLLLLLLLLL 
Q2 LLLLLLLLLL 
Ql LLHHHHHHHH 
Qo HHLLLLHHHH 
FRM_6 LLLLULLLLLL 


FRM_12 LLLLLLLLLL 


cg cg c 
HLLHHLLHHL 
HHHHHHHHHH 
HHHHHHLLLL 
HHLLLLHHHH 
LLLLLLHHHH 
LLLLLLLLLL 
HHHHHHHHLL 
LLLLLLLLLL 
LLLLHHHHLL 
LLLLHHHHLL 
LLLLLLLLLL 


g cg cg 

LHHLLHHLLH 
HHHHHHHHHH 
LLLLHHHHHH 
LLLLLLLLHH 
HHHHHHHHHH 
LLLLLLLLLL 
LLLLLLLLLL 
LLLLLLHHHH 
LLHHHHLLLL 
LLLLLLLLLL 
LLLLLLLLLL 
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Principles of CRC-6 


In digital transmission, bits are grouped into frames. North 
America's standard, T-carrier uses a framing bit to group data 
into frames. In T1 extended-superframe (ESF or Fe), the frame- 
bit position carries additional information besides framing. One 
such extended functionality provides transmission line error 
detection. In transmission channels, most errors are bursty in 
nature; the cyclic-redundancy-checking (CRC) codes provide 
sufficient detection for these types of errors. With CRC, the 
block length and the polynomial! determine the type and percent- 
age of error which can be detected. For ESF, the block length is 
one extended-superframe or 4,632 bits. The T1 Fe standard 
specifies the CRC-6 polynomial. The CRC-6 = X5+X +1 
polynomial detects 98.4% of one or more errors. 


Functional Description 


Figure 1 shows a simple configuration of XOR-gates and 
registers which calculates the CRC-6 polynomial. 


Figure 1. CRC-6 Block Diagram 


The low-power PALC16R6Z implements the CRC-6 polynomial 
for a serial data stream (see Figure 1). At the beginning of CRC 
generation, the remainder is preset to all zeros by enabling the 
synchronous CLR. After shifting in the 4,632-bit superframe, the 
result is the CRC-6 remainder. This remainder is transmitted in 
the check-bit positions of the subsequent superframe. The 
remainder can be shifted out serially, most significant bit first or 


PAL® is a registered trademark of Monolithic Memories. 
ZPAL™ is a trademark of Monolithic Memories. 


output as a parallel word. The CHECK_BIT signal enables the 
remainder to be shifted out serially; it must be held HIGH until 
the entire 6-bit remainder has been shifted out. When CHECK_ 
BIT =0, CRC6 echoes the serial data input stream. CB1-CB6 are 
the parallel remainder bits where CB1 is the most significant bit 
and CB6 is the least significant bit. 


There are two implementations for error detection at the 
receiver. First, the transmitted CRC-6 value is fed serially along 
with the data. If anon-zero remainder is calculated, an errorhas 
occurred. !In this case, the ERROR flag indicates at least one 
transmission error. Second, the CRC-6 for the extended- 
superframe can be recalculated and externally compared with 
the transmitted CRC-6 value. 


CK vec 
DATA CRC6 
CLR CB6 
CHECK_BIT CB5 
EOX CB4 
NC CB3 
NC CB2 
NC cB 
NC ERROR 
GND OE 





Figure 2. CRC-6 Configuration 
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PAL® Device Design Specification 


TITLE CRC6 

PATTERN CRC 6 ERROR DETECTION PAL 
' REVISION 1.01 

AUTHOR THERESA SHAFER 

COMPANY MMI 

DATE 9/10/86 


CHIP CRC6 PAL16R6 


CK DATA /CLR CHECK BIT EOX NC NC NC NC GND 
/OE ERROR CBl CB2 CB3 CB4 CBS CB6 CRC6 VCC 


THE CRC-6 PAL PERFORMS ERROR DETECTION ON A SERIAL DATA 

STREAM. CRC-6 PAL SUPPORTS THE Tl Fe STANDARD FOR ERROR 
DETECTION. THE CRC RESULT CAN BE OUTPUT EITHER IN SERIAL 
OR IN PARALLEL. 


~e 


se te 


CRC-6 = .X**6 + X +1 


~e te te we 


INPUTS: CK EXTERNAL CLOCK 
/OE ACTIVE LOW OUTPUT ENABLE SIGNAL 
/CLR ACTIVE LOW CLEAR SIGNAL WHICH RESETS THE 
CRC 
DATA SERIAL DATA STREAM INPUT 


CHECK BIT ACTIVE HIGH SELECT INPUT WHICH SWITCHES 
FROM SERIAL DATA STREAM TO CHECK BITS 
EOX ACTIVE HIGH SIGNAL WHICH INDICATES END OF 
EXTENDED SUPERFRAME AND ENABLES THE 
ERROR DETECTION FLAG 


=e “8 ~e ~e 0 “8 O8 “ee Se “Oe 


OUTPUTS: CRC6 CRC SERIAL OUTPUT 
ERROR ACTIVE HIGH OUTPUT FLAG WHICH INDICATES 
A TRANSMISSION ERROR 
CBl - CB6 REMAINDER BITS (CHECK BITS) 
CBl ~- MOST SIGNIFICANT BIT 
CB6 - LEAST SIGNIFICANT BIT 


=e “oe “ce “0 Se ~e 


EQUATIONS 
; CRC-6 PAL 


/CRC6 = /CBl * CHECK BIT 
+ /DATA * /CHECK BIT 


/ERROR = EOX * /CBl * /CB2 * /CB3 * /CB4 * /CBS5 * /CB6 
+ /EOX 


/CB1 := /CB2 + CLR 


/CB2 := /CB3 + CLR 
/CB3 := /CB4 + CLR 
/CB4 := /CB5 + CLR 
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/CB5 := /DATA * /CBl * /CB6 
+ DATA * CBl * /CB6 
+ DATA * /CB1l * CB6 * /CHECK BIT 
+ /DATA * CBl * CB6 * /CHECK BIT 
+ /CB6 * CHECK BIT 
+ CLR 


/CB6 := DATA * CBl * /CHECK BIT 
+ /DATA * /CBl * /CHECK BIT 
+ /CBl * CHECK BIT 
+ CLR 


oeovneweoeeeeaeeeee ee eee eee e eee eee ee eee wee wm ese esos eeen eee eee eee ee 


=e te 


SIMULATION 


TRACE ON CK /OE /CLR CHECK _BIT EOX DATA CRC6 ERROR 
CB1l CB2 CB3 CB4 CB5 CB6 





SETF OE 
CLR 
CLOCKF CK 


ENABLE OUTPUT 
CLEAR SHIFT REGISTER 


° 
’ 
e 
4 


; INITIALIZE 
SETF DATA 
/CHECK BIT 
/EOX 
/CLR 
CLOCKF CK 


; FIND PERIOD 

SETF /DATA 

FOR J:= 0 TO 70 DO 
BEGIN 
CLOCKF CK 
END 


; CALULATE CRC-6 FOR 1110010101 
SETF CLR ; CLEAR SHIFT REGISTER 
. DATA 
CLOCKF CK 
SETF /CLR 
DATA 
CLOCKF CK 
SETF DATA 
CLOCKF CK 
SETF DATA 
CLOCKF CK 
SETF /DATA 
CLOCKF CK 
SETF /DATA 
CLOCKF CK 
SETF DATA 
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CLOCKF CK 
SETF /DATA 
CLOCKF CK 
SETF DATA 
CLOCKF CK 
SETF /DATA 
CLOCKF CK 
SETF DATA 
CLOCKF CK 


i CHECK ERROR DETECTION 

SETF EOX ; CHECK ERROR FLAG 
SETF /DATA 7 001001 
CLOCKF CK 

SETF /DATA 

CLOCKF CK 

SETF DATA 

CLOCKF CK 

SETF /DATA 

CLOCKF CK 

SETF /DATA 

CLOCKF CK 

SETF DATA 

CLOCKF CK 


; READ CRC RESULT 

SETF /EOX 

SETF /DATA 

CLOCKF CK 

SETF DATA 

CLOCKF CK 

SETF CHECK _BIT 
DATA 

FOR J:= 0 TO 6 DO 
BEGIN 
CLOCKF CK 
END 


TRACE_OFF 
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Introduction 


The Logic Cell Array (LCA device) implements a multiplexer and 
counter used in time division multiplexing. With the device's 
flexible I/O pins, the multiplexer and counter are implemented 
into a single CMOS device. This application note covers time 
division multiplexing and the design of a multiplexer and coun- 
ter. Additional information on how to design with an LCA device 
can be found in the LCA design methodology chapter in Mono- 
lithic Memories' LCA Design and Applications Handbook. For 
programming the LCA device, refer to "Configuring the LCA De- 
vice", Monolithic Memories’ Application Note 182. 


Principles of Multiplexing 


Multiplexing efficiently utilizes data communication lines by 
combining multiple low-speed signals into a single, high-speed 
line. The two methods of performing multiplexing are Time Divi- 
sion Multiplexing (TDM) and Frequency Division Multiplexing 
(FDM). TDM divides the transmission bandwidth into equal time 
slots where each input signal is assigned one time slot per time 
cycle. Once assigned, that time slot is not used by any other 
input. Figure 1 shows a multiplexer combining three low-speed 
signals into one high-speed line. Terminal A transmits during 
the first time slot, B transmits during the second time slot, and 
C transmits during the third time slot. This sequence is repeat- 
ed every time cycle. FDM, on the other hand, divides the fre- 
quency spectrum among logical channels where each channel 
has full bandwidth of its assigned frequencies. Analog sys- 
tems usually use FDM. 





Figure 1. Time Division Multiplexing 


There are many classes of TDMs including bit interleave, char- 
acter interleave, statistical TDM, and T1 multiplexers. In bit in- 
terleaved TDM, each input is assigned to one time slot. Each 
time slot's length is one bit. Character interleaved TDM is simi- 
lar to bit interleave, except that each time slot represents one 
character. Statistical multiplexing assigns the bandwidth into 
unequal slots where only the active incoming lines are as- 


signed slots. The slots are of variable length, so each input is 
assigned the amount of bandwidth needed. The T1 standard 
specifies twenty-four 64 kbps channels multiplexed on a 1.544 
Mbps high-speed link. 


Data Selection/Time Slot 
Assignment 


To transmit data from terminal A in the first time slot as shown 
in Figure 1, data buffering, rate adaption and data selection 
must be performed. Data buffering and rate adaption are re- 
quired since the rates of the lines to be multiplexed are slower 
than the high-speed link. A serializing FIFO or shift register can 
implement the required buffering. Rate adaption is performed 
by a clock/shift scheme tailored to the exact application. Once 
buffered, the data must be selected in the proper order. Sever- 
al methods can be used for data selection; one method is a 32- 
to-1 multiplexer. To implement sequential selection of input 
lines, an on-board counter selects the multiplexer's output. 
However, for random selection needed in statistical multiplex- 
ing, more flexibility is needed. A 2-to-1 multiplexer provides 
this flexibility by selecting between the counter and random 
selection inputs. 


Comparison of Design Methodologies 


The 32-to-1 multiplexer and counter design can be implement- 
ed in several ways. Briefly, we will evaluate discrete logic, PAL 
and LCA device implementations. 


For this design, thirty-eight inputs, five registers, and one out- 
put are required. Conventional PLD devices do not meet this 
large I/O requirement. A single PLD device is not suited for this 
specific application because flexible I/O and buried registers 
are not supported. Instead, the design must be divided into 
four sections. The counter fits in a PALC16R6Z device and the 
32-to-1 multiplexer requires three PALC20L8Z devices. 


Since a large 32-to-1 multiplexer is not available as a discrete 
part, it must be built using smaller multiplexers. The total chip 
count, using discrete logic, is five devices: two 74AS850s, two 
74ALS257s and one 74AS867. A combination of PAL devices 
and discrete logic devices is possible, however, it is also a 
multiple chip solution. 


Monolithic Memories’ LCA device is a high-density programma- 
ble CMOS circuit with flexible I/Os. It has forty pins which are 
user-defined as either inputs, outputs, or bidirectional. The 
LCA device meets the I/O requirement for this design. Since 
the LCA circuit allows multiple logic levels, implementing the 
counter does not use up output pins. Moreover an LCA device 
enables the 32-to-1 multiplexer with counter to be implemented 
in a single low-power device. 
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Detailed LCA Design 


The M2064 LCA device in a 48-pin DIP is used in this design, 
although both PLCC and PGA packages are available. Eight of 
the forty-eight pins are reserved for programming, power, and 
ground, and the remaining forty /O pins are available to the 
user. Table 1 shows the efficient use of the package pinout. 


| PIN | DESCRIPTION TOTAL PINS 


ADDR(4:0) | External Address Inputs 5 inputs 
External Clock Input 1 input 
Counter Load Input 1 input 

32 inputs 


32-to-1 Multiplexer 
Data Inputs. 
Output 


TOTAL I/O 40 pins 


1 output 





Table 1. 32-to-1 Multiplexer Pins 


Figure 2 shows the block diagram of the 32-to-1 multiplexer and 
the counter. The select lines of the 32-to-1 multiplexer are ei- 
ther the address lines or the output of the counter. When LD is 
deasserted, the 5-bit counter sequentially selects each input. 
Asserting the LD signal loads the counter with the external ad- 
dresses.It also selects the external address for the 32-to-1 
multiplexer's select lines. This permits random input selection 
from the external address lines which supports statistical mul- 
tiplexing. 


ADDR 





26 


5-BIT 
|_}, COUNTER 


OUT 


Figure 2. Block Diagram 


The select signals for the 32-to-1 multiplexer are determined by 
the LD signal. When LD = 0, the counter selects the multiplex- 
er's output. When LD = 1, the counter is loaded and the exter- 
nal address selects the output. Also, the LD asynchronously 
loads the 5-bit counter. Table 2 summarizes the device opera- 
tion. 


INPUTS OUTPUT FUNCTION 
cD ADORED 


0 XXXXX D(CNT) 
1 ADDR(4:0) D(ADDR) 














Counter as Select 
Address as Select 
and Load Counter 









Table 2. 32-to-1 Multiplexer Function Table 


The design was entered with FutureNet's DASH schematic cap- 
ture package using high-level macros. Then the schematic 
capture file was translated into an LCA design file. The transla- 
tion software, called PIN2LCA, partitions the design into CLBs 
and generates the equations for the CLBs’ configuration. The 
translation software produces an unrouted LCA design file. 
This design file requires final placement of CLBs and intercon- 
nect routing which was performed using Monolithic Memories’ 
APR software, an automatic place and route program. After the 
design is placed and routed, it was optimized with Monolithic 
Memories’ XACT Design Editor System. The XACT system pro- 
vides a graphic interface to manually optimize the CLB logic 
functions and connections. 


The logic functions and interconnects of an LCA device are es- 
tablished with CMOS memory cells, so the array is never physi- 
cally altered, although it is physically programmed. A program- 
mable LCA device can be reconfigured for the prototype. In ad- 
dition, it can be reprogrammed any number of times in the tar- 
get system. With reconfigurable devices such as this one, the 
array can also be programmed on power-up, or whenever the 
design details need to be changed. For volume production, a 
hard-array version will be available. 


Entering the Design with Futurenet 


The schematic entered in FutureNet's DASH is comprised of 
different components called from the macro library supplied by 
Monolithic Memories. The M8-1 and M4-1 macros are the multi- 
plexers which implement the 32-to-1 multiplexer as shown in 
Figure 3. The 5-bit counter was built from INV, C16BPRD, 
XOR2, and FDM macros. Since the C16BPRD macro is only a 
4-bit parallel-load binary counter, the XOR2 and FDM macros 
were added to form another bit which increased the length to 
five bits. The GMUX macro, a 2-to-1 multiplexer, selects be- 
tween the counter and the external address lines. 


To translate from FutureNet to LCA, the PIN2LCA program is in- 
voked. This program generates an unrouted <filename>.LCA 
from the FutureNet’s <filename>.PIN. The PIN2LCA software 
partitions the design into CLBs and generates each CLB's con- 
figuration. 


The PIN2LCA performs logic reduction by eliminating unused 
logic to maximize CLB utilization. With this design, for exam- 
ple, C1IGBPRD's reset logic is not used, and PIN2LCA elimi- 
nates all the reset logic in the C16BPRD macro. Another means 
to maximize CLB utilization is by combining macro logic. At 
times, this combined logic results in a single CLB which imple- 
ments logic from two different macros. For designs not utilizing 
all the CLBs, this logic reduction is not required and may actu- 
ally hinder placement and delay tradeoffs. If it does produce 
delay problems, it is possible to edit the CLBs in XACT. 
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At this stage in the design flow, the PIN2LCA software can 
generate a P-SILOS simulation file. Since the <filename>.SIM 
was generated from an unrouted LCA design file, the simulation 
only has unit delays and does not have actual routing delays. 
The simulation verifies the logic, but does not give any timing 
information. It may be necessary to make changes in DASH™ 
depending on the simulation results. 


Place and Route Using APR 


Once the logic is finalized, the LCA design must be placed and 
routed. Monolithic Memories' APR software performs the final 
placement of CLBs and interconnect routing. The input to APR 
is an unrouted <filename>.LCA, and a routed LCA design file 
and report document are the outputs. 


At first glance, the APR program seems cumbersome to apply 
to every application. However, it can be used to create an effi- 
cient design. While the software uses random placement, it 
does allow tailoring for specific requirements. 


Initially, the multiplexer and counter design did not route com- 
pletely. To achieve good placement and 100% routing comple- 
tion, several things were tried. Some worked well while others 
did not. An overview of how 100% routing completion was 
achieved is explained below. 


Since this design utilizes only 53% of the LCA device, delay is 
a larger concern than CLB utilization. In this case, placement 
is Critical to achieve an efficient design with the desired de- 
lays. The ideal placement would be for the 5-bit counter and 2- 
to-1 multiplexer to be placed near the ADDR(4:0), CK, and LD 
signals. The four M8-1 macros should be near the 8-bit cluster 
of inputs. The remaining M4-1 macro should be placed near the 
OUT signal. 

In FutureNet, the !/Os are assigned pin numbers so that the 
signals are clustered into groups. These I/O assignments re- 
strict the APR's placement of the signals and are found in the 
<filename>.SCP. Other restrictions and options are specified 
when invoking the APR software. The options, "-a", "-g", "-e", 
and "-k", tailor the placement for this specific design. 


The APR's "-a" option specifies the depth of CLB logic levels 
which are considered connected. In this design, "-a2" was se- 
lected because the C16BPRD and M8-1 macros are implement- 
ed in two CLB logic levels. 


The "-g" and "-e" options are related. The "-g" option specifies 
the number of CLBs that should be grouped together. The "-e" 
option determines the number of CLBs in a group which should 
be evaluated for all possible placements. Since the C16BPRD 
and M8-1 macros contain four to six CLBs, the options, "-e4" 
and "-g4", were used. This particular value was selected be- 
cause it was the largest possible value which maintained the 
e>=g relationship. Avoiding values where e<g insures as much 
as possible that the CLBs within each group are placed in the 
best possible arrangement. This maintains the groups’ integrity. 


The "-k" option specifies the number of shapes per group. By 
trial and error, "-k6" works well for this design. With "-k6", six of 
the best arrangements or shapes were saved for each group of 
CLBs. Every combination of shaped group is tried with all six 
other grouped shapes and evaluated for best placement. Gen- 
erally, the larger values result in better placements, however, 
run-time grows exponentially. While larger values of "-k" were 
tried, they did not improve the placement significantly. 


Running APR with these options and restrictions did not result 
in a 100% routed design. Therefore, more application-specific 
information was required. Noting that the APR software was not 
fully utilizing the high-level macro information, placement could 
still be optimized. The APR's report file shows that the CLBs 
which comprise C16BPRD or the M8-1 macros were not 
grouped together. Instead, the CLBs were randomly spread 
over the device. In a constraint file, <filename>.CST, CLBs 
from the high-level macros could be grouped together (see Ap- 
pendix A). If necessary, the exact CLB placement could be 
specified in the same file. 


To generate the constraint file, CLBs which implement each 
high-level macro were identified using the cross-reference file 
generated by PIN2LCA, <filename>.CRF. The <filename>.CRF 
file is organized into three cross-reference sections: macros, 
CLBs, and IOBs. 


The macro cross-reference section identifies and assigns a 
hierarchical symbol number to each macro. The symbol num- 
ber is used to generate the default CLB names. 


ASSIGNED ae 


SYMBOL NUMBER 


21-1, \M8-1.DWG 
Path: 


\USER\THERESA\TDM.DWG(32) 
Title: M8-1 


The CLB cross-reference section of the <filename>.CRF con- 
tains the CLB names. For macros with multiple CLBs, the sys- ° 
tem assigns default names. The default CLB name consists of 
two parts. The first is the macro symbol number as specified in 
the macro section and the second number is the signal name. 
By grouping CLBs with related symbol numbers, all the CLBs 
for a given macro will be placed together. 


CLB BD Name = '21-D03': 
F = signal '21-D03', contains: 
symbol '26-OR(2)', output signal = '21-D03' 
symbol '26-AND(3)', output signal = '26-S0' 
symbol '27-OR(2)', output signal = '21-D01' 
symbol '26-AND)1)', output signal = '26-S1' 


The IOB cross-reference section shows the [OB assignments. 
For this design, the assignments are those specified in the Fu- 
tureNet's schematic. 


IOB P30: Name = 'D13', Symbol = 'PIN(40)' 
1 = signal 'D13' 





al Monolithic FA) Memories a | 


2-437 





Time Division Multiplexing with the LCA Device 








Optimizing the Design with 
Monolithic Memories' XACT 
Design Editor System 


Monolithic Memories' XACT Design Editor System can increase 

resource utilization and performance by manually modifying 
the placement and routing. The XACT system provides a 
graphic interface to specify the CLB design. All CLB logic func- 
tions and connections can be optimized for the designs’ 
needs. With XACT, the designer can partition the design and 
optimize the placement of logic blocks to gain the utilization 
and performance required. 


The clock line routing was optimized to reduce clock skew. Us- 
ing the clock buffer resources, the common clock is driven by 
the global clock buffer. To optimize internal routing, "long lines” 
were used for signals with large fanouts such as the select 
lines for the multiplexers and the LD signal. Use of the "long 
lines" minimizes the delay for these control signals. The "long 
lines" were selected by manually routing the signal net through 
the programmable interconnect points (PIPs). 


On the device itself, implementing the 5-bit counter and 2-to-1 
multiplexers requires ten CLBs; and the 32-to-1 multiplexer re- 
quires twenty-four CLBs, giving a total of thirty-four CLBs. This 
design utilizes 53% of the LCA device. 


Summary and More Information 


For time division multiplexing systems, data selection can be 
implemented in a single CMOS device. FutureNet's DASH 
schematic capture with Monolithic Memories’ LCA macro library 
was used to implement a design in an LCA device. Placement 
of CLBs and signal routing were performed by Monolithic Mem- 
ories' APR software. Optimization, which may be required to 
improve performance, can be done using Monolithic Memories’ 
XACT Design Editor System. This is useful for placing and rout- 
ing critical signal nets such as clock or control signals. More 
information can be found in P-SILOS, FutureNet, and LCA user 
guides and handbooks. 


This design, developed with an LCA device, is available upon 
request. The FutureNet drawing, LCA design and bit pattern 
files can be provided for programming the LCA device in an EP- 
ROM. Please ask for design XDES07.LCA. 


Print World: THERESA.LCA (2064PD48-70), XACT 1.30, 10:15:46 JUL 31, 1987 
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Figure 3. FutureNet Schematic 
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Appendix A> Constraint File 


; CONSTRAINT FILE GROUPING MACROS IN BLOCKS 


DEFINE GROUP CNT_GROUP 
3-T3 3-T2 CKB 
C4 C2 C3 Cl CO 
M3 M2; 


DEFINE GROUP OUT_GROUP 
CLB1 45-D23; 


DEFINE GROUP M1 GROUP 
47-S0 
19-SO 19-S1 
13-D03 13-D47; 


DEFINE GROUP M2_GROUP 
47-S1 
15-SO 15-S1 
21-D03 21-D47; 


DEFINE GROUP M3_GROUP 
27-SO 27-S1 
23-SO 23-S1 
36-SO 36-S1; 


DEFINE GROUP M4 GROUP 
OUTM4 
42-SO 42-S1 
37-D47 37~-D67 
39-S0; 
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$ 

$ Simulation file for design ‘TDM.sim’, type ‘2064N48-50’ 
$ Created by PIN2LCA Ver. 1.01x at 10:58:00 JUL 14, 1987 

$ 

! INPUT TDM.sim 





GLOBALRESET- -CLK 050151 $ Initial pulse to reset latches 
CK.PAD .CLK 0 0 500 11000 0 .REP O 

LD .PAD . CLK 0 0 40000 1 

ADDRO .PAD . CLK 0 0 40000 0 41000 1 42000 O .REP 40000 

+ 780000 

ADDR1 .PAD . CLK 0 0 40000 0 42000 1 44000 0 .REP 40000 

+ 78000 1 

ADDR2 .PAD . CLK 0 0 40000 0 44000 1 48000 0 .REP 40000 

+ 780001 

ADDR3 . PAD . CLK 0 0 40000 0 48000 1 56000 O .REP 40000 

+ 780000 

ADDR4 .PAD . CLK 0 0 40000 0 56000 1 72000 0 .REP 40000 

+ 780001 

DO.PAD .CLKO1 10000 40000 0 41000 1 

D1.PAD .CLKOO 10001 2000 0 40000 1 41000 0 42000 1 
De .PAD -CLKOO 20001 3000 0 40000 1 42000 0 43000 1 
D3.PAD .CLKO0 30001 4000 0 40000 1 43000 0 44000 1 
D4.PAD -CLKOO 40001 5000 0 40000 1 44000 0 45000 1 
DS.PAD -CLKOO 50001 6000 0 40000 1 45000 0 46000 1 
D6.PAD -CLKOO 60001 7000 0 40000 1 46000 0 47000 1 
D7 .PAD -CLKOO 70001 8000 0 40000 1 47000 0 48000 1 
D8.PAD -CLKOO 80001 9000 0 40000 1 48000 0 49000 1 
D9.PAD -CLKO0O 9000 110000 0 40000 1 49000 0 50000 1 
D10.PAD . CLK 0 0 10000 1 11000 0 40000 1 50000 0 51000 1 
D11.PAD . CLK 0 0 11000 1 12000 0 40000 1 51000 0 52000 1 
Dl2.PAD . CLK 0 0 12000 1 13000 0 40000 1 52000 0 53000 1 
D13.PAD . CLK 0 0 13000 1 14000 0 40000 1 53000 0 54000 1 
D14.PAD . CLK 0 0 14000 1 15000 0 40000 1 54000 0 55000 1 
D15.PAD . CLK 0 0 15000 1 16000 0 40000 1 55000 0 56000 1 
D16.PAD . CLK 0 0 16000 1 17000 0 40000 1 56000 0 57000 1 
D17.PAD . CLK 0 0 17000 1 18000 0 40000 1 57000 0 58000 1 
D18.PAD . CLK 0 0 18000 1 19000 0 40000 1 58000 0 59000 1 
D19.PAD . CLK 0 0 19000 1 20000 0 40000 1 59000 0 60000 1 
D20.PAD . CLK 0 0 20000 1 21000 0 40000 1 60000 0 61000 1 
D21.PAD . CLK 0 0 21000 1 22000 0 40000 1 61000 0 62000 1 
D22.PAD . CLK 0 0 22000 1 23000 0 40000 1 62000 0 63000 1 73000 0 
D23.PAD . CLK 0 0 23000 1 24000 0 40000 1 63000 0 64000 1 
D24.PAD . CLK 0 0 24000 1 25000 0 40000 1 64000 0 65000 1 
D25.PAD . CLK 0 0 25000 1 26000 0 40000 1 65000 0 66000 1 
D26.PAD . CLK 0 0 26000 1 27000 0 40000 1 66000 0 67000 1 
D27.PAD . CLK 0 0 27000 1 28000 0 40000 1 67000 0 68000 1 
D28.PAD . CLK 0 0 28000 1 29000 0 40000 1 68000 0 69000 1 
D29.PAD . CLK 0 0 29000 1 30000 0 40000 1 698000 0 70000 1 
D30.PAD . CLK 0 0 80000 1 31000 0 40000 1 70000 0 71000 1 
D31.PAD . CLK 0 0 31000 1 32000 0 40000 1 71000 0 72000 1 


.MONITOR CK.PAD LD.PAD ; ADDR4.PAD ADDRS3.PAD ADDR2.PAD ADDR1.PAD ADDRO.PAD ; ; 
+ C4 C3 C02 C100 ; 
+DO.PAD D1.PAD D2.PAD D3.PAD D4.PAD DS5.PAD D6.PAD D7.PAD ; 
+ D8.PAD D9.PADD10.PADD11.PAD D12.PAD D13.PADD14.PADD15.PAD ; 
+D16.PAD D17.PAD D18.PAD D19.PAD D20.PAD D21.PAD D22.PAD D23.PAD ; 
+ D24.PAD D25.PAD D26.PAD D27.PAD D28.PAD D29.PAD D30.PAD D31.PAD ; OUT.PAD 


. TABLE CK.PAD LD.PAD ; ADDR4.PAD ADDR3.PAD ADDR2.PAD ADDR1.PAD ADDRO.PAD ; ; 
+0403 C2C1 C0; 
+ DO.PAD D1.PAD D2.PAD D3.PAD D4.PAD D5.PAD D6.PAD D7.PAD ; 
+ D8.PAD D9.PADD10.PAD D11.PADD12.PAD D13.PADD14.PADD15.PAD ; 
+ D16.PAD D17.PAD D18.PAD D19.PAD D20.PAD D21.PAD D22.PAD D23.PAD ; 
+ D24.PAD D25.PAD D26.PAD D27.PAD D28.PAD D29.PAD D30.PAD D31.PAD ; OUT.PAD 
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* P= S$ FLO Ss 1U.3 * OUTPUTS 10:26:22 O7=23787 
CL AAAAA CCCCC DDDDDDDD DDDDDDDD DDDDDDDD DDDDDDDD oO 
KD DDDDD 43210 01234567 89111111 11112222 22222233 U 
+. DDDDD weer ane --012345 67890123 45678901 T 
PP RRRRR PPPPPPPP PP..seee eoccccse eoacoree . 
AA 43210 AAAAAAAA AAPPPPPP PPPPPPPP PPPPPPPP P 
DD: esas « DDDDDDDD DDAAAAAA AAAAAAAA AAAAAAAA A 
PPPPP DDDDDD DDDDDDDD DDDDDDDD D 
AAAAA 
DDDDD 


TIME 
0 00 00000 00000 10000000 00000000 00000000 00000000 
500 10 00000 00000 10000000 00000000 00000000 00000000 
1000 00 00000 00000 01000000 00000000 00000000 00000000 
1500 10 00000 00001 01000000 00000000 00000000 00000000 
2000 00 00000 00001 00100000 00000000 00000000 00000000 
2500 10 00000 00010 00100000 00000000 00000000 00000000 
3000 00 00000 00010 00010000 00000000 00070000 00000000 
3500 10 00000 00011 00010000 00000000 00000000 00000000 
4000 00 00000 00011 00001000 00000000 00000000 00000000 
4500 10 00000 00100 00001000 00000000 00000000 00000000 
5000 00 00000 00100 00000100 00000000 00000000 00000000 
5500 10 00000 00101 00000100 00000000 00000000 00000000 
6000 00 00000 00101 00000010 00000000 00000000 00000000 
6500 10 00000 00110 00000010 00000000 00000000 o0000000 
7000 00 00000 00110 00000001 00000000 00000000 00000000 
7500 10 00000 00111 00000001 00000000 00000000 00000000 
8000 00 00000 00111 00000000 10000000 00000000 00000000 
8500 10 00000 01000 00000000 10000000 00000000 00000000 
9000 00 00000 01000 00000000 01000000 00000000 00000000 
9500 10 00000 01001 00000000 01000000 00000000 00000000 
10000 00 00000 01001 00000000 00100000 00000000 00000000 
10500 10 00000 01010 00000000 00100000 c0000000 00000000 
11000 00 00000 01010 00000000 00010000 00000000 00000000 
11500 10 00000 01011 00000000 00010000 00000000 00000000 
12000 00 00000 01011 00000000 00001000 00000000 00000000 
12500 10 00000 01100 00000000 00001000 00000000 00000000 
13000 00 00000 01100 00000000 00000100 00000000 00000000 
13500 10 00000 01101 00000000 00000100 00000000 00000000 
14000 00 00000 01101 00000000 00000010 00000000 00000000 
14500 10 00000 01110 00000000 00000010 00000000 00000000 
15000 00 00000 01110 00000000 00000001 00000000 00000000 
15500 10 00000 01111 00000000 00000001 00000000 00000000 
16000 00 00000 01111 00000000 00000000 10000000 00000000 
16500 10 00000 10000 00000000 00000000 10000000 00000000 
17000 00 00000 10000 00000000 00000000 01000000 00000000 
17500 10 00000 10001 00000000 00000000 01000000 00000000 
18000 00 00000 10001 00000000 00000000 00100000 00000000 
18500 10 00000 10010 00000000 00000000 00100000 00000000 
19000 00 00000 10010 00000000 00000000 00010000 00000000 
19500 10 00000 10011 00000000 00000000 00010000 00000000 
20000 00 00000 10011 00000000 00000000 00001000 00000000 
20500 00 00000 100112 00000000 00000000 00001000 00000000 
21000 00 00000 10100 00000000 00000000 00000100 00000000 
21500 10 00000 10101 00000000 00000000 00000100 00000000 
22000 00 00000 10101 00000000 00000000 00000010 00000000 
22500 10 00000 10110 00000000 00000000 00000010 00000000 
23000 00 00000 10110 00000000 00000000 00000001 00000000 
23500 10 00000 10111 00000000 00000000 00000001 00000000 
24000 00 00000 10111 00000000 00000000 00000000 10000000 
24500 10 00000 11000 00000000 00000000 00000000 10000000 
25000 00 00000 11000 00000000 00000000 00000000 01000000 
25500 10 00000 11001 00000000 00000000 00000000 01000000 
26000 00 00000 11001 00000000 00000000 00000000 00100000 
26500 10 00000 11010 00000000 00000000 00000000 00100000 
27000 00 00000 11010 00000000 00000000 00000000 00010000 
27500 10 00000 11011 00000000 00000000 00000000 00010000 
28000 00 00000 11011 00000000 00000000 00000000 00001000 
28500 10 90000 11100 00000000 00000000 00000000 00001000 
29000 00 00000 11100 00000000 00000000 00000000 00000100 
29300 10 00000 11101 00000000 00000000 00000000 00000100 
30000 00 00000 11101 00000000 00000000 00000000 00000010 
30500 10 00000 11110 00000000 00000000 00000000 00000010 
31000 00 00000 11110 00000000 00000000 00000000 00000001 
31500 10 00000 11111 00000000 00000000 00000000 00000001 
32000 00 00000 11111 00000000 00000000 00000000 60000000 
32500 10 00000 00000 00000000 00000000 00000000 00000000 
33000 00 00000 00000 00000000 00000000 O0000000 00000000 
39000 00 00000 00110 00000000 00000000 00000000 00000000 
39500 10 00000 00111 00000000 00000000 00000000 00000000 
40000 01 00000 00111 021112111 11121111 11111111 11111111 
40500 11 00000 01000 011112112 11111112 1121211111 11111111 
41000 01 00001 01000 10121111 11111111 11111122 11111111 
41500 11 00002 00001 101112111 12111111 11111111 11111111 
42000 01 00020 00001 110111212 11121111 111211111 11111111 
42500 211 00010 00010 11011111 11111112 112111111 11111111 


DCDODDDDDDODOPBRBRHBH PRP HPP HPP PRP PP RPP PHP PRP PRP RPP EP RPP RP PRP RP RP RP RP RP PP RP BP REP RPP Bp PEP PPP PP PEP PPP EYP 
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* P-SILOS 10.3 -* OUTPUTS 10:26:22 07-23-87 
CL AAAAA CCCCC DDDDDDDD DDDDDDDD DDDDDDDD DDDDDDDD 0 
KD DDDDD 43210 01234567 89111111 11112222 22222233 U 
.-. DDDDD weceeees 62012345 67890123 45678901 T 
PP RRRRR PPPPPPPP PP... 220 cece ceee ceeeeeee & 
AA 43210 AAAAAAAA AAPPPPPP PPPPPPPP PPPPPPPP P 
DD ..... DDDDDDDD DDAAAAAA AAAAAAAA AAAAAAAA A 
PPPPP DDDDDD DDDDDDDD DDDDDDDD D 
AAAAA 
DDDDD 


TIME 
43000 01 00011 00010 121021211 11112112 11211111 11111111 
43500 11 00011 00011 11101111 111211112 11111111 111211111 
44000 01 00100 00011 11110111 21111111 112111111 11111111 
44500 12 00100 00000 113210111 112111112 11111111 11111111 
45000 01 00101 00000 112111011 11111121 11111111 11111111 
45500 11 00101 002101 111120112 11111111 11111111 11111111 
46000 01 00110 00101 11111101 11112111 112111111 11111111 
46500 11 00110 00110 111111201 111111112 12111111 11111111 
47000 01 00111 00210 112111110 121111211 12211111 11111111 
47500 21 00111 00211 11111110 11111111 12121111 11111111 
48000 01 01000 00211 111111121 02111111 12211111 11111111 
48500 11 01000 021200 11111121 021111111 112211111 111211111 
49000 021 01001 021100 11111111 10111111 111221121 11111111 
49500 11 01001 01001 11111111 101112111 11211111 111211111 
50000 01 01010 01001 11111111 11011111 111221111 11111111 
50500 11 01010 01010 11111111 110112112 111221111 11111111 
51000 01 01011 01010 11111121 11101111 111112111 11111111 
51500 11 01011 01011 11111111 11101111 11211111 11111211 
52000 01 01100 01011 11111111 11110111 1121211211 11111111 
52500 11 01100 01000 11111111 111101112 11111111 111112111 
53000 01 01101 01000 21121111 111121011 111112111 211111111 
53500 11 01101 01201 211111211 12111011 11111111 11111111 
54000 O21 01110 01102 11111111 11111101 11111111 11111111 
54500 11 01110 01110 121211112 11212102 1111212111 11111111 
55000 01 01111 01110 11111111 111112110 111212111 11111111 
55500 11 01112 01111 111111112 11111110 1111121112 11111111 
56000 O01 10000 01111 111111112 11221111 01111111 11111111 
56500 11 10000 210100 111111112 211211111 01111111 111112111 
57000 01 10001 10100 11111112 11111111 10112111 11111111 
57500 11 10001 10001 111111112 11111112 10111111 11111111 
58000 01 10010 10001 11111112 111212112 211012111 11111111 
58500 11 10010 10010 111211112 1112121112 11011111 11111111 
59000 01 10011 10010 11111112 11111112 211102111 11111111 
59500 11 10011 10012 112111112 11111111 11102111 11111111 
60000 01 10100 10012 211211111 11111111 111201112 11111111 
60500 11 10100 10000 2112121121 11111112 11120111 11211111 
61000 01 10101 10000 12212111 111111121 111211011 11111111 
61500 11 10101 10102 111121112 11111111 121211011 11211111 
62000 01 10110 10101 11111111 111111121 12111101 11211111 
62500 11 10110 10120 11111221 11111111 112111101 11111111 
63000 02 10111 10110 11121111 11111111 11111110 111211111 
63500 11 101121 10121 11121111 12111111 11111110 111211111 
64000 01 11000 10111 111111211 12111111 11111111 01111111 
64500 11 11000 11100 11112111 12111111 11111211 01211111 
65000 01 11001 11100 11111111 11111111 11111111 10111111 
65500 12 11001 11001 11111111 12111211 11111111 101112111 
66000 02 11010 11001 111111211 111111211 11111111 110121111 
66500 12 11020 11010 11111211 11111111 12111111 11011111 
67000 O21 11011 11010 11111211 111121111 11111111 11101111 
67500 11 11012 112011 1111221211 11111211 111111131 11101111 
68000 01 12100 110211 11111111 12111111 111111121 11110111 
68500 11 11100 11000 11111111 1121111211 111112111 11110111 
69000 O01 11101 11000 11111111 12111111 11112111 11111011 
69500 11 21102 11101 11111111 111211111 11112121121 11111011 
70000 O21 112110 11101 11111111 1111211112 1121112111 11111101 
70500 12 11120 11110 1112211212 212212111 11121111 11111101 
71000 02 111212 11110 111212111 112111122 112111111 11121110 
71500 12 11111 11111 12121111 11211112 11111211 11111110 
72000 O21 00000 11111 11121112 111211212 112111211 111121111 
72500 11 00000 00100 11111112 11221111 121112111 12111111 
73000 O01 00001 00100 11111112 1112112121 111112101 1121212111 
73500 11 00001 00001 11111111 11111121 11111101 11111111 
74000 021 00010 00001 2121111212 11111111 11111101 11111111 
74500 11 00010 00010 11111121 111211111 11111101 11111111 
75000 O1 00011 00010 122111121 111111211 11111101 11111111 
75500 11 00011 00011 221121122 111111211 11111101 11111111 
76000 01 00100 00011 12111111 121111211 11111101 11111111 
76500 11 00100 00000 111111211 12111111 11112101 111211111 
77000 O1 00101 00000 11112111 1121111211 11112101 11111111 
77500 11 00101 00101 11112211 12111111 111112101 11111111 
78000 O21 10110 00101 11121111 12111111 11111101 11211111 
78500 11 10110 10110 11111111 11111111 11112101 111211111 
79000 01 10120 102120 11111111 121111211 111112101 111211111 
79500 12 10119 102110 11121111 11211111 11111101 11111111 
80000 O21 1012u 10110 1111212111 11111111 11111101 11111111 





(ok Ok Ok OF al ell oa oll el el el el oe ee ee OOO OO OR ORO nook ono Ronen ohoRhoRononoRonon ohononononoRononon on ono oon ono ono nono nonononono nono no nenononeonononekone) 
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Introduction 


In a Private Branch Exchange (PBX) or Central Office (CO), 
incoming digitized voice circuits are "switched” or routed to 
their appropriate destination. The “switching” takes place while 
the data is in serial or parallel format. An eight-bit format conver- 
ter circuit, widely used ina variety of PBX and CO architectures, 
translates the serial data streams into parallel data streams and 
vice-versa. 


The eight-bit format converter circuit requires a large number of 
register elements. These elements, namely serial shift registers, 
can be implemented efficiently in an LCA device due to its high 
register content and flexible structure. The LCA M2018 pro- 
grammable device contains one hundred configurable register 
elements, ninety-nine of which are used in the implementation 
of the converter circuit. This circuit is capable of running at 12.5 
MHz and 18.5 MHz in a 50-MHz and 70-MHz LCA device, 
respectively. 


Powerful software tools and circuit uniformity allowed the 
design to be laid out and simulated in just two days. However, if 
different support logic becomes necessary, PBX vendors are 
able to generate a new configuration program in the LCA re- 
programmable device. Since these devices are manufactured in 
a CMOS technology, a complete PBX system with several LCA 
devices can help keep the active power consumption down. 
(Please refer to the LCA Design and Applications Handbook- 
Reference 3 for basic information regarding structure and pro- 
gramming aspects.) 


MICROPROCESSOR BUS 






NETWORK 
SWITCHING 
MODULES 


PERIPHERAL 
EQUIPMENT. 
MODULES 


TERMINAL PHONE 


TOC.O., 
PBX, ETC. 


Overview of a PBX 


Central Offices (CO) and Private Branch Exchanges (PBX) pro- 
vide both telephone and data services. A PBX may be consi- 
dered as a localized telephone exchange serving the intercon- 
nection requirements of users in office environments. Outgoing 
calls are directed out of the PBX and routed through the local 
CO to the far destination. In general terms, a PBX or a CO 
consists of major blocks shown in Figure 1. Racks of peripheral 
equipment modules containing line cards provide access to 
telephone units, or to data adapter modules which provide data 
communication services. Trunk cards provide high-speed links 
to host computers, or other PBXs. Network switching modules 
interconnect calling parties to answering parties via digital mul- 
tiplexed links. Control modules containing a Central Processor 
Unit (CPU), mass storage and local memory, perform the 
“setup” and “tear-down” of each circuit connection, as well as 
the monitoring of PBX activities. Central resources are modules 
of hardware and software which operate in a time-shared 
manner. The central resources include facilities such as ringing 
tone generator and message recording modules. 


SYSTEM ACCESS 

TERMINAL 

CONTROL 
MODULES 


CENTRAL 
RESOURCES 


Figure 1. PBX Communication and Control Architecture 
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Hierarchy of Channel Multiplexing 
ina PBX 


The main function of switching modules is to interconnect cir- 
cuits between two or more parties such as in a conference call. 
A digital telephone unit, or a line card, contains a Coder/De- 
coder (CODEC) device. The CODEC device digitizes voice and 
transmits it into a Pulse Code Modulated (PCM) data stream on 
a common serial data highway at appropriate time intervals, 
which are assigned at the start of the call. 


Analog voice is sampled at 8 KHz and passes through an eight- 
bit analog-to-digital converter. The digitized sample (eight-bit 
binary word) is transmitted serially to produce a serial data 
stream of (8 x 8 = ) 64 Kbps. Thirty-two of these samples or 
CODECs are normally connected to a common highway operat- 
ing at (32 x 64 = ) 2.048 Mbps. Each CODEC is assigned “Serial 
Time Slots” of eight-bit duration, and sends its eight bits of data 
at the rate of 2.048 Mbps every 8 KHz. Another time slot is 
available on a separate highway for the receive data stream. 


Therefore, a 2.048 Mbps highway can support thirty-two chan- 
nels or voice connections in one direction. However, there are 
several ways to combine serial highways for additional voice 
connections. One way is to combine four highways into one 


high-speed highway of 8.192 Mbps. Further integration is possi- 
ble, but the aggregate data rate becomes high and leads to 
implementation problems including timing, signal reflection, 
and radiation, to name a few. 


Another way to integrate channels without increasing the clock 
rate is to convert eight serial data highways intoa single eight-bit 
parallel bus. With this method, the serial clock rate and the 
parallel clock rate are the same, 2.048 Mbps. This means that 
data throughput on the eight serial highways of 16.384 Mbps is 
maintained in the eight-bit parallel bus operating at 2.048 Mbps. 
This works by careful alignment of the incoming parallel time 
slots to serial time slots on the serial highways. An eight-bit 
format converter scheme, sometimes called a “Corner Bender”, 
is widely used in PBX or CO switching modules. Four of these 
parallel buses are combined (see Figure 2) to form a single 
parallel system bus that is thirty-two bits wide, yet still operates 
at 2.048 Mbps. The multiplexing architecture shown in Figure 2 
is used in the Harris 20-20™ Integrated Network Switch (see 
Reference 1). 

Space switching and/or time slot switching to interconnect dif- 


ferent parties is performed either in the serial highway format or 
in the parallel data stream format. 


SYSTEM PCM BUS 4MHz x 32 BITS 


= 
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Figure 2. Multiplexing Levels 
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Description of the 
“Corner Bender” Circuit 


The circuit consists of eight parallel-to-serial shift registers, and 
additional shift registers which provide proportional delays to 
synchronize the outputs. The principle of operation is shown in 
Figure 3, and the circuit is shown in Figure 4. Parallel data from 
the parallel bus is loaded with each clock into one of the eight 
shift registers in a rotating pattern. The data then shifts out of the 
shift register with each subsequent clock. After all eight registers 
have been loaded, the first register will be empty on the next 
clock. New data is loaded into this register at the same clock 
edge as the last bit shifts out to the next stage. Thus, a continu- 


PARALLEL TIME SLOTS 
1 





wo. ma < @ 


0123 4 5 6 


7,01 23 
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1 \ 2 
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Figure 3b. After Parallel to Serial Conversion 


: \ 


CL pex7 






a 


ous flow of data through the registers is insured. The data out of 
the eight shift registers is skewed in relation to each other as 
shown in Figure 3b. The various space slots on the highways 
need to be aligned to keep synchronization. This is achieved by 
proportionally delaying the outputs. Channel 1 output is delayed 
by seven clocks (using aseven-bit serial shift register), channel 2 
is delayed by six clocks, etc. Channel 8 output does not go 
through any additional delay. The final outputs of the delay 
registers are aligned and data is clocked out in a synchronous 
fashion. See Figure 3c. 


Figure 3a. Parallel Bus (8 Bit), 2.048 Mbyte/sec 
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Figure 3c. After Appropriate Time Delay to Synchronize 


Figure 3. Principles of “Corner Bending” 
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Figure 4 also shows the additional circuitry needed to control 
the shift registers. A preloadable three-bit counter keeps count 
of eight clock pulses. The parallel-to-serial bus conversion can 
be "programmed” to start in any register by setting the approp- 
riate binary value on the counter preload inputs and applying a 
pulse to the sync input. If the loading sequence produced by the 


ces load pulses to latch parallel data into the shift registers. 
Figure 5 shows the parallel-to-serial conversion data matrix. 


The description above shows the conversion of parallel data into 
eight streams of serial data. However, the same circuit also 
performs serial-to-parallel conversion. A serial eight-bit data 
stream on one of the eight inputs appears as an eight-bit parallel 


counter is not required, it can be disabled by connecting the 
“clock” to “sync” input. At each positive clock edge, the register 
loaded will depend upon the data at the counter inputs on the 
previous positive clock edge. The 3-to-8 decode circuit produ- 


word on the eight outputs. Successive parallel words appearing 
at the eight outputs correspond to the serial data on each of the 
eight inputs in rotation. See Figure 5. 
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Figure 4. 8-Bit Format Converter (“Corner Bender”) Circuit 
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PARALLEL TO SERIAL CONVERSION 


AO, BO, CO, DO, EO, FO, GO, HO = IN(I)O, IN(I)1, IN(1)2, IN(I)3, IN(I)4, IN()5, IN(I)6, IN()7 
A1, B1, C1, D1, E1, F1, G1, H1 = IN(I)O, IN(11)1, IN(I)2, IN(11)3, IN(1)4, IN(II)5, IN(II)6, IN(I1)7 


SERIAL TO PARALLEL CONVERSION 


AO, A1, A2, A3, A4, A5, A6, A7 = IN(I)0, IN(1)1, IN(I)2, IN(1)3, IN(1)4, IN(I)5, IN(1)6, IN(I)7 
BO, B1, B2, B3, B4, B5, B6, B7 = IN(II)0, IN(I)1, IN(I1)2, IN(11)3, IN(11)4, IN(II)5, IN(I)G, BN(I1)7 


* IN()X ARE THE INPUT 8-BIT WORDS 


Figure 5. Data Conversion 


Comparison of Implementation 
Techniques 


The "Corner Bender” circuit can be implemented in several 
ways. The efficiency of an LCA design can be compared to that 
of a design in SSI/MSI logic devices or Erasable Programmable 
Logic devices (EPLDs). Table 1 contains the comparison of 
package counts with the different alternatives. 


The circuit implemented in 74LSxx devices requires nineteen 
packages. Alternatively, sixteen simple PLD devices (such as a 
PAL20R8 or a PAL20R4) are required to implement the same 
circuit. Since the circuit is register intensive and not designed for 
high speed, PLDs are not chosen for this application. 


The functionally larger EPLD devices, EP1200 and EP 1800, are 
more register intensive and contain twenty-eight and fourty- 
eight register elements, respectively. Nevertheless, four EP1200 
devices, or two EP1800 plus a smaller PLD device, is required to 
implement this design. 


The entire Corner Bender circuit fits neatly into a single LCA 
2018 device. It uses ninety-nine out of the one hundred available 
internal macrocells and demonstrates efficient implementation 
of shift registers and small counters. An LSI device from Plessey 
(Reference 2) is available to implement the same circuit. How- 
ever, itis an NMOS device and limited to speeds of under 2 MHz. 











EQUIVALENT CIRCUIT IN SSI/MSI DEVICES PACKAGES| 
Paraliel in serial out shift Registers 74LS165 i eG? cil 


Serial in Parallel out shift Registers 74LS164 
74LS161 
74LS138 
74LS241 
EQUIVALENT CIRCUIT IN EPLDs 
a) EP1200 
b) EP1800 48 macro cells each 
plus PAL device e.g., 16R4 
EQUIVALENT CIRCUIT IN MMI LCA DEVICE 


LCA 2018 device 100 macro cell 


Table 1. Implementation Alternatives for the 8-Bit Format 
Converter Circuit “Corner Bender” 











4-bit counter 
3 to 8 decoder 


Three-state buffers 


28 macro cells each 
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“Corner Bender” Design in an 
LCA Device 


A count of the register elements in the circuit shows a need of 
sixty-four elements for the eight-shift registers, twenty-eight 
elements for the delay registers, three elements for the counter, 
and four elements for the 3-to-8 decode circuit. Each configura- 
ble logic block (CLB) in the LCA device can produce two out- 
puts. Hence, only four CLBs are required for the 3-to-8 decode 


XACT 1.30, 10:50:36 MAY 5, 1987 
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circuit. Since, each CLB contains one register element, the total 
count of CLBs is therefore ninety-nine. The LCA 2018 contains 
one hundred CLBs. Twenty-two input/output blocks (lOBs) 
were used. Figure 6 shows the layout of the design in the LCA 
device. The circuit fits into a 68-pin PLCC package. 


Print World: fce.lca (2018PC68=-70), XAC? 
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Figure 6. Layout Diagram of “Corner Bender” Circuit on LCA Device of Size 10x10 
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The Monolithic Memories’ XACT™ Design Editor is used to 
create the design by implementing the appropriate logic func- 
tions in CLBs and [OBs. An example of a CLB for one bit of the 
eight-bit shift register in this design is shown in Figure 7. InputA 
is the decoder input and it is ANDed with input B, the data to be 
shifted. To implement the eight-bit shift registers from the one- 
bit shift register, eight of the one-bit shift registers were linked 
together so that the output of each register became an input to 
the next register. This is shown in Figure 8 where each CLB 
represents one bit of the eight-bit shift register. 


The three-bit counter and the 3-to-8 decoder in this design were 
implemented in CLBs as shown in Figures 9 and 10. The counter 
is a synchronous binary counter with ripple carry and parallel 
load. The decoder is a standard 3-to-8 decoder. The outputs of 
the counter become inputs to the decoder, whereas the outputs 
of the decoder are used to decode the eight 8-bit shift registers. 


With the XACT Development System, the designer can optimally 
arrange the logic blocks on the LCA device in order to minimize 
net delays between each block. With this in mind, the layout for 
the design is described below: 


Four of the eight-bit parallel-to-serial shift registers were placed 
starting from the top left-hand edge of the LCA device (see 
Figure 7). The three-bit counter (three CLBs) and the 3-to-8 
decoder (four CLBs) were then placed on the following row. The 
next four rows contained the last four parallel-to-serial shift 
registers. This allowed the shift register select lines to have 
minimal delay spread when accessing all eight shift registers. 
The seven serial-to-serial shift registers were placed in the 
remaining CLBs as uniformly as possible. 













FQ 
FQ 
FF LATCH 
AF 
OF 
KCFNOT 





The optimum placement and distribution of configured blocks 
in the array is influenced by the performance needs of the 
system. Blocks placed in close proximity can use local intercon- 
nection resources which incur short signal propagation delays, 
whereas blocks placed further apart must use either “long lines” 
or other interconnection resources. Manual optimization using 
the delay efficient “long lines” was performed for the most criti- 
cal net connections. After routing completion, the longest delay 
between two clock pulses was the delay for the counter to 
change state, the state which is decoded via the 3-to-8 decoder 
and selects the appropriate shift register to load the parallel data 
(see Figure 7). This delay was 54 ns, 79 ns, and 106 ns, respec- 
tively for the 70-, 50-, and 33-MHz versions of the device. The 
delays were measured using the XACT simulation package by 
invoking the timing delay calculator. This translates to a maxi- 
mum circuit operating speed of about 18.5 MHz for the 70-MHz 
version of the device, or 9.43 MHz for the 33 MHz version. 


Although fabricated in a CMOS technology, the inputs to the 
LCA device can be made either TTL or CMOS compatible. For 
high fan-out CMOS or LS TTL-compatible loads, the output 
buffers of the LCA device are capable of driving 4mA. Moreover, 
each output buffer can be put into a HIGH-Z state for bus-driving 
applications. This features was also used in the design of the 
eight-bit format converter. 


More information about entering a design with the XACT Devel- 
opment System is included in the LCA Design and Applications 
Handbook (Reference 3). Information about configuring the 
LCA device is described in the "Configuring the LCA Device” 
applications note. 





CLKIN_ 





DEC8_ 


Figure 8. CLB Schematic Output for the 8-Bit Shift Register 
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F=Ax-~Bx-C 
G=Ax~BxC 





Figure 9a. Decode Outputs 1 and 2 of 3-to-8 Decoder Figure 9c. Decode Outputs 5 and 6 of 3-to-8 Decoder 
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Figure 9b. Decode Outputs 3 and 4 of 3-to-6 Decoder Figure 9d. Decode Outputs 7 and 8 of 3-to-8 Decoder 





Figure 9. CLB Configuration of a 3-to-8 Decoder 
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Figure 10a. 1st Bit of a 3-Bit Counter 


F=AxB+~Ax(C@Q) 










F=AxB+~Ax(C@Q) 






Figure 10b. 2nd Bit of a 3-Bit Counter 





Figure 10c. 3rd Bit of a 3-Bit Counter 


Figure 10. CLB Configuration of a 3-Bit Binary Counter with Ripple Carry 


Summary 


The LCA device has achieved a good solution to the principles 
used for multiplexing digitally-encoded voice channels in both 
serial and parallel hierarchies. These hierarchies can accom- 
modate thousands of voice circuits ina PBX switching module. 
The detailed design of the eight-bit (parallel-to-serial and serial- 
to-parallel) format converter circuit, or Corner Bender, deve- 
loped in Monolithic Memories’ LCA M2018 device was shown to 
be efficiently implemented. Despite the fast development of the 
system using the XACT Design Editor, the final design was 
programmed and bench tested to verify functional integrity. 


The Corner Bender design is available from Monolithic Memo- 
ries upon request. The bit pattern and .LCA file will be provided 
for programming the LCA device in an EPROM. Please ask for 
design XDESO5.LCA 
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AN-131E 
Functional Description Pin Description 
This application was first developed by LTT, Conflans Ste. V.TEST sete ss ieta cin Allows preload function for testing. 
Honorine, France. Part of the design, reprinted with courtesy of 2. SYSRESET ........ Reset tine from microprocessor. 
LTT, is used to control a serial data link based upon a specialized Bs AD nantes eat ated Address tine from address bus. 
LSI chip. AOA Sisch tanto teat ee Address line from address bus. 
Originally designed with six standard SSI/MSI circuits, this same 5. HDSHAKE «....... Handshake line (CTS/RTS). 
function can now be implemented, not only into a single GOK oc esessececeees External clock. 
PAL20RA10, but with even more features and better perfor- Fp Eek ataia cre ses Seve ated. Enable line from microprocessor. 
mance. The function can be divided into three sub-functions: 8. AUXDECOD ....... Extra decoding line 
. (e.g. board level decoding). 
1. Address Decoding > Dy. one Address line from address bus. 
2. Control Flags FOLDS oscacrtat shoes Address line from address bus. 
3. Transmission Speed Selection Wl ae bceauahaes bade Address line from address bus. 
Up to four address lines are allowed (eight were actually used), 12: (GND: oiendss vaueaas Reference power supply ground. 
plus two extra lines which are special decoding controls 13: JOB axseosilee ceieens Output enable line. 
(MEM/IO selection, Enable Control . . .). Two flip-flops load flag 14. AG. os tsadaca hades Address line from address bus. 
conditions, from the address bus (A1 and A2), providing hand- 15. SPEEDSEL ........ Speed selection line. 
shake between the 6850 UART and the communication lines. 16. DIV4 .............. MSB 4-bit synchronous counter. 
They have a common clock which also serves as Chip Select 1752DINS? estevcotis Laas 3rd stage synchronous counter. 
(CSO) for the UART. 183: DIN2 waits secaetied 2nd stage synchronous counter. 
The UART Transmit clock (TXCLK) can bedirectlyconnectedto J) OL vrrtrrrres PBT cin SEE Wg CcON 
the Receive Ciock (CK or RXCLK) or represents the Receive 34 BLOCREC eee, oe Bice Ras fina ; 
Clock value divided by sixteen. This function was performed by 52 DIRDIV ........... Dirsstor divided wlodk 
four “D” Flip-Flops connected as a 4-stage Asynchronous 53, TPH ............. Butera usta ; 
Divider. Since each basis cell ina PAL20RA10 has four Product Visage ged SEES 9- 
24: NGC. 3.004 be eeet 5 V power supply. 


Terms available, this function could be implemented either 
asynchronously or synchronously. In the PAL Design Specifica- 
tion example, a 4-bit synchronous divider was used instead of 
the asynchronous circuit shown in the schematic. 
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Figure 2. 
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Serial Data Link Controller 


Simulation Results 





Title Serial Data Link Controller Page: 1 
Pattern Link.pds G39 g99 SS99 G9 G95 G9 99 G9 Og 9g Og 
Revision A Al XXHHHHHHHH HHEHHHHHHH HHHHHHHHHH HHHHHHHHHH 
Author Jose Juntas / Kelvin Chow A2 XXHHHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
Company Monolithic Memories Inc., Santa Clara, Ca A3 XXHHHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
Date 3/1/85 A4 XXHHHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
AS XXHHHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
CHIP SE_CH_CNTRL PAL20RA10 AG XXLLLLLLLL LLLLLLLLLL LLLLLLLLLL LLLLLLLLLL 
E XXHHHHHHHH HHEHHHHEHH HHHHHHEHHH HHHHHHHHHH 
TEST SYSRESET A2 Al HDSHAKE CK E AUXDECOD A3 A4 AS GND AUXDECOD XXHHHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
/OE A6 SPEEDSEL DIV4 DIV3 DIV2 DIV1 CSO BLOCREC DIRDIV SYSRESET HHLLLLLLLL LLLLLLLLLL LLLLLLLLLL LLLLLLLLLL 
/TPH Vcc /TPH LLLLHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
HDSHAKE LLHHHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
EQUATIONS cso XXHHHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHEHH 
SPEEDSEL LLLLLLLLLL LLLLLLLLLL LLLLLLLLLL LLLLLLLLLL 
/TPH sm A2 #Load A2 as flag DIRDIV LLLLHHHHHH HHHHHHHHHH HHHHHHHHHH HHHHHHHHHH 
/ TPH. CLEF = CSO CLK W/ ADDR. decode CK XXXXXHHLHH LHHLHHLHHL HHLHHLHHLH HLHHLHHLHH 
/TPH.SETF = SYSRESET #global system reset DIV1 XLLLLLHHHL LLHHHLLLHH HLLLHHHLLL HHHLLLHHHL 
DIv2 XLLLLLLLLH HHHHHLLLLL LHHHHHHLLL LLLHHHRHHL 
DIRDIV t= Al zLoad speed ratio DIv3 XLLLLLLLLL LLLLLHHHHH HHHHHHHLLL LLLLLLLLLH 
DIRDIV.CLKF = cso #CLK W/ ADDR. decode DIVv4 XLLLLLLLLL LLLLLLLLLL LLLLLLLHHH HHHHHHHHHH 
DIRDIV.SETF = /HDSHAKE -?CLR by CTS/RTS line 
/BLOCREC = /DIRDIV ?Controlled by speed Page : 2 
+ HDSHAKE soption and CTS/RTS agggagg 
sline Al HHHHHHHHHH 
cso = /A6*A5*A4*A3 *AUXDECOD*E A2 HHHHHHHHHH 
SUART address valid A3 HHHHHHHHHH 
/DIVL t* DIV1 74-bit synchronous AG HHHHHHHHHH 
idivider LSB AS HHHHHHHHHH 
/DIV1.CLKF = CK 7CLK by CK(external) AG LLLLLLLLLL 
/DIVL.SETF = /DIRDIV 7CLR by speed option E HHHHHHHHHH 
AUXDECOD HHHHHHHHHH 
/DIv2 rw /DIV1*/DIV2 72ND stage of SYSRESET LLLLLLLLLL 
+ DIV1*DIV2 ;divider /TPH HHHHHHHHHH 
/DIV2.CLKF = CK 7CLK by CK(external) HDSHAKE HHHHHHHHHH 
/DIV2.SETF = /DIRDIV #CLR by speed option cso HHHHHHHHHH 
SPEEDSEL LLLLLLLLLL 
/DIV3 3:@ /DIV2*/DIV3 73RD stage of DIRDIV HHHHHHHHHH 
+ /DIV1*/DIV3 saivider CK LHHLHHLHHL 
+ DIVI*DIV2*DIV3 DIV1 LLHHHLLLHH 
/DIV3.CLKF = CK 7CLK by CK(external) DIV2 LLLLLHHHHH 
/DIV3.SETF = /DIRDIV 7;CLR by speed option DIV3 HHHHHHHHHH 
DIV4 HHHHHHHHEH 
/DIV4 := /DIV3*/DIV4 ;4TH stage of 
+ /DIV2*/DIV4 divider MSB 
+ /DIV1*/DIV4 
+ DIVI*DIV2*DIV3*DIV4 
/DIV4.CLKF = CK CLK by CK(external) 
/DIV4.SETF = /DIRDIV 7CLR by speed option 
SPEEDSEL s= /Al ;Load speed choice 
SPEEDSEL. CLKF = CSO 3CLK W/ ADDR. decode 
SPEEDSEL.SETF = /HDSHAKE ;CLR by CTS/RTS line 
SIMULATION 
TRACE ON Al,A2,A3,A4,A5,A6,E, 7Signals to be 
~  AUXDECOD, SYSRESET, /TPH, HDSHAKE, observed 
CSO, SPEEDSEL, DIRDIV, CK, 
DIV1,DIV2,DIV3,DIV4 
SETF SYSRESET, /HDSHAKE Reset all regs 
CHECK /SPEEDSEL, /DIRDIV, TPH 
SETF /SYSRESET,A1,A2,A3,A4,A5,/A6,HDSHAKE, ;Set decode 
E,AUXDECOD ;condition 
CHECK /SPEEDSEL, DIRDIV Check SPEEDSEL and 
;DIRDIV regs 
FOR I:=1 TO 15 DO 
BEGIN ?This portion 
SETF CK 7simulates divide 
sby four counter 
SETF /CK 
END 
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The ZPAL PALC20R8Z family consists of four devices: 
combinatorial PALC20L8Z, and registered PALC20R8Z, 
PALC20R6Z, and PALC20R4Z. These devices offer zero 
standby and very low active power dissipation, along with 
traditional architecture and functionality of standard PAL 
devices. These 24-pin devices offer high speed (TPD = 35 ns) 
and are ideal for applications in communications equipment 
and instrumentation, which are typically under severe power 
constraints. 


This note describes one such application in communications 
equipment for Quadrature Amplitude Modulation (QAM) 
encoding and provides a design based on a PALC20R8Z 
device. QAM encoding technique is used in standalone modem 
equipment which uses CMOS devices. Monolithic Memories' 
CMOS ZPAL devices became a natural choice for replacing 
glue logic or any complete function that is being implemented in 
discrete SSI components. 


QAM Encoder 


Data modems use encoders as a means of increasing data 
throughput within a given bandwidth of a channel. Quadrature 
Amplitude Modulation (QAM) gives an input data rate of four 
times the encoding rate, which is the output rate of the 
encoder. The encoding rate determines the channel bandwidth 
required by the output signal. Voice band data modems have 
become popular for use over the switched telephone network 
for interconnecting personal computers to distant computers. 
In these modems the data encoder function is usually 
integrated within the modem chip set. However, higher-speed 
modems for 20-MHz digital radio, satellite up/down links and 
other group band modems implement the encoder as a dis- 
crete hardware function due to the speed requirements. A 
PALC20R8Z device is ideally suited for implementing the data 
encoder function because of its ability to implement state 
machines efficiently and its low power consumption. 


The PAL Design Specification (QAM.PDS) file enclosed shows 


the QAM Encoder example implemented using the PALASM®2 
software state machine syntax. 


TO A AND B CHANNELS 
OF OSCILLOSCOPE 


IN-PHASE 


DATA (I) 
TRANSMIT | 
DATA 
DATA ENCODER 
STREAM scimitar  ecooen | 





QUADRATURE 
PHASE DATA (Q) 







Modem Encoding Techniques: 


Figure 1 shows the encoder function within a typical modem 
transmitter. Incoming data may be scrambled (to randomize 
the data) and then passed through the encoder. The output 
streams (I, Q) are passed through pulse-shaping filters before 
being multiplied by the sine and cosine carrier frequencies. 


Let us consider the QAM encoding technique as used in the 
voice band data modems. The scheme is also widely used in 
other high-speed applications. Some of the popular modem 
standards for the switched telephone network are Bell 212A 
(for 1200-bps operation), CCITT V.22 (for 1200-bps operation), 
CCITT V.22bis (for 2400-bps operation) and CCITT V.32 (for 
9600-bps operation). 

According to Bell 212A, CCITT V.22 and V.22bis recom- 
mendations, full duplex communication over a pair of tele- 
phone wires is achieved by separating the bandwidth occupied 
by the signals traveling in opposite directions. (see Figure 2). 





AMPLITUDE 





800 1600 2000 2800 
1200 Hz 2400 Hz 


FREQUENCY - Hz 

Figure 2. Full Duplex Communication by Bandwidth Separation 
The transmit carrier frequency of one modem is 1200 Hz while 
the far-end modem uses 2400 Hz as its transmit carrier fre- 


quency. The encoding rate (or symbol rate) for each carrier is 
600 baud (equivalent to transitions per second) and the band- 
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FILTER 
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Figure 1. Data Encoder in a Typical Modem Transmitter 
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width occupied by the outgoing signal spectrum is about 800 peta Riehl 


Hz, centered around the carrier. Since the two bands of trans- 
mission do not overlap, no interference takes place between 


the two channels of communications. x Bx 3 x x 
: : Batis she 10,11 10,01 11,10 44,11 
The transmit data throughput in each direction is increased 


from the 600 baud encoding rate by means of encoding two or 

four incoming data bits at a time. This produces a multilevel 

signal which is then used to modulate the carriers. The x x 1 x A x 
transmit data bit rate is then 1200 bits per second (bps) and 10,10 10,00 11,00 ee 


2400 bps respectively. IN PHASE 


() 
CCITT V.22bis and V.32 recommendation modems employ 


Quadrature Amplitude Modulation (QAM) technique for 
modulating the carrier frequency. The 16-state QAM signal 
constellation can be observed at the | and Q outputs (Figure 1) 
of the encoder. The diagram of Figure 3 is observed by 





connecting the | and Q outputs to the A and B channels of a x x -3 X oD x 

dual channel oscilloscope and turning the timebase to the X-Y 00,11 00,10 01,01 01,11 

setting. 

The data stream to be transmitted is divided into groups of four A,B, C, D ARE HALF RATE 

consecutive bits called quadbits. The first two bits are POINTS e.g., 1200 bps (2 
encoded as a phase quadrant change relative to the quadrant 

occupied by the preceding signal element. See Figure 3 and Figure 3. QAM Signal Pattern (YO, Y1, X2, X3) e.g., 2400 

Table 1. The last two bits of each quadbit define the amplitude bps (V.22bis) 


of the signal element in each phase quadrant. The left-hand 
bits in Table 1 are the first of each pair in the data stream as it 


enters the encoder after the data scrambler. FIRST TWO BITS 
Bell 212A and CCITT V.22 recommendation modems employ OF QUADBITS ORA 


the Differential Quadrature Phase Shift Keying (DQPSK) DIBIT (XO, X1) 
technique for modulating the carrier. The data stream to be 
transmitted is divided into groups of two consecutive bits 
called dibits. The dibits are encoded as a phase quadrant 
change relative to the quadrant occupied by the preceding 
signal element. The signal element corresponding to XX01 (YO, 
Y1, X2, X3) in the V.22bis signal constellation (Figure 3) is 
transmitted irrespective of the quadrant occupied. This ensu- 
res compatibility with the V.22bis modems operating in a fall- Table 1. Relative Quadrant Change for Input (X0, X1) 
back mode of 1200 bps. 
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SYMBOL 
CLOCK 






















x3 
DIGITAL-TO- 
INPUT x2 —>] 3] VERE bs 
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Figure 4. PALC20R8Z Device QAM Encoder 





at Monolithic rari Memories cl 2-457 


QAM Encoder in a ZPAL Device 








Modems in geostationary satellite links use phase encoding 
schemes such as Binary Phase Shift Keying (BPSK) or Dif- 
ferential Phase Shift Keying (DPSK). QAM techniques which 
require both phase and amplitude modulation, are traditionally 
used for deep-space satellite links, and digital radio communi- 
cation applications. 


PALC20R8Z-based QAM Encoder 


Figure 4 shows a PALC20R8Z device implementing a high- 
speed QAM encoder. Four bits of transmit data stream are 
input to the device every symbol clock. For a V.22bis modem, 
the symbol clock rate is 600 baud and the bit rate is 2400 bps. 
However the encoder device can run at several megahertz for 
use in deep-space satellite communications. The design 
utilizes all of the available outputs on the PALC20R8Z. QO and 
0 signals are held in logic high state. The extra inputs on the 
device may be used to control the start-up phase of the 
encoder. 







XO, X1 
00 


STATE SO 
(Y0,Y1) 
=0,0 


00 






STATE $3 
(YO, Y1) 






STATE S1 
(YO, Y1) 


Figure 5. Phase Quadrant State Diagram 


lt is assumed that a quadbit (XO, X1, X2, X3) of incoming 
transmit data is available every symbol clock. The first two 
bits, XO and X1, are used to compute the phase quadrant of 
the next output symbol. The quadrants are assigned states 
(represented by YO and Y1).and the state transitions (or the 
differential quadrant change) is described using a state 
diagram shown in Figure 5 and Table 1. The state transition 
outputs are latched at the positive edge of the first symbol 
clock. Equations for YO and Y1 could be derived using 
Karnaugh maps (Figure 6) and the reduced equations 
implemented. Alternatively, PALASM 2 software has a state 
machine syntax which can be used to input the state diagram 
directly (see the design file). The Minimize software module of 
PALASM 2 software is then used to reduce the equations. 








Karnaugh Map 
for Next /Y1 State 


Karnaugh Map 
for Next /YO State 


Figure 6. Karnaugh Maps for Next States of /YO, /Y1 


The last two bits, X2 and X3, are latched at the same time as 
the new states are computed (see design file). The output data 
values, | and Q, are combinatorial functions of YO, Y1, X2, and 
X3. These outputs are latched at the positive edge of the next 
symbol clock. Thus the computational delay through the 
encoder is two symbol clocks. If the processing delay is 
critical to the design, it could be reduced to one symbol clock 
by using a PAL20RA10 device because this device allows 
independent clocking of registers. The phase quadrant state 
could then be latched on the positive edge of the symbol clock 
and the output could be available on the negative edge of the 
clock. 





Figure 7. Photograph of QAM Pattern 
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The values +3, +1, -1, -3 of land Q (Figure 3) are represented 
by the three-bit binary words, 111, 101, 011, 001, (I2 11 10 or 
Q2 Q1 QO), respectively. These three-bit values are output 
every symbol. This number scheme allows the use of simple 
digital-to-analog conversion as required in the satellite 
applications. For these high-speed applications the pulse- 
shaping filtering and carrier-modulation functions are 
performed in analog domain using operational amplifiers and 
active components. 


The PALC20R8Z device has only eight registered outputs, 
whereas this design needs ten registered outputs (two for the 
phase quadrant state, and eight for the | and Q outputs). 
Fortunately the 10 and QO are always "1" (as defined above) 
and hence can be tied to this state at the input to the next 
stage. In this case the eight registers in the PALC20R8Z 
device are sufficient to implement the full encoder function. 


The design file shows an example of using the state machine 
syntax of PALASM 2 software. The QAM encoder is a standard 


Mealy type of state machine. However, to keep the number of 
product terms per output reasonable (less than eight for the 
PALC20R8Z device), it is implemented as a Moore state 
machine followed by the output decode function (see 
appendix). There is a pipeline delay between the new state 
values and their corresponding outputs. The differential 
quadrant change is implemented in the State section of the 
QAM.PDS file whereas output decoding is done in the 
Equations section of the same file. 


In the laboratory verification, the | (I0 to 12) and Q (QO to Q2) 
outputs were connected to two digital-to-analog converters. 
The analog outputs were then connected to the A and B 
channels of an oscilloscope and the X-Y pattern was observed 
(see Figure 7). For high data-rate operation (several MHz), the 
PALC20R8Z device encoder together with a data scrambler 
implemented in hardware facilitates a cost-effective solution 
for a modem transmitter. 
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TITLE 
PATTERN 
REVISION 
AUTHOR 
COMPANY 
DATE 


QAM Encoder in a ZPAL Device 


QAM_ENCODER 
QAM. PDS 


A 


RAJ PARIPATYADAR, M. GZOWSKI 
MONOLITHIC MEMORIES INC., 


5 


April 1987 


CHIP QAM PAL20R8 


CK X3 X2 X1 XO NC NC NC NC NC NC 
/OE NC X3A X2A Yl YO 12 I1 Q2 Ql 


STATE 


MOORE_MACHINE 


STATE 0 := CS1 
STATE 1 := CS1 
STATE 2 := CS1 
STATE 3 := CS1 


->S 
->S 
->S 
->S 


TATE 1 
TATE 2 
TATE 3 
TATE 0 


cs2 
cs2 
cs2 
cs2 


ett 


->STATE_0 + 
->STATE_1 + 
->STATE 2 + 
->STATE 3 + 


GND 
NC vcc 


CS3 
CS3 
CS3 
CS3 


SANTA CLARA 


->STATE_ 3 
~>STATE_0 
->STATE_1 
~>STATE 2 


; Equations for the next states represented by YO Yl 
; These outputs are latched on the positive edge of the 
; first symbol clock 


STATE_0 
STATE 1 
STATE 2 
STATE 3 


woud ti 


CONDITIONS 
cSl 
CS2 
CS3 
CcS4 


/X0 
/X0 
XO 
XO 


fou ue fl 


EQUATIONS 


; latch values 
/X3A := /X3 


/X2K += /X2 


~e te te te NO 


/XO 
/X0 
YO 
Yo 


+ + + 


of 


The following 
of the next symbol clock. 
encoder is available after the second symbol clock. 
The value of IO and QO are HIGH and therefore input to the next 
stage may be tied HIGH. 


+e + + 
— 
= 


X2 and X3 on the same symbol clock 


four outputs 


/T2 := STATE_O + STATE_1 
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+->STATE 2 
+->STATE 3 
+=->STATE_0 
+->STATE_1 


are latched on the positive edge 
Therefore the ouput of the whole 
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/I1l := STATE _0* X3A 
+ STATE _1* X2A 
+ STATE 2*/X3A 
+ STATE 3%*/X2A 
/Q2 := STATE_0 + STATE _3 
/Ql := STATE _O* X2A 
+ STATE_1*/X3A 
+ STATE 2*/X2A 
+ STATE 3* X3A 
SIMULATION 
TRACE _ON OE XO Xl X2 X3 xX2a X3a YO YL CK 
Il I2 Ql Q2 
SETF CK OE ;Preset polarity and enable output 2 
PRLDF X3A X2A /Yl /YO I2 Il Q2 Ql 
;check the state transitions from 
SETF /XO /Xl /X2  /X3  /CK 
;state YO=0 Y1l=0 with input 
CLOCKF ;X0=0 and X1=0 
CLOCKF ;X2=0 and X3=0 sets the smallest 
CLOCKF ;vector amplitude 
CLOCKF ;output available on the negative 
;edge of clock 
PRLDF X3A X2A /Y1l /YO I2 I1 Q2 Ql 
;check the state transitions from 
SETF XO Xl ;state YO=0O Y1=0 with inputs 
CLOCKF *;XO=1 and X1l=1 
CLOCKF 
CLOCKF 
CLOCKF 
PRLDF X3A X2A /Y1l /YO T2 Il Q2 Ql 
;check the state transitions from 
SETF XO /Xl ;state YO=0 Y1l=0 with inputs 
CLOCKF ;XO=1 and X1=0 
CLOCKF 
PRLDF X3A X2A Y1 /YO I2 I1 Q2 Ql 
;check the state transitions from 
CLOCKF ;state YO=0 Y1l=1 with inputs 
CLOCKF ;XO=1 and X1=0 
PRLDF X3A X2A /Y1 /YO I2 Il Q2 Ql 
;check the state transition from 
SETF /XO XL ;state YO=O Y1=0 with inputs 
CLOCKF ;XO=O and X1=1 
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PRLDF KOA. X2A. YI /YO L221. 02 O21 


CLOCKF 


;check the state 
;state YO=0 Yl=1 
7;X0O=0 and Xl=1 


PRLDF X3A X2A /Y¥l1 YO I2 Il Q2 Ql 


CLOCKF 


;check the state 
;state YO=1 Y1=0 
;X0=0 and Xl=1 


PRLDF X3A X2A Yl YO I2 I1 Q2 Ql 


CLOCKF 


scheck the state 
;state YO=1 Yl=1 
;X0O=0 and X1l=1 


PRLDF X3A X2A /Y1 /YO I2 Il Q2 Ql 
sIn state YO=0 Y1=0 


SETF /XO Xl /X2 /X3 


CLOCKF 
CLOCKF 


SETF /X2 X3 
CLOCKF 
CLOCKF 


SETF X2 /X3 
CLOCKF 
CLOCKF 


SETF X2 X3 
CLOCKF 
CLOCKF 


SETF /X2 /X3 
CLOCKF 
CLOCKF 


TRACE _OFF 


° 
, 


ON monotithic KH) memories &* 


transitions 
with inputs 


transitions 
with inputs 


transitions 
with inputs 


from 


from 


from 


;check the 4 vector amplitudes 


PAL Devices Implement the 


Full V.82 Gonvolution Encoder 
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The continuing drive for higher communication data rates is 
resulting in the development of data modems complying with 
CCITT specification V.32. This standard uses the Quadrature 
Amplitude Modulation (QAM) encoding scheme to achieve a 
data rate of 9600 bps while restricting the modulation rate to 
2400 Hz. However, this high rate of transmission is sensitive to 
random errors caused by Gaussian noise on the telephone 
lines. Forward error correction by means of convolution 
encoding and Viterbi decoding improves the bit-error rate 
performance in the presence of white Gaussian noise. The 
CCITT V.32 (see Reference 1) specification provides for the 
optional use of a convolution encoding scheme whenever the 
background noise is high. A PAL20X8A and a PAL20RS8 can 
implement this complex encoder function in a cost-effective 
manner, thereby providing a simple way of implementing the 
transmitter of a V.32 modem. This design, using PAL devices, 
provides a way of implementing a separate modem transmitter 
synchronized only to the transmit clock. The receiver of the 
modem can then be implemented on a separate digital signal 
processing chip synchronized to the receive clock. 


implementation of the 
V.32 Convolution Encoder 


Figure 1 shows the position of the encoder function within the 
simplified V.32 modem transmitter. Incoming data is scrambled 
to randomize the data and then passed through the encoder. 
The In-phase and Out-of-phase (I, Q) output data streams are 
passed through pulse-shaping filters before being multiplied with 
the sine and cosine carrier frequencies. If | and Q values are 
applied to the X and the Y axes of an oscilloscope with the time 


base turned to the X-Y setting, a signal constellation is 
observed. This signal pattern (photograph in Figure 8) indicates 
the modulation amplitude and the phase of the outgoing 2400 Hz 
carrier frequency. The standard signal constellation for the V.32 
is the QAM scheme and is shown in Figure 2. This scheme 
produces sixteen states or points on the signal constellation 
map. The diagram may be observed by connecting the | and Q 
Outputs of the encoder (Figure 1) to the A,B channels of an 
oscilloscope and turning the time base to the X-Y setting. The 
optional convolutional encoding scheme specified in the V.32 
standard produces thirty-two points in the signal constellation 
(Figure 3). 


The points in Figure 3 are closer to each other than the points in 
the straight QAM scheme depicted in Figure 2, and it may not be 
obvious why the second scheme should give better performance 
in the presence of noise. The convolution encoder, with its 
implicit memory, prevents adjacent (neighbor) points appearing 
in succession. The resulting distance between two successive 
points after convolution encoding is therefore bigger than the 
distance betwen any two points in Figure 2. This translates to a 
more accurate decision of the received state and hence better 
performance under high Gaussian noise. 


We now consider the implementation of this modem encoder 
with convolution encoding. Figure 4 shows the block diagram of 
the full V.32 encoder with convolution encoding. Figure 5 shows 
the implementation of this circuit in two PAL devices, a 
PAL20X8A and a PAL20RS8. The quadrant-change state 
machine and the convolution encoder are implemented in the 
PAL20X8A and the coordinate mapping of the 32-point signal 
constellation is achieved through the PAL20RS8 device. 


CONNECT TO A, B CHANNELS OF 








OSCILLOSCOPE FOR SIN (Wet) 
CONSTELLATION PATTERN | 
PULSE 
IN - PHASE 
DATA! SHAPING 60) 


TRANSMIT DATA V.32 
DATA ——! ccpamBLER CONVOLUTION 
STREAM ENCODER 


QUADRATURE 
PHASE DATA (Q) 







FILTER 





OUTPUT R 


SHAPING 
FILTER 






COSINE (Wet) 


Figure 1. V.32 Convolution Encoder in a Typical Modem Transmitter 
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QUADRATURE PHASE (Q) 


IN PHASE 
(I) 





A, B, C, D ARE HALF RATE 
POINTS e.g., 1200 bps 


Figure 2. QAM Signal Pattern (Y1,Y¥2,X3,X4) 


QUADRATURE 
PHASE (Q) 
X 4 xX 
11111 11000 
xX xX xX 
01000 00101 01010 
xX X 2 xX X 
10010 10101 10011 10100 
x xX xX X xX 
00000 01111 00010 01101 00011 


X X | xX x er 
-4 11001 ~-2 11110 11010, 2 #11101 4 


00111 01001 00110 01011 00100 
Xx X .2 Xx Xx 
10000 10111 10001 10110 
x Xx x 
01110 00001 01100 


X .4 xX 
11100 11011 


Figure 3. 32-Point Signal Pattern (Y0,Y1,Y2,X3,X4) 
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Y2n 
Ytn 


TABLE 2 
SIGNAL 
MAPPING 


yon 





INPUT DATA STREAM 


CONVOLUTION ENCODER 


PAL20RS8 
PAL20X8A <—_—_—___> 








Figure 4. Full V.32 Convolution Encoder (9600 bps) 
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Figure 5. V.32 Convolution Encoder 
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The Encoders in the 
PAL20X8A Device 


The scrambled data stream to be transmitted is divided into 
groups of four consecutive data bits. As shown in Figure 4, the 
first two bits arrive in time X1, X2 in each group are first 
‘differentially encoded into Y1 and Y2, according to Table 1. 


These state transitions are mapped on Karnaugh maps (Figure 
6), and the reduced equations are derived and implemented in 
the PAL20X8A design. The equation for Y1 turns out to be a 
single Exclusive-OR (XOR) function of the previous state of Y1 
and XO. The Karnaugh map for Y2 (Figure 6b) appears to 
indicate a fairly complex function. However, after a number of 
steps of juggling with the Boolean equations, the final equation 
can be expressed as an XOR function of two product terms. 
The PAL20X8 device offers an efficient implementation of this 
equation. 


The two differentially encoded bits, Y1 and Y2, are then used 
as input to a systematic convolution encoder, which generates 
a redundant bit YO. The convolution encoder circuit of Figure 4 
contains five XOR functions and this requires the use of a PAL 
device containing XOR elements. The XORs feed into each 
other. Combinatorial values DO and D3 are produced (see 
equations for pin 14 and 23 of the PAL20X8A). Additionally, 
registered values D1 and D2 are generated as intermediate 
delay values. Note that some of the XOR elements had to be 
multiplied out, such that the equations for D2 and YO have only 
1 XOR term in the equation. However, product terms feed the 
two inputs of the XOR gate. This arrangement uses the full 
functionality of the PAL2OX8A device. 


Y¥1(n-1), ¥2(n-1) 


XO, X1 





Figure 6a. Karnaugh Map for Y1(n) 





Table 1. Differential Encoding for V.32 Signal Pattern at 9600 bps 


Y1(n-1), ¥2(n-1) 





Figure 6b. Karnaugh Map for Y2(n) 
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Signal Mapping in PAL20ORS8 Device 


The extra bit YO and the four information carrying bits Y1, Y2, 
X2, and X3 are then mapped into the coordinates of the signal 
3lement to be transmitted according to the signal space 
diagram shown in Figure 3 and listed in Table 2. The coordinate 
values for | and Q are each represented as a 4-bit binary 
qumber. The values (13,12,11,10) 1100, 1011, 1010, 1001, 
1000, 0111, 0110, 0101, 0100 represent the magnitudes +4, 
+3, +2, +1, 0, -1, -2, -3, -4 respectively. The values for the Q 
bits have a similar representation. Karnaugh maps (Figure 7) 
are produced for each of the outputs 13 to 10 and Q3 to QO, and 
the reduced equations are implemented in the PAL device 
design. These eight output equations are dependent on only 
five outputs, YO, Y1, Y2, X2, and X3, but one of the outputs 
(l2) requires nine product terms. A standard PAL device (e.g., 
PAL20R8) has only eight product terms. A PAL device (such 
as PAL20RS8) which has sixteen product terms shared 
between pairs of outputs would be fine. The PAL20RS8 is 
ideally suited to this application because it allows the nine 
product terms required for calculating |2 to be allocated to this 
output (on pin 17) and the other seven product terms to be 
allocated to the neighboring signal I3 (on pin 18). 


Results 


The output of the first PAL20X8A is available after the first half of 
the symbol clock and the final output is available from the 
second PAL20RS8 after the second half of the symbol clock. 
Thus a delay of only one symbol clock is introduced by the full 
encoder. The two PAL device designs have been verified and 
the 32-state constellation of Figure 3 was observed on an 
oscilloscope. The encoding rate was 2400 baud, but the PAL 
device design will work at 10 MHz for applications that may 
require this speed. 
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Summary 
; : 0 1 1 0 3 -2 
The full V.32 encoder with convolution encoding has been 
implemented in two PAL devices as shown in Figure 5. Both the 0 1 1 1 = “2 
differential quadrant encoder state machine and the 1 0 0 0 1 4 
convolution encoder fit into the PAL20X8A and the signal co- 
ordinate mapping is achieved through the PAL20RS8. This PAL 1 0 0 1 73 0 
device design for the encoder provides a superior way of 1 0 1 0 1 
implementing a V.32 modem transmitter synchronized only to 
the transmit clock. 1 0 1 1 1 “4 
1 1 0 0 ~1 -4 
References 1 1 0 1 3 0 
1. CCITT V series of recommendations, Red Book (1984), Vol- i : ; 0 
ume VIII.1. 1 1 1 1 ~1 
Table 2. Signal State Mapping for V.32, 9600 bps 
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Figure 7. Karnaugh maps for outputs 10-13 and Q0-Q3 
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7V.32 trellis encoder PAL#1, 20X8 
;This PAL device performs the quadrant change based on the first 
stwo bits and the the convolution encoding to produce the output yO. 


‘ 


TITLE V32_1.PDS 
PATTERN V32_ENCODER 
REVISION A 

AUTHOR RAJ PARIPATYADAR 
COMPANY MMI, SANTA CLARA 
DATE APRIL 3 1987 


CHIP TREL1 PAL20X8 


;PINS BI 2 3 4 5 6 7 8 9 10 11 12 
CK X3 X2 = X21 x0 NC NC NC NC NC NC GND 
;PINS 13 14 15 16 17 18 19 20 21 22 23 24 


JOE /DO /D1l /D2 NC /YOn /Y1n /¥2n X2n X3n /D3 vec 


EQUATIONS 2 


;These equations implement the differential quadrant encoder 
zon the bits XO and X1. The outputs are Yin and Y2n. 





Yin s= Yln :+: XO 


Y2n := (XO *Y1in) 3+: (X1*/Y2n + /X1*Y2n) 
;These equations produce the extra output YOn. Intermediate values 


;DO0,D1,D2 AND D3 need to be calculated first. Dl and D2 are 
;registered outputs. 


D1 s= YOn 
D2 := (DO*/D1 + /DO*D1) 
t+: (Y1In*/Y2n + /Y1n*Y2n) 
Yon := Yin * YOn :+: D3 
DO = YOn*D2*/Y2n + YOn*/D2*Y2n 


D3 


D2*/Y2n + /D2*Y2n 


:The following equations are just registered storage of the bits 
7X2 and X3 


/X2an := /X2 ;internal node equation 


/X3n 2= /X3 internal node equation 
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SIMULATION 


TRACE_ON XO X1 Y2n Yin YOn CK DO D1 D2 D3 X2n X3n 





SETF OE ;Initialize Output-Enable 
PRLDF D1 D2 YOn /Yin /Y2n X2n X3n 
SETF XO /X1l /X2 = /X3 sstate Yln=0 Y2n=0 with input 
;PRELOAD VALUES ARE OPPOSITE 
7TO THOSE INTENDED, TO GET AROUND 
7A BUG IN PALASM2 V2.2 
;check the state transitions from 
CLOCKF ;X0O=1 and X1=0 
CLOCKF 7;X2=0 and X3=0 sets the smallest 
CLOCKF svector amplitude 
CLOCKF ;output available on the negative 
sedge of clock 
SETF XO X1 
PRLDF D1 D2 YOn /Yin /Y2n X2n X3n 
:Check the state transitions from 
CLOCKF ;state Yln=0 Y2n=0 with inputs 
CLOCKF 7;X0=1 and X1=1 
CLOCKF 
CLOCKF 
SETF /XO X11 
PRLDF D1 D2 YOn /Yin /Y2n X2n X3n 
;check the state transitions from 
CLOCKF ;state Yln=0 Y2n=0 with inputs 
CLOCKF 7;X0=0 and X1=1 
PRLDF D1 D2 YOn Yin /Y2n X2n X3n 
scheck the state transitions from 
CLOCKF ;state Yln=1 Y2n=0 with inputs 
CLOCKF ;X0=0 and X1=1 
SETF /X0 /X1 
PRLDF D1 D2 YOn /Yl1n /Y2n X2n X3n 
;check the state transition from 
CLOCKF ;state Y1ln=0 Y2n=0 with inputs 
7;X0=0 and X1=0 
PRLDF D1 D2 YOn /Yln Y2n X2n X3n 
;check the state transitions from 
CLOCKF ;state Y1ln=0 Y2n=1 with inputs 
7;X0=0 and X1=0 
2-470 NM monorithic HK memories &' 
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PRLDF D1 D2 YOn Yin /Y2n X2n X3n 
;check the state transitions from 
CLOCKF ;state Yin=1 Y2n=0 with inputs 
7X0=0 and X1=0 


PRLDF D1 D2 YOn Yin Y2n X2n X3n 
;check the state transitions from 
CLOCKF sstate Yln=1 Y2n=1 with inputs 
7X0=0 and X1=0 


TRACE_OFF 
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;V.32 trellis encoder PAL#2, 20RS8. 





‘This PAL performs the signal mapping onto the 32 


;state constellation according to CCITT v.32, 
ispecification. 

TITLE V32_ENCODER 

PATTERN V32_2.PDS 

REVISION A 

AUTHOR RAJ PARIPATYADAR 

COMPANY MMI, SANTA CLARA 

DATE APRIL 1 1987 


CHIP TREL2 PAL20RS8 
PINS 1 a 3 4 5 6 7 8 
CLK X3a X2a /Y2 /Y¥l /YO NC Ne 
*PINS 13 14 15 16 #17 #«218 «19 -~«20 
/OE INVSCK IO Il 12 3 QO Qi 
EQUATIONS 


I3 


/Y1*/Y2*X3a 
/XYO*®/Y1*X2a*/X3a 
/YO*RY24*/X2a 
Y1*/Y2*X2a 
YORY1*/Y24/X3a 
YORYL*Y2*/X2a*xX3a 
YO*/Y1*Y24%/X3a 


t++tttt+e 


I2 


i] 


/YO*/Y1*/X2a%/X3a 
/YO*/Y1*X2akX3a 
/YO*YL*Y2*X2a 
/YORY1*/Y2%/X2a 
Y1*/Y2*/X2akX3a 
YO*V1L*Y24*/X3a 
YO*V2*X2a*X3a 
YO*/Y1L*Y2*X3a 
YO*R/Y1*/Y2*/X3a 


t+ttttite 


Il = /YO*Y1 


YO*Y2 


+ ee 


Io 


Yo 


Q3 /YO*/Y1*/Y2*/X3a 
/Y1*/Y2*X2a 
/YO*RY2*/X2a*X3a 
/YORY1*Y2*X3a 
Y1*/Y2*/X3a 
YO*Y1*/X2a*X3a 
YO*RVY1L*Y2*X2a 


t++tttte 
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9600 bps 
9 10 11 12 
NC NC SCKIN GND 


21 22 23 24 
Q2 Q3 SCK vcc 


;IN-PHASE OUTPUT. 
;(I10--I3) 


;QUADRATURE OUTPUT 
7 (Q0-Q3) 
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+ 
Q2 r= 
+ 
+ 
+ 
+ 
+ 
+ 
+ 
Ql = 
+ 
+ 
QO := 
INVSCK = 
SCK = 
SIMULATION 


YO*R/Y1*Y24*/X2a 


/YU*/Y2*/X2a*X3a 
/YO*Y2%/X2a*/X3a 
/YORY2*X2a*/X3a 
/YL*Y2*X2a 
/YORY1*/Y2*X3a 
YO*V1*/X2a*/X3a 
YO*RY1*X2a4*X3a 
YO*/Y1*/Y2*/X2a 


/YOR/YL*Y2 
/YORYL*/Y2 
YO*/Y1 


/X0 


/SCKIN 


SCKIN 


TRACE ON YO Yl Y2 X2a X3a CLK 


I 


0 Il I2 13 QO 


SETF OE /CLK 


SETF 
CLOCKF 


/X0 
/X0 
/X0 
/X0 
/X0 
/X0 
/X0 
/X0 
/X0 
/X0 
/X0 


/YXO 


/Y1L /Y2 /X2A 
/YL /Y2 /X20 
/YL /Y¥2  X2A 
/Y1L /Y2 = X2A 
/Yl Y2 /X2A 
/Yl Y2 /X2A 
/Y¥l  Y2  X2A 
/YL  Y2  X2A 

Yl /Y2 /X2A 

Yl /Y2 /X2A 

Yl /Y2 X2A 


Yl /Y2 X2A 


Ql Q2 


/X3A 
X3A 
/X3A 
X3A 

/X3A 
X3A 

/X3A 
X3A 

/X3A 
X3A 

/X3A 


X3A 





;GENERATE /SYMBOL CLOCK 


;GENERATE SYMBOL CLOCK 
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SETF /YO Yl Y¥2 /X2A /X3A 


CLOCKF 

SETF /YO Yl Y¥2 /X2A X3A 
CLOCKF 

SETF /X¥O Yl Y¥2 X2A /X3A 
CLOCKF 

SETF /YO Yl ¥2 X2A  X3A 
CLOCKF 

SETF YO. SJ¥U SYS: JX2K: -7X3R 
CLOCKF 

SETF YO /Yl /Y¥2 /X2A  X3A 
CLOCKF 

SETF YO /YlL /Y¥2 X2A /X3A 
CLOCKF 

SETF YO /Y¥l /Y¥2 X2A  X3A 
CLOCKF 

SETF YO /Yl Y2 /X2A /X3A 
CLOCKF 

SETF YO /Yl Y¥2 /X2A  X3A 
CLOCKF 

SETF YO /Y¥l Y¥2 X2A /X3A 
CLOCKF 

SETF YO /Yl ¥2 X2A  X3A 
CLOCKF 

SETF YO Yl /Y¥2 /X2A /X3A 
CLOCKF 

SETF YO Yl /Y¥2 /X2A  X3A 
CLOCKF 

SETF YO Yl /¥2 X2A /X3A 
CLOCKF 

SETF YO Yl /Y2 X2A xX3A 
CLOCKF 

SETF YO Yl ¥2 /X2A /X3A 
CLOCKF 

SETF YO Yl ¥Y¥2 /X2A xX3A 
CLOCKF 

SETF YO Yl Y¥2 £X2A /X3A 
CLOCKF 

SETF YO Yl ¥2 X2A X3A 
CLOCKF 

TRACE OFF 
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Applications of 4B3T Line Coding 


Line coding devices are required in line terminals and line re- 
peaters to convert binary data into a proper coded format on the 
transmission lines and vice versa (see Figure 1). _ 


4B3T line coding is used for the Integrated Services Digital Net- 
work (ISDN) in Germany. Also, 4B3T is used on T-carrier trans- 
mission lines for two 24-channel DS-1 signals (forty-eight chan- 
nels). Because 4B3T coding decreases transmission 
requirements by twenty-five percent, transmission rates for two 
24-channel DS-1 signals are only fifty percent more than one 24- 
channel DS-1 signals. The 4B3T is used on the T148 transmis- 
sion line developed by ITT Telecommunication. 


Bipolar Codes 


Bipolar coding is a popular coding scheme over long transmis- 
sion lines. Bipolar codes use three different voltages (+5 V, 0 V, 
and -5 V) to represent logic one (1) and logic zero (0) (see Fig- 
ure 2a). Logic-one signals alternate between +5 V and -5 V to al- 
low the mean DC level to be integrated to zero volts. Therefore, 
the bipolar code can compensate for DC wander on the trans- 
mission lines. However, the bipolar code does not take advan- 
tage of using three logic levels. Thus, binary bipolar code is 

called a pseudo-ternary code. 


Ternary Codes 


Ternary code takes advantage of all three logic levels. Each bit 
of the ternary code represents a positive logic one (+), a logic 


LINE TERMINAL 


TRANSMISSION LINE 
LINE INTERFACE 
ENCODER DEVICE 


RECEPTION LINE 
LINE INTERFACE 


DECODER DEVICE 





zero (0), or a negative logic one (-) (see Figure 2b). These three 
logic levels are mapped to the three voltage levels. 


On the transmission lines, the power of three (3") codes are car- 
ried using the ternary code and the power of two (2") codes are 
carried using the binary code. For example, two bits of binary (0, 
1) codes only provide four combinations of data (00, 01, 10, and 
11). But two bits of ternary (-, 0, +) codes provide nine combina- 
tions of data (00, 0-, -0, --, -+, 0+, +0, +-, and ++). The ternary 
code is more efficient than the binary code because more infor- 
mation can be represented on the transmission lines. Three bi- 
polar bits are capable of representing eight codes. For ternary 
code, only two bits are required to represent eight codes as 
shown in Table 1. 


Comparison of Binary Codes and 
Ternary Codes 


4B3T is a ternary code which maps four continuous binary bits 
into three ternary bits. To transmit twenty binary data bits, twenty 
bipolar bits are needed. But with ternary code only fifteen bits 
are needed. A twenty-five percent saving on bits is made using 
4B3T line coding on the transmission lines. Without increasing 
the transmission rate, ternary coding carries more information 
than bipolar coding. Four binary bits on the computer system will 
run at the same rate as three ternary bits on the transmission 
lines using the 4B3T line coding. For example, a group of four bi- 
nary bits in a 10-MHz computer system. Substituted to a group 
of three ternary bits at 7.5 MHz on a transmission line. 


LINE REPEATER 


LINE TRANSMISSION 
INTERFACE LINE 
DEVICE DECODER 


BIPOLAR 
SIGNAL (NRZ) 


RECEPTION 
LINE 
ENCODER 


LINE 
INTERFACE 
DEVICE 


Figure 1. Line Coding Devices in a Line Terminal and a Line Repeater 


0 1 0 1 0 


Figure 2a. Bipolar Codes 


0 + 0 : 0 


Figure 2b. Ternary Codes 





at Monolithic ta Memories rm 


2-475 





PLD Devices Implement 4B3T Line Transcoder 





CODE 
NUMBER 








Table 1. Eight Codes Represented by Three Bipolar Bits or 
Two Ternary Bits 


4B3T Line Coding 


Four binary bits represent sixteen combinations of data, but 
three ternary bits represent twenty-seven combinations of data. 
Except for all zeros code, the remaining combinations of ternary 
codes can be categorized into three groups: zero, positive, and 
negative accumulated disparity. A zero-accumulated disparity is 
a set of ternary codes that have the same number of positive 
codes and negative codes, e.g., the accumulated disparity of 0-+ 
is zero (0). A positive accumulated disparity is a set of ternary 
codes that have more positive codes than negative codes, e.g., 
the accumulated disparity of the +-+ is positive (+1). A negative 
accumulated disparity is a set of ternary codes that have more 
negative codes than positive codes, e.g., the accumulated dis- 
parity of the 0-- is negative (-2). The 4B3T line coding with accu- 
mulated disparity is shown in Table 2. Six sets of four binary bits 
have zero-accumulated disparity. The remaining ten sets of four 
binary bits map into words with accumulated disparity. 


Four binary bits can be converted into any one of four possible 
ternary codes. This is normally represented by four columns, as 
shown in Table 3. Figure 3 is the state diagram of the next col- 
umn generation. Four possible columns are generated for next 
ternary codes, The column number of the next word depends on 





Figure 3. 4B3T Next Column Generation State Diagram 


the previous column and the current accumulated disparity. If 
the current disparity is zero, the number of the next column is the 
same as the previous one. If the current disparity is positive, the 
column number for the next word is found by adding the current 
disparity. Likewise, if the current disparity is negative, the num- 
ber for the the next column is found by subtracting the current 
disparity. In column 1, the next word may only have positive or 
zero disparity since the maximum of -3 negative disparity has 
been reached. Similarly, in column 4, the next word will have 
negative or zero disparity. 


3-BIT TERNARY WORD 
(ACCUMULATED DISPARITY) 


POSITIVE | ZERO |NEGATIVE 





Table 2. 4B3T Line Coding with Accumulated Disparity 


The 4B3T line coding is represented in the four different col- 
umns of Table 3 and shows that a 4-bit binary word can be con- 
verted to any of four different sets of a 3-bit ternary word. Each 
set of a 3-bit ternary word includes the substituted 3-bit ternary 
code and the column number for the next ternary word. 
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COLUMN 
4-BIT 
BINARY 3-BIT 
WORD TERNARY 
WORD 


0001 O-+ 


0100 - +0 
0010 +-0 
1011 +0- 





1001 +-+ 


0000 +O+ 











* Denotes next new column, 


4 





0110 -++ 2 “+t 
1010 $= 2 ++- 
1111 ++0 


COLUMN COLUMN COLUMN 
1 2 3 4 


3-BIT 
TERNARY 
WORD 


* 








a CC) 

nor [oro e| oo —* 

mo [+00 (af +00 fa] 00 ‘af io i 
| 





3 eo 
a1 os [sf 00 [| -00 je] -oo—is 
1100 +++ -t- 


Table 3. 4B3T Line Coding 


4B3T Line Encoder Implementation 


A 4B3T encoder converts binary data into two signals: a positive 
ternary signal and a negative ternary signal. These two signals 
generate a three-voltage level signal for the transmission line. 
The 4B3T encoder divides the incoming binary data stream into 
4-bit blocks. This 4-bit word is converted to a 3-bit ternary word. 
The 4-bit binary word and the number of the column generate a 
ternary word. 


Figure 4 is the block diagram of the 4B3T encoder. It consists of 
a 4-bit serial-in paralle!-out shift register, a 2-bit counter, a 6-bit 
input and an 8-bit output look-up table converter, and two 3-bit 
parallel-in serial-out shift registers. Two Programmable Array 
Logic (PAL) devices and one registered PROM device imple- 
ment the 4B3T encoder (see Figure 5). The first PALC16R6Z 
performs a 4-bit serial-in parallel-out shift register and a 2-bit 
counter. A 4-bit shift register (SR3, SR2, SR1, and SRO) con- 
verts the serial data into a 4-bit parallel word. A 2-bit counter pro- 
vides the code conversion's clock rate which is one-quarter the 
speed of the serial-in parallel-out shift register's clock rate. The 
63RS481A implements the 6-input and 8-output look-up table 
converter. The 4B3T encoding conversion table is shown in Fig- 
ure 6. The 4-bit binary word (B3, B2, B1, and BO) and 2-bit cur- 
rent column numbers (C1, CO) generate 8-bit output (PT2, PT1, 
PTO, NT2, NT1, NTO, NC1, and NCO). NC1 and NCO are the 
next column numbers. They are fed into the 63RS481A as cur- 
rent column numbers for the next operation. The remaining six 
outputs are divided into two groups. One group is the positive 
ternary signals (PT2, PT1, and PTO). The second group is the 


negative ternary signals (NT2, NT1, and NTO). The second 
PALC16R6Z provides two 3-bit parallel-in serial-out shift regis- 
ters at three-quarters external clock rate. PT2, PT1, and PTO are 
serialized to generate the positive ternary signal (P4B3T). Like- 
wise, NT2, NT1, and NTO are serialized to produce the negative 
ternary signal (N4B3T). 


4B3T Line Decoder Implementation 


A 4B3T decoder detects three continuous pairs of positive ter- 
nary signals (P4B3T) and negative ternary signals (N4B3T) and 
converts them to 4-bit binary words. Figure 7 is the block dia- 
gram of the 4B3T decoder. It includes a 4B3T decoding state 
machine and an NRZ data generation. 


A 5-bit state machine generates four inputs for the parallel-in se- 
rial-out shift register and a load signal. A PALC22V 10 is used for 
the 4B3T decoding state machine because it has enough prod- 
uct terms and flexible outputs. The PALC22V10's output has 
programmable register bypass and programmable output polari- 
ty features. Q4 generates shift/load signal for the 4-bit parallel-in 
serial-out shift register (HCT 195). This shift/load signal will be 
assertive low only at every three clock cycles. The remaining 
four register outputs (Q3, Q2, Q1, and Q0) are parallel inputs for 
the 4-bit parallel-in serial-out shift register (see Figure 8). This 
shift register clocks at four-thirds external clock rate. The decod- 
ing state machine diagram is shown in Figure 9. An error flag is 
generated when positive ternary signals and negative ternary 
signals both are high. The error flag also detects three continu- 
ous pairs of zero ternary signals. 
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P4B3T 









DECODING 
CONVERSION 
N4B3T 


3/4*CLK 


(LOAD/SHIFT) 


Figure 4. Block Diagram of 4B3T Line Encoding 


3/4* CLK 





vec 


NRZIN 
P4B3T 


63RS481A 


Figure 5. 4B3T Encoder Implemented by PLD Devices 
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4-BINARY CURRENT TERNARY NEXT 
CODE COLUMN CODE P4B3T and N4B3T COLUMN 


TO -T2 PTO - PT2 and NTO - NT2 


00 

01 

O-+ 001 and 010 10 
11 

00 00 

01 01 

0111 10 -O+ 001 and 100 10 
11 11 

00 00 

01 01 

0100 10 -+0 010 and 100 10 
11 11 

00 i 00 

01 01 

0010 10 +-0 100 and 010 10 
11 11 

00 00 

01 01 

1011 10 +0- 100 and 001 10 
11 "1 

00 

01 01 

1110 10 O+- 010 and 001 10 
11 1 









pe : 


00 101 and 010 01 
1001 01 101 and 010 10 
10 101 and 010 11 
11 000 and 111 00 
00 001 and 000 01 
0011 01 001 and 000 10 
10 001 and 000 11 
1 000 and 110 01 
00 010 and 000 01 
1101 01 010 and 000 10 
10 010 and 000 11 
11 000 and 101 01 
00 100 and 000 01 
1000 01 100 and 000 10 
10 100 and 000 11 
11 000 and 011 01 
00 011 and 100 01 
0110 01 011 and 100 10 
10 001 and 110 01 
1 001 and 110 10 
00 110 and 001 01 
1010 01 110 and 001 10 
10 100 and 011 01 
11 100 and 011 10 
00 110 and 000 10 
414 01 000 and 001 00 
10 000 and 001 01 
11 000 and 001 10 
00 101 and 000 10 
0000 01 000 and 010 00 
10 000 and 010 01 
11 000 and 010 10 
00 011 and 000 10 
0101 01 000 and 100 00 
10 000 and 100 01 
11 000 and 100 10 
00 111 and 000 11 
1100 01 010 and 101 00 
10 010 and 101 01 
11 010 and 101 10 














Figure 6. 4B3T Encoding Conversion Table 
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P4B3T 













NRZ CODE 











N4B3T DECODING GENERATION 
STATE AND 
CLK MACHINE 4-BIT SHIFT 
REGISTER 


Figure 7. Block Diagram of 4B3T Line Decoding 


E| 


MR 


P4B3T 


N4B3T 





Figure 8. 4B3T Decoder Implemented by a PAL Device and a Shift Register 
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P4B3T* 
N4B3T 


























S0 = 00000 
Q3, Q2, Q1, QO 
= 0000 
1Q4=1 
ERROR = 
P4B3T*NHDB3 
/P4B3T* /P4B3T* /P4B3T* 
IN4B3T IN4B3T IN4B3T 
$1 = 00011 $2 = 00111 S3 = 01011 
Q3, Q2, Q1, Q0 Q3, Q2, Q1, QO Q3, G2, Q1, QO 
= 0011 = 0111 = 1011 
1Q4=1 1Q4=1 1Q4=1 
ERROR =0 ERROR =0 ERROR =0 
/P4B3T* /P4B3T* /P4B3T* /P4B3T* /P4B3T* /P4B3T* /P4B3T* /P4B3T* /PAB3T* 
/N4B3T IN4B3T IN4B3T IN483T IN4B3T IN4B3T IN4B3T IN4B3T IN4B3T apace 
$4 = 01100 $5 = 01101 $6 = 01110 $7 = 60100 $8 = 00101 $9 = 00110 $10 = 01000 $11 = 01001 S12 = 01010 
Q3, Q2, Q1, QO Q3, Q2, Q1, Q0 Q3, Q2, Qt, Q0 Q3, G2, Q1, Q0 Q3, Q2, Q1, Q0 Q3, Q2, Q1, Q0 Q3, Q2, Q1, QO Q3, Q2, Q1, Q0 Q3, Q2, Q1, QO 
= 1100 = 1101 = 1110 = 0100 = 1101 = 0110 = 1000 = 1001 = 1011 
1Q4=1 1Q4=1 1Q4=1 1Q4=1% 1Q4=1 1Q4=1 1Q4=1 1Q4=1 1Q4=1 
ERROR =0 ERROR =0 ERROR =0 ERROR =0 ERROR =0 ERROR = 0 ERROR =0 





ERROR =0 


ERROR =0 


P4B3T*} /P4B3T*} P4B3T* | /P4B3T* /P4B3T* |P4B3T*| /P4B3T* | P4B3T* 


/P4B3T* | P4B3T* /P4B3T* | P4B3T* /P4B3T* 
N4B3T | /N4B3T /N4B3T | /N4B3T N4B3T /N4B3T} N4B3T | /N4B3T | N4B3T N4B3T |/N4B3T] N4B3T | /N4B3T 
/P4B3T* /P4B3T* | /P4B3T* | /PAB3T* /P4B3T* | /P4B3T* | P4B3T* |/P4B3T*| P4B3T* | /P4B3T* | /P4B3T* | /P4B3T* | /P4B3T* 


/N4B3T /N4B3T | N4B3T | N4B3T /N4B3T IN4B3T | /N4B3T | /N4B3T| /N4B3T | /N4B3T | /N4B3T | /N4B3T N4B3T 


P4B3T* 
IN4B3T 





Wire===' 








/P4B3T* /P4B3T* P4B3T* 
/N4B3T N4B3T /N4B3T 


Figure 9. 4B3T Decoding State Machine 





Pa Monolithic rari Memories \ 2-481 


PLD Devices Implement 4B3T Line Transcoder 





Title 4B3T ENCODER LOGIC1 
Pattern EN4B3T1.pds 
Revision A 

Author Cindy Lee 

Company Monolithic Memories 
Date 4/2/87 


CHIP EN4B3T1 PAL16R6 


;PIN1 PIN2 PIN3 PIN4 PINS PIN6 PIN7 PIN8 PINS PIN10 
CLK MR NRZIN NC NC NC Nc NC NC GND 


#PIN11 PIN12 PIN13 PIN14 PIN15 PIN16 PIN17 PIN18 PINi9 PIN20 
/OE CK4_LD CQO CQl BO Bl B2 B3 NC vcc 


INPUT SIGNALS 


‘ 

; CLK : EXTENAL CLOCK 

; MR : MASTER RESET 

; NRZIN + NON-RETURN-ZERO INPUT SIGNAL 
; /OE : ASSERTIVE LOW OUTPUT ENABLE 


OUTPUT SIGNALS 
CK4_LD : 1/4* EXTERNAL CLOCK AND LOAD SIGNAL 
CQO, CQl : 2-BIT COUNTER STATE VARIABLES 

BO, Bl, B2, B3 : FOUR PARALLEL BINARY DATA OUTPUTS 


me we Me MS 


EQUATIONS 
;THE 16R6 GENERATES: 


7(1) A 4-BIT SERIAL-IN PARALLEL-OUT SHIFT REGISTER 


/BO := /B1 ; LSB OF 4-BIT PARALLEL OUTPUTS 
+ MR ; MASTER RESET 

/B1 := /B2 
+ MR 

/B2 := /B3 
+ MR 

/B3 := /NRZIN ; MSB OF 4-BIT PARALLEL OUTPUTS 
+ MR 


7(2) A 1/4 CLOCK AND LOAD/SHIFT SIGNAL 


/CQ0 z= /CQ1 ; LSB OF 2-BIT COUNTER 
+ MR ; MASTER RESET 

/CQ1 := CQO ; MSB OF 2~BIT COUNTER 
+ MR 

/CK4_LD = CQO ; 1/4 * EXTERNAL CLOCK 
+ CQl ; AND A LOAD/SHIFT SIGNAL 
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SIMULATION 
TRACE_ON MR CLK NRZIN BO Bl B2 B3 CQO CQ1l CK4_LD 


SETF MR OE + RESET CONDITION 
CLOCKF CLK 


SETF NRZIN 
CLOCKF CLK 


DISABLE THE MASTER RESET 
SHIFT A POSITIVE NRZ DATA 


SETF /MR NRZIN 
CLOCKF CLK 


=e se 


SETF /NRZIN + SHIFT TWO NEGATIVE NRZ DATA 
CLOCKF CLK 


SETF /NRZIN 
CLOCKF CLK 
SETF NRZIN SHIFT TWO POSITIVE NRZ DATA 
CLOCKF CLK 


=e 





SETF NRZIN 
CLOCKF CLK 
SETF /NRZIN SHIFT A NEGATIVE NRZ DATA 
CLOCKF CLK 


=e 


SETF NRZIN SHIFT A POSITIVE NRZ DATA 


CLOCKF CLK 


=e 


SHIFT THREE NEGATIVE NRZ DATA 


we 


SETF /NRZIN 
CLOCKF CLK 


SETF /NRZIN 
CLOCKF CLK 


SETF /NRZIN 
CLOCKF CLK 
SETF NRZIN SHIFT A POSITIVE NRZ DATA 
CLOCKF CLK 


=e 


SETF /NRZIN SHIFT A NEGATIVE NRZ DATA 


CLOCKF CLK 


=e 


SETF NRZIN SHIFT THREE POSITIVE NRZ DATA 


CLOCKF CLK 


=e 


SETF NRZIN 
CLOCKF CLK 


SETF NRZIN 
CLOCKF CLK 


TRACE_OFF 
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PLEORS ; PROM 63RS481A PLE DESIGN SPECIFICATION 
TABLE. PLE CINDY LEE 3/15/87 

6-BIT TO 8-BIT TABLE CONVERTER 

MMI, SANTA CLARA, CA 


;PIN 8 7 6 5 4 3 
: AO Al A2 A3 A4 AS 
;ADD CO Cl B3 B2 Bl BO 


;PIN 9 10 112 #13 #14 #15 16 #17 
; Ql Q2 Q3 Q4 Q5 Q6 Q7 QB 
;DAT PTO PT1 PT2 NTO NT1 NT2 NC1 NCO 


PTO := /BO * /Bl * /B2 * /B3 * /Cl * /CO ; 0 0 0 0 0 0 
+ /BO * /Bl * B2 * /B3 ;0 0 1 0 xX X 
+ BO * /B1 * /Cl ; 1 0X X O xX 
+ BO * /B1l * Cl * /CO ; 121 0X X 1 ~«0 
+ BO * /B1 B2 * Cl * CO Pd. 0 DX A 
+ BO * Bl * /B2 * /B3 * /Cl * /CO ; 1 21 0 0 0 0 
+ BO * Bl* B2 * B3 * /Cl * /CO peal, ‘dh a AK a 20 
PT1 := Bl * /B2 * /B3 ;X 1 0 0 xX X 
+ /BO * Bl * /B2 * B3 * /Cl * /CO ; 0 1010 0 
+ /BO * Bl * B2 * /B3 * /Cl ; 0 1 21 00 X 
+ BO * /BL * B2 * /B3 * /Cl Po OO <2 OO. x 
+ BO * Bl * /B2 * B3 * /Cl 7; 1 1 031 0X 
+ BO * Bl * /B2 * B3 * Cl * /CO ; 2 212 021 21 «0 
+ BO * Bl * B2 * /B3 so Be 2 30 KX 
+ BO * Bl * B2 * B3 * /Cl * /CO PoP 4 od. 4-8 0 
PT2 := /BO * /B1l * /B2 * /B3 * /C1 * /CO ; 0 0 0 0 0 0 
+ /BO * /B1l * /B2 * B3 70 © O 2 xX xX 
+ /BO * /Bl * B2 * B3 * /Cl ;0 02 1 0 X 
+ /BO * /Bl1 * B2 * B3 * Cl * /CO ;0 021 1 1 ~°0 
+ /BO * Bl * /B2 * B3 * /Cl * /CO ; 0 1021 0 0 
+ /BO * Bl * B2 ;0 1212 *% X X 
+ BO * /Bl * /B2 * B3 * /Cl ; 1 0010 XxX 
+ BO * /B1 * /B2 * B3 * Cl * /CO ; 12 001 1 «0 
+ BO * Bl * /B2 * /B3 * /Cl * /CO >; 12 1 000 0 
NTO := /BO * /B1 * B2* B3 * C1 * CO 70 0321111 
+ /BO * Bl * /B2 * /B3 ; 0 1 00 xX xX 
+ /BO * Bl * /B2 * B3 * /C1l * CO ; 0 101021 
+ /BO * Bl * /B2 * B3 * Cl ; 0 1 0121 =X 
+ /BO * Bl * B2 Po 2. xX xX -X 
+ BO * /B2 * B3 * Cl * CO 712 X¥ 0 1121 1 
+ BO * Bl * /B2 * /B3 * Cl $0 Ee, Oy: 40: Ae 
+ BO * Bl * /B2 * /B3 * /Cl * CO 71 10001 
NT1 := /BO * /Bl * /B2 * /B3 * /Cl * CO ; 0 0 0 0 0 1 
+ /BO * /B1l * /B2 * /B3 * Cl ; 0 0 0 0 1 X 
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+ /Bl * /B2 * B3 2X 0 GO 4 eX 
+ /BO * /B1 * B2 * /B3 > 0 01 0 xX x 
+ /BO * /Bl * B2* B3* Cl* CO ;00%21314i121 
+ /BO * Bl * B2 * /B3 * Cl ;0 21 2101 X 
+ BO * /B1 * /B2 * /B3 * Cl * CO ;1 00011 
+ BO * /BL * B2 * /B3 * Cl P20) 2-0 1X 
NT2 := BO * /Bl * /B2 * cl. * CO 3:2 © 0 xX IT 2 
+ BO * /Bl * B2 ;1 0321 XxX X X 
+ BO * Bl * /B2 * /B3 * /Cl * CO 71 100021 
+ BO * Bl * /B2 * /B3 * Cl 2 ea 0 Oy ok 
+ BO * Bl * /B2 * B3* Cl* CO ;110412121 
+ BO * Bl * B2 * /B3 ;1 1212 0x X 
+ BO * Bl* B2* B3* /ClL * CO 713121212021 
+ BO * BL * B2 * B3 C1 ;1 2121211 -X 
NC1 := /BO * /Bl * /B2 * B3 * C1 ;0 0 0 11 0 
+ /BO * /BL * B2 * /B3 * Cl ;0 021 01 X 
+ /BO * Bl * /B2 * /B3 * Cl ;0 1001 X 
+ /BO * Bl * B2* B3* Cl 0! ie i a EX 
+ BO * /Bl * B2* B3* Cl *2 0 2 2 2b xX 
+ BO * Bl * B2 * /B3 * Cl ;1 121201 X 
+ /BO * /Bl * /B2 * /B3 * /Cl1 * /CO ;0 0 0 0 0 0 
+ /BO * /Bl * /B2 * /B3 * Cl * CO ;0 00011 
+ /BO * /BL * B2* B3* /Cl * CO ;0 011021 
+ /BO * /Bl * B2* B3 * Cl * /CO ;0 0121 11 0 
+ /BO * Bl * /B2 * B3 * /Cl* /CO 70 1 0 1 0 0 
+ /BO * Bl * /B2 * B3* Cl* CO 7010321121 
+ /BO * Bl * B2 * /B3 * CO ;0121 0X1 
+ BO * /B1 * /B2 * /B3 * Cl * /CO ;1 00 01 0 
+ BO * /B1 * /B2 * /B3 * /Cl * CO ;1 00001 
+ BO * /B2 * B3 * /Cl* CO ;1 X 0101 
+ BO * /B2 * B3* Cl * /CO ;1 X 0 1 1 «0 
+ BO * /Bl * B2 * /B3 * CO 71 01 0xX i 
+ BO * Bl * /B2 * /B3 * /Cl * /CO ;1 1 0 00 0 
+ BO * Bl * /B2 * /B3 * Cl * CO 71 10011 
+ BO * Bl * B2 * B3* /Cl* /CO ;1 1 21 21 0 0 
+ BO * BL * B2* B3* Cl* CO 71211312121 
NCO := /BO * /B1l * /B2 * B3 * CO 70001 xX 1i1 
+ /BO * /B1 * B2 * /B3 * CO ;0 01 0xX 1 
+ /BO * Bl * /B2 * /B3 * CO ;012100xX 1 
+ /BO * Bl * B2 * B3 * cO ;01%i21 1X1 
+ BO * /BL * B2 * B3 * cO :;1 0311x121 
+ BO * Bl * B2 * /B3 * cO 3;121310Xii 
+ /BO * /Bl * /B2 * /B3 * Cl * /CO ;0 0 0 01 O 
+ /BO * /BL * B2 * B3 */cO ;0 01 21 X O 
+ /BO * /Bl * B2* B3* Cl* CO +0 021%11 1 
+ /BO * Bl * /B2 * B3* Cl * /CO ;0 1 0 1 1 O 
+ /BO * Bl * B2 * /B3 */CO ;0 1 1 0 xX O 
+ BO * /Bl * /B2 * /B3 x /CO +;1 000 xX 0 
+ BO * /B1 * /B2 * /B3 * Cl ; 1 0001 X 
+ BO * /B2 * B3 * /CO ;1 X 0 1 X O 
+ BO * /B1 * B2 * /B3 */CO +1 0121 0 xX 0 
+ BO * Bl * /B2 * /B3 */CO :1 1 00 X 0 
+ BO * Bl * /B2 * B3* C1 * CO ;11031da121 
+ BO * Bl * B2* B3* Cl* /CO ;1 1 21 21 21 «0 
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7BO Bl B2 B3 Cl CO, 


FUNCTION TABLE 
BO Bl B2 B3 Cl CO 
;BINARY INPUT, 


HAmH 


mom 


HAHAHAHA 


Hume 


ava 


mHvAwH 


AHHH 


mHAH 


HmHAw 
Hone 
HHH 
HAavAH 
Hoa 


| 


Hm 


HHAtnx 


HAHAHAHA 


je ogasomneoizes 


HAHAHAHA 


poombeomroodten: 


HHA 


HAHAH 


Human 
HHmn 
joomreo pron izes: 
HAHAH 
AHHH 


Hoda 


Homan 


HHAms 


HAHAH 


moms 


HAHAH 


HAHAH 


HHAA 


oftcemteodtes: 


Huang 
HHnmm 
HHHH 
joomicomicesneo: 
HHAA 


HHHAvA 


maou 


Hmm 


HAHAH 


HAaAs 


AHA 


mimmy7A 


HAawAH 


HAHAHA 


Hmm 
HHAMme 
joogtoomreodnso) 
joommeotreonsce: 
HAHAH 


HAHAH 


HmHAnr. 


Haan 


| 


i | 


oomtcomoodtee 


AHAHAH 


jeogroomreodnes) 


HAH 


HmHan 


HHAae 


HAHAH 


HAHAH 


eomtoomrooigs = 


HAHAH 


HAmtey 


mea 


HiwyA 


HoH 


Hmm a 


mAHH 


mAHH 


HAHAH 


HmHAw 
HHamen 
eogre ems onic) 
Hee 
jeojeoms eines) 


HAHAH 


mH or 


Hm e 


HAHAH 


AYA rms 


mmm 


mom mr 


mmHAy 


| 


HmHn 
AHomn 
HHH 
mmm 
cm oo 


Hane 


HaHa 


Hour 


HAHAH 


HHHAvA 


je ofksomoodtes! 


mon 


HAHAH 


HAHAH 


Hume 
HHAma 
joo groogns ome 
sagas mks oreo, 
jeoproogneogen) 


Hoa 


momen 


Himme 


Hoan 


Haast 


HeAHA 


AHAAH 


HAHAH 


mommy 


Hmm 
HHuer 
HAH 
HHH 
HAHAHA 


bofteoste ods») 
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=e 


carr Ler Te ROKK Rae BEM mame 


=e =e me 


we 


Knorr Lr Roe Rem ree Peete Peete) 
none Tam FPP ee eee Re Ree Pe ee 
amor Pore Ree PP ee Ree Pee Ome. 
Thee Ree ae RMP eRe eee eee 
sete mea Ree RP ee ee Re ee eee 
room Pe ee Pee Pee Re ee OP 
ree me Rae Poe Rm Pe ee OP eee Oe ee 
Pore Pree Pee Pee Pee Oo eet Oe 
Pore Peete eee eee OP eee OR eae eee 
Pore rrr Pree PPP Pee ee ee 
Trot e Tem RPP eee Re Re Reel 
Teen Beer Peete See eee eae oP eee 
reer Ree Tee Pee eee Oe eee Oe ee 


DESCRIPTION 


THIS REGISTERED PROM (63RS481A) IMPLEMENTS A 6-TO-8 TABLE 
CONVERTER. SIX INPUTS ARE BO, Bl, B2, B3 (BINARY INPUTS), 
AND Cl, CO (CURRENT COLUMN NUMBER). BECAUSE THESE FOUR 
PARALLEL BINARY INPUTS ARE FROM THE 4-BIT SERIAL-IN 
PARALLEL-OUT SHIFT REGISTER, THIS PROM (63RS481A) MUST 
OPERATE AT A QUARTER OF THE SHIFT REGISTER’S RATE. EIGHT 
OUTPUTS ARE PTO, PT1, PT2, NTO, NT1, NT2, NC1, AND NCO. THE 
PTO, PT1, AND PT2 ARE POSITIVE TERNARY SIGNALS WHICH FORMS 
THREE SERIAL P4B3T SIGNAL THROUGH A 3-BIT PARALLEL-IN 
SERIAL-OUT SHIFT REGISTER. LIKEWISE, NTO, NT1, AND NT2 ARE 
NEGATIVE TERNARY SIGNALS. NC1l AND NCO ARE NEXT COLUMN 
NUMBERS. 


at Monolithic ar Memories a\ 








2-487 





PLD Devices Implement 4B3T Line Transcoder 





Title 4B3T ENCODER LOGIC2 
Pattern EN4B3T2.pds 
Revision A 

Author Cindy Lee 

Company Monolithic Memories 
Date 3/5/87 


CHIP EN4B3T2 PAL16R6 


;PIN1 PIN2 PIN3 PIN4 PINS PIN6 PIN7 PIN8 PINS PIN1C 
CLK MR PTO PT1 PT2 NTO NT1 NT2 LOAD GND 


7PIN11 PIN12 PIN13 PIN14 PIN15 PIN16 PIN17 PIN18 PIN19 PIN20 
/OE NC NS2 NS1 N4B3T PS2 PS1 P4B3T NC vec 


INPUT SIGNALS 
CLK : EXTERNAL CLOCK 
MR : MASTER RESET 
PTO, PT1, PT2 : THREE POSITIVE TERNARY SIGNALS 
NTO, NT1, NT2 : THREE NEGATIVE TERNARY SIGNALS 
LOAD : LOAD SIGNAL FOR TWO SHIFT REGISTERS 
/OE : ASSERTIVE LOW OUTPUT ENABLE 


~e ~e te te Me NO MS 


OUTPUT SIGNALS 
NS2, NS1 : NEGATIVE TERNARY SHIFT REGISTER VARIABLES 

N4B3T : OUTPUT SIGNAL OF NEGATIVE TERNARY SHIFT REGISTER 
PS2, PS1 : NPOSITIVE TERNARY SHIFT REGISTER VARIABLES 

P4B3T : OUTPUT SIGNAL OF POSITIVE TERNARY SHIFT REGISTER 


me Se te we NO 


EQUATIONS 


;THE 16R6 GENERATES: 
;(1) A 3-BIT PARALLEL-IN SERIAL-OUT SHIFT REGISTER 
; FOR POSITIVE TERNARY SIGNAL 


/P4B3T := /PTO * /PS1 ; SHIFT REGISTER OUTPUT 
+ /LOAD * /PS1 
+ /PTO * LOAD 
+ MR 


/PS1 := /PT1 * /PS2 
+ /LOAD * /PS2 
+ /PT1 * LOAD 
+ MR 


/PS2 := /PT2 
+ /LOAD 
+ MR 


;(2) A 3-BIT PARALLEL~IN SERIAL-OUT SHIFT REGISTER 
; FOR NEGATIVE TERNARY SIGNAL 


/N4B3T := /NTO * /NS1 ; SHIFT REGISTER OUTPUT 
+ /LOAD * /NS1 
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+ /NTO * LOAD 
+ MR 


/NS1 := /NT1 * /NS2 
+ /LOAD * /NS2 


+ /NT1 * LOAD 
+ MR 
/NS2 := /NT2 
+ /LOAD 
+ MR 
SIMULATION 


TRACE_ON MR CLK LOAD PT2 PT1 PTO P4B3T NT2 NT1 NTO N4B3T 





SETF /LOAD MR OE ; RESET CONDITION 
SETF /MR ; DISABLE MASTER RESET (2 | 
SETF LOAD PT2 PT1 PTO NT2 NT1 NTO ; LOAD INPUT DATA 
CLOCKF 

SETF /LOAD /PT2 /PT1 PTO /NT2 /NT1 NTO ; SHIFT DATA 
CLOCKF 

SETF PT2 /PT1 PTO NT2 /NT1 NTO ; SHIFT DATA 
CLOCKF 

SETF LOAD PT2 /PT1 PTO NT2 /NT1 NTO ; LOAD INPUT DATA 
CLOCKF 

SETF /LOAD PT2 /PT1 PTO NT2 /NT1 NTO ; SHIFT DATA 
CLOCKF 

SETF /PT2 PT1 /PTO NT2 NT1 /NTO ; SHIFT DATA 
CLOCKF 

SETF LOAD PT2 /PT1 PTO /NT2 NT1 NTO ; LOAD INPUT DATA 
CLOCKF 

SETF /LOAD PT2 /PT1 PTO /NT2 NT1 NTO ; SHIFT DATA 
CLOCKF 

SETF PT2 /PT1 PTO /NT2 NT1 NTO ; SHIFT DATA 
CLOCKF 

SETF LOAD PT2 PT1 /PTO /NT2 /NT1 /NTO ; LOAD INPUT DATA 
CLOCKF 

SETF /LOAD PT2 /PT1 /PTO NT2 /NT1 /NTO ; SHIFT DATA 
CLOCKF 

SETF PT2 /PT1 /PTO NT2 /NT1 /NTO ; SHIFT DATA 
CLOCKF 
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SETF LOAD PT2 /PT1 /PTO NT2 /NT1 /NTO ; LOAD INPUT DATA 
CLOCKF 


SETF /LOAD PT2 /PT1 /PTO NT2 /NT1 /NTO ; SHIFT DATA 
CLOCKF 


SETF PT2 /PT1 /PTO NT2 /NT1 /NTO ; SHIFT DATA 
CLOCKF 


TRACE _OFF 
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Title 4B3T DECODER LOGIC 
Pattern DE4B3T.pds 
Revision A 

Author Cindy Lee 

Company Monolithic Memories 
Date 3/18/87 


CHIP DE4B3T PAL22V10 


;PIN1 PIN2 PIN3 PIN4 PIN5 PIN6 
/CLK NC MR P4B3T N4B3T NC 


;PIN7 PIN8 PINS PIN10 PIN11 PIN12 
NC NC NC NC NC GND 


;PIN13 PIN14 PIN15 PIN16 PIN17 PIN18 


NC ERROR NC /Q4 Q3 - Q2 
;PIN19 PIN20 PIN21 PIN22 PIN23 PIN24 
Ql Qo NC NC NC vcc 
GLOBAL 


INPUT SIGNALS 
/CLK : ASSERTIVE LOW EXTERNAL CLOCK 
MR : MASTER RESET 
P4B3T : POSITIVE TERNARY SIGNAL 
N4B3T : NEGATIVE TERNARY SIGNAL 


me se Se Se TO 


OUTPUT SIGNALS 

ERROR : ERROR FLAG 

/Q4 : ASSERTIVE LOW Q4 SIGNAL WHICH IS A /LOAD SIGNAL 
FOR A SHIFT REGISTER 

Q3, Q2, Q1, Q0 : 4B3T DECODING STATE VARIABLES 


me Me Me Se Me 


EQUATIONS 


;ERROR SIGNAL IS GENERATED WHEN P4B3T AND N4B3T SIGNALS BOTH 
;ARE HIGH, OR P4B3T AND N4B3T SIGNALS BOTH ARE LOW AT THE 
;TRANSITION FROM STATE4 TO STATEO. 


ERROR = P4B3T * N4B3T ; P4B3T AND N4B3T ARE HIGH 
+ /Q4 * /Q3 * /Q2 * /Ql * /QO * /P4B3T * /N4B3T 


7;/Q4 IA A ASSERTIVE LOW LOAD SIGNAL. THE FOUR PARALLEL DATA 
s;ARE LOADED INTO 4-BIT PARALLEL-IN SERIAL-OUT SHIFT REGISTER 
;WHEN LAOD IS LOW, OTHERWISE, THE DATA IS SHIFTED OUT. 


/N4B3T * P4B3T * /Ql * Q3 * /Q4 ; LOAD SIGNAL 
J/N4B3T * /QO * Q2 * /Q3 * /Q4 

/N4B3T * /QO * Ql * Q3 * /Q4 

/N4B3T * /Ql * Q2 * /Q3 * /Q4 

N4B3T * /P4B3T * /Ql * Q3 * /Q4 


Q4 : 


tet t il 





@N monolithic LH) memories &\ 2-491 











2-492 


PLD Devices Implement 4B3T Line Transcoder 


tees 


/P4B3T 
/P4B3T 
/P4B3T 
/P4B3T 
/P4B3T 


GLOBAL.RSTF = MR 


7Q3, 


Q3 


Q2 


Ql 


Q0 


Q2, 


/QO 
/Q0 
/QL 
/QL 


* 
* 
* 
* 


Q2 * /Q3 * /Q4 
Ql * Q3 * /Q4 

/Q2 * Q3 * /Q4 
Q2 * /Q3 * /Q4 


Qo * /Ql * Q3 * /Q4 


Ql, AND QO ARE THE FOUR BINARY SIGNALS WHICH ARE 
;CONVERTED FROM THTREE TERNARY CODE WORD. 


t++tetetseil tHtetteeee 


t+eeeeteeegae 


tte et etestil 


/N4B3T 
/N4B3T 
/N4B3T 
/N4B3T 
/N4B3T 
/P4B3T 


N4B3T * /P4B3T * /QO * Q2 * /Q4 


* 
* 
* 
* 
k 
* 


P4B3T * /QO * /Q1l * /Q2 * /Q3 
P4B3T * QO * /Q2 * Q3 
QO * Ql * /Q2 * /Q4 
Ql * /Q2 * Q3 * /Q4 
P4B3T * Q4 

/Q0 * /Q2 * Q3 * /Q4 


/P4B3T * /QO * Ql * Q3 * /Q4 


N4B3T * /P4B3T * /Ql * Q2 * /Q4 
N4B3T * /P4B3T * /Qi * Q3 * /Q4 


/P4B3T * QO * Ql * /Q2 * /Q4 


/N4B3T * /QO * Ql * Q3 * /Q4 


/N4B3T * P4B3T * Q2 * /Q3 * /Q4 


/N4B3T * QO * Ql * /Q3 * /Q4 


N4B3T * /P4B3T * /QO * /Q1 * /Q3 


/P4B3T * /QO * Q2 * /Q3 * /Q4 
N4B3T * /P4B3T * /QO * Q2 
/P4B3T * QO * Ql * /Q3 * /Q4 
N4B3T * /P4B3T * Q4 


/N4B3T * 
/N4B3T * 
/N4B3T * 
/N4B3T * 
/N4B3T * 
/N4B3T * Q4 
/P4B3T * 
/P4B3T * 
/P4B3T * Q4 
/N4B3T * 
/N4B3T * 
/N4B3T * /QO * 
/N4B3T * P4B3T 
/N4B3T * P4B3T 
/N4B3T * Q4 
N4B3T * /P4B3T 
/P4B3T * /Ql * 
N4B3T * /P4B3T 
N4B3T * /P4B3T 
/P4B3T * Q4 


ON sonotithic Kl memories &\ 


/Q0 * /QL * /Q2 * /Q3 


P4B3T * /QO * /Ql * Q2 


QO * /Ql *® Q2 * /Q3 
P4B3T * QO * Ql * /Q2 
P4B3T * Q2 * /Q3 


{QO * /Ql * /Q2 * /Q3 
/QO0 * QL * /Q2 * Q3 
N4B3T * /P4B3T * /QO * Q3 
/P4B3T * QO * /Q1 * /Q2 * Q3 


/P4B3T * /QO * Ql * Q3 


/QO * /Ql * /Q3 


Ql * Q2 * Q3 
* QO * /Ql * Q3 
* /Ql * Q2 * Q3 


* /Q0 * /Ql 

Q2 * /Q3 

* QO * Ql * /Q2 
* QO * Q2 * /Q3 


; MSB OF THE FOUR 
; BINARY SIGNALS 


; LSB OF THE FOUR 
; BINARY SIGNALS 
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SIMULATION 

TRACE_ON MR /CLK P4B3T N4B3T /Q4 Q3 Q2 Ql QO ERROR 

?TO RESET WHOLE DEVICES 

SETF MR ;STATE # = Q4(LOAD) Q3 Q2 Q1 QO & ERROR 


;TEST VECTOR #1 TO CHECK SO, S1, $4, S13, AND S16 





SETF /MR P4B3T N4B3T ;SO0 = 0 0 0 O O, ERROR 
CLOCKF CLK ;INITIAL CONDITION 
SETF /P4B3T /N4B3T 7S1 =00011 

CLOCKF CLK 

SETF /P4B3T /N4B3T 784 =01100 

CLOCKF CLK 

SETF /P4B3T N4B3T 7613 =11111 
CLOCKF CLK 

SETF /P4B3T /N4B3T 781 =00011 

CLOCKF CLK 

SETF /P4B3T /N4B3T 764 = =0 1100 
CLOCKF CLK 

SETF P4B3T /N4B3T 7814 =10011 
CLOCKF CLK 

SETF /P4B3T /N4B3T 781 =00011 

CLOCKF CLK 

SETF /P4B3T /N4B3T 784 =01100 

CLOCKF CLK 

SETF /P4B3T /N4B3T 780 = 0 0 0 O O, ERROR 
CLOCKF CLK 


;TEST VECTOR #2 TO CHECK Si, S5, S15, S16, AND S17 


SETF /P4B3T /N4B3T 731 =00011 

CLOCKF CLK 

SETF /P4B3T N4B3T 785 =01101 

CLOCKF CLK 

SETF /P4B3T /N4B3T 7$15 =10000 
CLOCKF CLK 

SETF /P4B3T /N4B3T 7S1=00011 

CLOCKF CLK 

SETF /P4B3T N4B3T $5 =01101 

CLOCKF CLK 
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SETF /P4B3T N4B3T 7816 =11000 
CLOCKF CLK 
SETF /P4B3T /N4B3T ?S1=O0 0011 
CLOCKF CLK 
SETF /P4B3T N4B3T 7S5=011201 
CLOCKF CLK 
SETF P4B3T /N4B3T 7817 = 10001 
CLOCKF CLK 


*TEST VECTOR #3 TO CHECK S1, S6, S18, S19, AND S20 


SETF /P4B3T /N4B3T 7$1=00011 
CLOCKF CLK 

SETF P4B3T /N4B3T 7$6 =01110 
CLOCKF CLK 

SETF /P4B3T /N4B3T 7918 =11101 
CLOCKF CLK 

SETF /P4B3T /N4B3T 7$1=00011 
CLOCKF CLK 

SETF P4B3T /N4B3T 786 =01110 
CLOCKF CLK 

SETF /P4B3T N4B3T 7$19 =11110 
CLOCKF CLK 

SETF /P4B3T /N4B3T 781 =00011 
CLOCKF CLK 

SETF P4B3T /N4B3T 7$6 =01110 
CLOCKF CLK 

SETF P4B3T /N4B3T 7820 =10101 
CLOCKF CLK 


*TEST VECTOR #4 TO CHECK S2, S7, S20, S18, AND S21 


SETF /P4B3T N4B3T 7S2 =O001211 

CLOCKF CLK 

SETF /P4B3T /N4B3T 7S7 =00100 

CLOCKF CLK 

SETF /P4B3T /N4B3T 7820 =1012101 
CLOCKF CLK 

SETF /P4B3T N4B3T ?S2=00111 

CLOCKF CLK 
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SETF /P4B3T /N4B3T 7S7 =00100 
CLOCKF CLK 

SETF /P4B3T N4B3T 7818 =111201 
CLOCKF CLK 

SETF /P4B3T N4B3T 7*S2=0O001211 
CLOCKF CLK 

SETF /P4B3T /N4B3T 7S7 =00100 
CLOCKF CLK 

SETF P4B3T /N4B3T 7S21=101211 
CLOCKF CLK 


7;TEST VECTOR #5 TO CHECK SO, S2, S7, S8, S22, AND S23 





SETF /P4B3T N4B3T 7s2 =O O111 
CLOCKF CLK 

SETF /P4B3T /N4B3T 7S7 =00100 
CLOCKF CLK 

SETF P4B3T N4B3T 7;SO = 0 0 0 0 0, ERROR 
CLOCKF CLK 

SETF /P4B3T N4B3T ?S2=00111 
CLOCKF CLK 

SETF /P4B3T N4B3T 7S8 =001201 
CLOCKF CLK 

SETF /P4B3T /N4B3T 7814 =10011 
CLOCKF CLK . 

SETF /P4B3T N4B3T ?S2=OO021211 
CLOCKF CLK 

SETF /P4B3T N4B3T 788 = 001201 
CLOCKF CLK 

SETF /P4B3T N4B3T 7622 =11001 
CLOCKF CLK 

SETF /P4B3T N4B3T 7S2 =O 0111 
CLOCKF CLK 

SETF /P4B3T N4B3T 7S8 =O0101 
CLOCKF CLK 

SETF P4B3T /N4B3T 7S23 =103110 
CLOCKF CLK 

SETF /P4B3T N4B3T 7*S2=0 0111 
CLOCKF CLK 
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SETF /P4B3T N4B3T 7S8 =00101 

CLOCKF CLK 

SETF P4B3T N4B3T 7SO = 0 0 0 O O, ERROR 
CLOCKF CLK 


*TEST VECTOR #6 TO CHECK S2, S9, S24, S25, AND S23 


SETF /P4B3T N4B3T 7?S2=00111 
CLOCKF CLK 

SETF P4B3T /N4B3T 7869 =00110 
CLOCKF CLK 

SETF /P4B3T /N4B3T 7S24 =10100 
CLOCKF CLK 

SETF /P4B3T N4B3T 7S2=00111 
CLOCKF CLK 

SETF P4B3T /N4B3T 789 =00110 
CLOCKF CLK 

SETF /P4B3T N4B3T 7S25=11100 
CLOCKF CLK 

SETF /P4B3T N4B3T ?S2=00111 
CLOCKF CLK 

SETF P4B3T /N4B3T 789 =00110 
CLOCKF CLK 

SETF P4B3T /N4B3T 7S23 =10110 
CLOCKF CLK : 


;TEST VECTOR #7 TO CHECK S3, S10, S16, S26, AND S15 


SETF P4B3T /N4B3T 783 =01011 

CLOCKF CLK 

SETF /P4B3T /N4B3T 7$10 =01000 
CLOCKF CLK 

SETF /P4B3T /N4B3T 7816 =11000 
CLOCKF CLK 

SETF P4B3T /N4B3T 783 =01011 

CLOCKF CLK 

SETF /P4B3T /N4B3T 7810 =01011 
CLOCKF CLK 

SETF /P4B3T N4B3T 7S26=11011 
CLOCKF CLK 

SETF P4B3T /N4B3T 783 =01011 

CLOCKF CLK 
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SETF /P4B3T /N4B3T 7810 = 01000 
CLOCKF CLK 
SETF P4B3T /N4B3T 7815 = 10000 
CLOCKF CLK 


7;TEST VECTOR #8 TO CHECK S3, S11, S27, S28, AND S22 





SETF P4B3T /N4B3T 783 =01011 
CLOCKF CLK 

SETF /P4B3T N4B3T 7$11 =01001 
CLOCKF CLK 

SETF /P4B3T /N4B3T 7827 =10010 
CLOCKF CLK 

SETF P4B3T /N4B3T 183 =01011 
CLOCKF CLK 

SETF /P4B3T N4B3T 7811 =01001 
CLOCKF CLK 

SETF /P4B3T N4B3T 7828 =11010 
CLOCKF CLK 

SETF P4B3T /N4B3T 783 =01011 
CLOCKF CLK 

SETF /P4B3T N4B3T ;S11. 50 1.0 02 
CLOCKF CLK 

SETF P4B3T /N4B3T ;$22=11001 
CLOCKF CLK 


;TEST VECTOR #9 TO CHECK SO, S3, S12, S13, S28, AND S25 


SETF P4B3T /N4B3T 7S3 =O10i11 
CLOCKF CLK 

SETF P4B3T /N4B3T 7812 = 01010 
CLOCKF CLK 

SETF /P4B3T /N4B3T 7813 = 1 1-1) 22 
CLOCKF CLK 

SETF P4B3T /N4B3T 783 =01011 
CLOCKF CLK 

SETF P4B3T /N4B3T 7S12 =01010 
CLOCKF CLK 

SETF /P4B3T N4B3T 7828 = 11010 
CLOCKF CLK 

SETF P4B3T /N4B3T 7S3=O1011 
CLOCKF CLK 
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01010 


SETF P4B3T /N4B3T 7812 
CLOCKF CLK 


SETF P4B3T /N4B3T 7S25 
CLOCKF CLK 


11100 


SETF P4B3T /N4B3T 783 
CLOCKF CLK 


01011 


SETF P4B3T N4B3T 780 
CLOCKF CLK 


0 00 0 0, ERROR 


TRACE_OFF 
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Manchester Encoding 


The PALC22V10 has been programmed to give a Manchester 
encoded output for eight digital TTL-level inputs or a bit-serial 
TTL level input. The reason for using such encoding is to 
remove the DC component from a digital waveform prior to 
transmission. If the data were transmitted without encoding, 
‘DC wander' could result. 


A transmitted binary signal consists of a series of High and 
Low voltage pulses. If the signal consists of primarily High 
pulses, a mean DC component will exist. If the signal then 
becomes primarily Low pulses, the mean DC component will 
shift, or wander. This can cause transmission problems on 
capacitive lines. A Manchester encoded signal cannot have a 
DC component. 


The principle of Manchester encoding is to introduce phase 
shifting in a carrier wave, which is continuously switching, 
such that logic input changes are represented by a phase shift 


of that carrier. If data at the input of an encoder switches from 
a logic High to a logic Low, then a phase shift of 180 degrees 
(equivalent to a polarity shift) is performed. When a Low-to- 
High transition occurs at the input, the carrier phase is 
switched back by 180 degrees to the original phase. 


For example, a logic High could be encoded as a zero phase 
shift, and a logic Low as a 180-degree phase shift from a 
reference. The result in the encoded signal is that two 
successive Highs or Lows represent either a positive or 
negative data transition (Figure 1). When the data in the figure 
toggles Low to High the encoded output gives two successive 
High levels as the phase of the carrier shifts by 180 degrees. 
The transition of data from High to Low will generate two 
successive Low outputs in the encoded signal, returning the 
signal to the original phase. 


1 
DATA INPUT | | | | 
0 
MANCHESTER 14 
ENCODED Ae Le la nT] 
SIGNAL 9 


Figure 1. A Manchester Encoded Signal from a Digital TTL Input 


Applications 


Manchester encoding is applied to local area networks, such 
as Ethernet, and can be used in storage of data on magnetic 
media, or in other transmissions including infrared. Both data 
and clock are imbedded in the Manchester encoded signal. 
Clock information may easily be extracted from the encoded 
waveform by using phase-locked loop techniques, and the 
clock can be used to recover the encoded data. Thus, the data 
can be resynchronized on reception. 


Circuit 

Figure 2 shows a schematic diagram of the Manchester 
encoder circuit. The design has been made to be compatible 
with either serial or parallel data output from standard UART 
(Universal Asynchronous Receiver/Transmitter) circuits. The 
clock input to the system will come from the clock input to the 
UART circuit, which is typically sixteen times the data rate. 
The divide-by-sixteen circuit will generate a square wave 
output at R3, the original data rate as applied to the UART. This 
signal is available as an output from the system. On a PCB, the 
R3 signal can be used to decode the data. 
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ENCODED OUTPUT 


READY OUTPUT 
FLIP-FLOP 


Figure 2. Manchester Encoder Circuit 


A further counter synchronously clocked is a divide-by-eight 
counter with outputs RA, RB and RC. These outputs will select, 
in the case of a parallel data input, one of the individual data 
lines DO-D7. The selected data line is routed through the 
multiplexer and applied to the input of an eXclusive-OR (XOR) 
gate. One of the inputs to the XOR gate is the data clock rate, 
available at the R3 flip-flop output. The function of the XOR 
gate has been configured in the PALC22V10 from a 'sum-of- 
products’ architecture. 


The phase encoded signal has been derived at the output of 
flip-flop RX. Inverting the data gives a 180-degree phase shift 
from the reference established at the R3 output. The selected 
data input line determines whether or not phase (polarity) 
inversion should take place. If the data input is Low, no 
inversion takes place; if High, the reference signal is inverted 
to give the required phase shift. 


Equations 
The general output equation is: 
RX := R3*DATA + R3*DATA (= R3 :+: DATA). (1) 


The data input is selected by a one-of-eight condition of the 
state machine RA, RB, and RC. RA is the clock reference 
divided by two, RB is the clock divided by four and RC by eight. 


For a parallel input, data DO is selected to control the he output 
polarity at RX when R RA‘RB‘RC i is True, D1 when RA*RB*RC is 
True, D2 when RA*RB*RCG is True, etc. This procedure 
serializes the parallel data. The Manchester encoded signal is 
created by this dynamic control of the inverting input to the 
XOR configuration through the selected data input. 


There is a serial select input to the multiplexer circuit such that 
when driven active High, serial data applied to the DO input will 
encode the carrier signal on a bit-by-bit basis. Input lines D1- 
D7 are ignored when the encoder is used in this mode. The SER 
mode should be hard-wired Low for parallel data input. 


To enable external handshake to be accomplished in either 
serial or parallel mode, a RDY output is available. When a 
complete byte or bit is encoded and transmitted to the output 
RX, the RDY pin will go High to request more data. Note that the 
Manchester encoder does not latch the data; it must remain 
valid for the duration of the encoding. The logic circuit required 
for the RDY pin is shown as a sum of product terms as applied 
to the 'D' input of the RDY flip-flop. 


The data that is applied to the input is available in a serial form 
at the DTST output and can be used for test purposes. DTST is 
simply the serialized output of the data input DO-D7. 
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PAL Device Design Considerations 


The PALC22V10 was chosen for this application because of 
the large number of product terms required to encode the XOR 
function. Up to sixteen product terms can be used to create an 
XOR function on eight data inputs. The PALC22V10 also offers 
low power with TTL compatibility, reducing power supply 
requirements while increasing reliability. 


The flip-flops RO, R1, R2, R3, RA, RB, and RC form a long 
binary count chain. The output from R3 forms the reference 
phase output which is the clock input divided by sixteen. As 
discussed, RA, RB, and RC form a binary state machine to 
select DO-D7 data inputs individually. 


Design Procedure 


The following figures show the design procedure used to derive 
equations for the binary divide-by-sixteen counter RO-R3. The 
Boolean equations have been derived using the PALASM® 2 
software operators. 


Figure 3 shows the counter states in hexadecimal and binary 
formats. Figure 4 shows two Karnaugh maps, the state 
assignment and state transition; Figure 4a shows the 
hexadecimal value of the current state of the counter, and 
Figure 4b shows the next state. For example, the location in 
the state assignment map of hex 'A' would be represented by 
the next state hex 'B’ in the transition map. 











Figure 3. Binary Counter Truth Table. The Conditions of the 
Flip-flops RO, R1, R2, and R3 Are Shown in the Truth 
Table with the Hexadecimal Value of the Binary 
Weighting 





4b. State Transition Map 


Figure 4. The State Assignment Map (4a) Shows the Current 
Hexadecimal Value in the Appropriate Position in the 
Map. The State Transition Map (4b) Shows Next 
State. For Example, State A in the Assignment Map 
is Shown as B in the Transition Map Because that is 
the Next Count in the Sequence 


Figure 5 shows the binary weighting of the hexadecimal values 
of the flip-flops RO, R1, R2, and R3 of the transition map. From 
this map, individual Karnaugh maps are drawn for each 
individual flip-flop, and equations derived by performing 
minimization for each flip-flop. 


R1 RO 





Figure 5. State Transition Map with the Binary Weighting of 
the Hexadecimal Entries. The Register Significance 
of the Entry is from Left to Hight. For Example, 1101 
Represents the Binary Weighting of RO R1 /R2 R3 
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Figure 6 shows the Karnaugh maps and the equations for either 
true or complement outputs. For true outputs, minimization is 
performed on the logic One entries in the map. Where an 
inverting buffer is used between the internal flip-flop and the 
outside world, minimization is performed on the logic Zero 
entries in the map (shown bolder). This is because the internal 
flip-flop's output is set High, but the state of the output pin is a 
logic Low due to the inverting action of the buffer. With the 
output configuration of the PALC22V10, it is possible to 
choose either inverted or non-inverted outputs, so either 
equation will be suitable. 


R1 RO 
10 11 01 





(RO:=RO (1) 
RO:=/RO (2) 


6a. Map for Flip-flop RO. 


IR1 := RO*R1 + /ROYRI (3) 
Ri :=/RO'R1 + ROVR1 (4) 


6b. Map for Flip-flop R1. 


R1 RO 
R3 R2 01 


AL Tey 
"| YI@l Ky) 


ay KO) (Ra) 
oa CURE ICRI) 


/R3 := R3*R2*R1*RO + /R3*/R2 
+ /R3*/R1 + /R3*/RO (7) 


R3 :=/R3°R2*R1*RO + R3"/R2 
+R3/R1 + R3RO (8) 


6d. Map for Flip-flop R3. 







/R2 := R2*R1*RO + /A2*/R1 + /R2*/RO (5) 
R2 := /R2*R1*RO + R2*/R1 + R2*/RO (6) 


6c. Map for Flip-flop R2. 


Figure 6. The Karnaugh Map of Figure 5. Broken Down for 
Individual Flip-flops RO, R1, R2, and R3. 
Minimization is Performed for Each Individual Flip- 
flop 


The PALASM 2 software operators describing the equation 
are: 


* logical AND 
+ logical OR 
logical inversion 
‘= registered output equation. 


The equations derived for a general counter of length N 
become: 


QN — := QN*QN-1*QN-2*...Q2°Q1"Q0 
+ QN*tON-1 
+ QN*QN-2 


+ QN'Qi 
+ QN*Q0 
using the inverting option, or: 


QN c= GN*QN-1*QN-2*...Q2°Q1*Q0 
+ QN‘ONA 
+ QN'OND 


+ QN*OQi 
+ QN*Q0 
for the non-inverting option. 


This generalized equation requires N + 1 product terms for a 
general flip-flop, QN. For example, Q7, in a binary count 
sequence, will require a summation of eight product terms. The 
complete design specification is shown in the PAL device 
Design Specification of Pattern 04. 


Design Results 


Figure 7 is an oscilloscope photograph of the outputs of the 
programmed device. The top trace is the RDY signal. It goes 
High to indicate that a complete data byte has been 
transmitted and that the next data byte may be presented to 
the data inputs DO-D7. The second trace is DTST, the serial 
data stream DO-D7 of the parallel data input DO-D7. The bottom 
trace is the Manchester encoded waveform of the parallel data 
input, or RX. The markers show one complete frame of 
information. 





Figure 7. Oscilloscope Trace of Programmed Device. The 
Manchester Encoded Output is RX 
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TITLE MANCHESTER ENCODER. 
PATTERN . 04. 

REVISION 01. 

AUTHOR CHRIS JAY. 

COMPANY MMI SANTA CLARA, CA. 
DATE 17 JULY 1986. 


, 

;THE PALC22V10 HAS BEEN DESIGNED TO FUNCTION AS A MANCHESTER 
; ENCODER CIRCUIT. THE BINARY DATA INPUT WILL DETERMINE WHETHER 
7;THE TRANSMISSION SIGNAL SHOULD BE IN PHASE WITH A CLOCK 
;REFERENCE OR SHIFTED BY 180 DEGREES FROM THAT REFERENCE. 
;THE CHARACTERISTIC OF A MANCHESTER ENCODED SIGNAL IS A 
7CHANGE OF PHASE FOR EACH TRANSITION OF THE DATA INPUT. 
ASSUMING THAT 'X' IS THE PHASE OF THE CARRIER THAT ENCODES 
7A LOGIC LOW INPUT, WHEN THE INPUT DATA TRANSITIONS 

7TO A LOGIC HIGH THE PHASE OF THE CARRIER WILL BE SHIFTED 
7TO 'X + 180'. IT WILL CHANGE BACK TO 'X' ONLY WHEN THE 
;DATA INPUT TOGGLES FROM A LOGIC HIGH TO A LOGIC LOW. 

7;THE KEY FEATURES OF THE ENCODED SIGNAL ARE: 





1. DATA AND CLOCK FREQUENCY INFORMATION IS 
CONTAINED IN THE ENCODED SIGNAL. CLOCK 
INFORMATION MAY BE EXTRACTED BY USING A 
PHASE LOCKED LOOP IN THE RECEIVER. 

2. NO DC COMPONENT IS CONTAINED IN THE 
PHASE ENCODED SIGNAL. 


Oe Te Me TR NS NO NO Ne 


;THE APPLICATIONS OF PHASE ENCODED SIGNALS CAN BE IN SHORT 
sCOMMUNICATIONS LINKS, SUCH AS ETHERNET OR INFRARED TRANS- 
;MISSIONS, OR IN STORAGE ON SOME MAGNETIC MEDIA. 

;THERE ARE EIGHT DATA INPUTS, DO TO D7. DATA PRESENT ON 
7;THESE PINS WILL BE ENCODED INTO A BI-PHASE SIGNAL AT THE 
;OUTPUT QX, WHEN THE SER (SERIAL) INPUT DATA PIN IS CONFIGURED 
7TO A LOGIC ZERO. TO CONVERT A SERIAL DATA STREAM OF LOGIC 
7;ONES AND ZEROS INTO A PHASE ENCODED SIGNAL THE SER INPUT 
7;IS TIED HIGH, AND THE DATA IS APPLIED TO THE DO INPUT 

;OF THE PAL DEVICE. A CLOCK INPUT OF 16 TIMES THE DATA RATE 
7;IS REQUIRED FOR THE PARALLEL OR SERIAL ENCODING OF THAT 
;DATA, SO THE DESIGN MAY BE USED WITH MANY POPULAR UART LSI 
s;CIRCUITS. THERE IS ONE HANDSHAKE OUTPUT FROM THE DEVICE, 
*RDY. THIS SIGNAL GOES HIGH WHEN THE NEXT DATA BYTE OR BIT 
7IS REQUIRED TO BE SET UP AT THE INPUT. 
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CHIP MANENC PAL22V10 


;PIN 1 2 3 4 5 6 

CLK /RST DO Dl D2 D3 
;PIN 7 8 9 10 il 12 

D4 D5 D6 D7 NC GND 
;PIN 13 14 15 16 17 18 

SER RDY /DTST RC RB RX 
;PIN 19 20 21 22 23 24 

RA R3 R2 Rl RO vcc 
GLOBAL ;GLOBAL TERM 


;ENABLES RESET. 
;STRING DECLARATIONS. 


STRING SO '/RC*/RB*/RA' ;ENCODE STATE 0 
STRING Sl '/RC*/RB* RA' ;ENCODE STATE 1 
STRING S2 '/RC* RB*/RA' ;ENCODE STATE 2 
STRING $3 '/RC* RB* RA' ;ENCODE STATE 3 
STRING S4 ' RC*/RB*/RA' ;ENCODE STATE 4 
STRING S5 ' RC*/RB* RA! ;ENCODE STATE 5 
STRING S6 ' RC* RB*/RA' ;ENCODE STATE 6 
STRING S7 ' RC* RB* FA! ;ENCODE STATE 7 


;FOR PARALLEL INPUT 
;S0 SELECTS DO INPUT, 
;S1 SELECTS Dl INPUT, 


; ETC. 
EQUATIONS 7 
GLOBAL.SETF = RST ;SET INITIAL 
;CONDITIONS, RESET. 
/RO := RO ;RO DIVIDES THE CLOCK 
; INPUT BY 2. 
/R1 := R1* RO Rl DIVIDES THE CLOCK 
+ /R1*/RO ;BY 4. 
/R2 := R2* R1* RO s;R2 DIVIDES THE CLOCK 
+ /R2*/R1 ;BY 8. 
+ /R2* /RO ; 
/R3 := R3* R2* R1* RO *>R3 DIVIDES THE CLOCK 
+ /R3*/R2 ;BY 16. 
+ /R3* /R1 ; 
+ /R3* /RO ; 
;RDY OUTPUT PERFORMS 
RDY : RC*RB*RA*R3*R2*R1*RO*X/SER ;A HANDSHAKE OPERATION. 


+ it 


R3*R2*R1*RO* SER ;WHEN HIGH, NEW DATA 
;MAY BE APPLIED. 





2-504 a\ Monolithic rar Memories Pm | 
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/RC : 


/DTST: 


++tteetil 


/RX 


t++ettei 


++eeteen +++ ¢ 4+ 40 
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RA* R3* R2* R1* RO*/SER 


/RA* /R3* /SER 
/RAt /R2* /SER 
/RA* /R1* /SER 
/RA* /RO*/SER 
SER 
RB* RA* R3* R2* R1* RO*/SER 
/RB* /R3* /SER 
/RB* /R2* /SER 
/RB* /R1* /SER 
/RB* /RO*/SER 
/RB* /RA* /SER 
SER 
RC* RB* RA* R3* R2* R1* RO*/SER 
/RC* /R3* /SER 
/RC* /R2* /SER 
/RC* /R1* /SER 
/RCK /RO*/SER 
/RCK /RA* /SER 
/RC*/RB* /SER 
SER 
S0*DO 
S1*D1*/SER 
S2*D2*/SER 
S$3*D3*/SER 
S4*D4*/SER 
S5*D5*/SER 
S6*D6*/SER 
S7*D7*/SER 
SO*/R3*DO + SO*R3*/D0 
S1*/R3*D1*/SER + S1*R3*/D1*/SER 
S2*/R3*D2*/SER + S$2*R3*/D2*/SER 
S3*/R3*D3*/SER + S3*R3*/D3*/SER 
S4*/R3*D4*/SER + S4*R3*/D4*/SER 
S5*/R3*D5*/SER + S5*R3*/DS*/SER 
S6*/R3*D6*/SER + S6*R3*/D6*/SER 
S7*/R3*D7*/SER + S7*R3*/D7*/SER 


;COUNTER [RA, 


RB, RC] 


;COUNTS THE NUMBER 
;OF BITS TRANSMITTED 


;WHEN A BYTE IS 
;APPLIED TO THE 


;DO - D7 DATA INPUTS. 
;THE LOGIC CONDITION 
;OF THE SELECTED 
;INPUT IS MULTIPLEXED 
;TO THE RX REGISTERED 


;OUTPUT BY THE 


‘ONE OF 


;EIGHT' STATE SELECTOR, 
;IN THE DIVIDE BY EIGHT 


;COUNTER RA, RB 


, RC. 


;SERIAL MODE /RA*/RB*/RC 
;SELECTS DATA INPUT DO, 
;RA*/RB*/RC SELECTS Dl, 
;ETC., WHILE /SER SELECTS 


; PARALLEL OPERATION. 


;SER IS ACTIVE, 


IF 
SERIAL 


;ENCODING IS SELECTED. 
;THE SERIAL DATA APPLIED 
;TO THE DO INPUT IS 


;ENCODED. D1 ~ 


D7 


; INPUTS ARE DESELECTED. 
;DO0 ~ D7 IS SERIALIZED 
;FOR PARALLEL MODE. 
;FOR SERIAL MODE, DO ONLY 
;IS CLOCKED THROUGH TO 
;THE DTST OUTPUT. 
;/DTST OUTPUT IS THE 
;SERIALIZED ENCODING 
;OF THE DATA INPUT 


;DO0 — D7, OR DO 


;THE RX OUTPUT PROVIDES 


; THE MANCHESTER 


ENCODED 


;OUTPUT OF THE REFERENCE 
; FREQUENCY, WHICH OCCURS 


;AT THE R3 OUTPUT. 


;PHASE AT R3 IS 


THE 
SHIFTED 


;AT THE RX OUTPUT IF THE 
;SELECTED DATA INPUT PO- 


;LARITY CHANGES 


;IS INACTIVE, DO - D7 


; INPUTS ARE SELECTED. 
, ONLY 
;SERTAL INPUT APPLIED 


;/SER IS ACTIVE 


;D0 IS SELECTED 


ON monorithic Kt memories &\ 


TO 





FOR 


IF /SER 


IF 
THE 
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SIMULATION ; 
TRACE ON CLK RO R1 R2 R3 RA RB RC RX ;TRACE ALL SIGNALS. 
/DTST RDY DO Dl D2 D3 D4 D5 ; 
D6 D7 SER /RST ;SET INITIAL CONDITIONS 
SETF /CLK RST DO D1 /D2 D3 D4 /D5 ;FOR DATA & CONTROL IN- 
/D6 /D7 /SER ;PUTS. PARALLEL MODE SET. 
SETF RST ;RESET ACTIVE TO 
CLOCKF CLK ; INITIALIZE THE 
SETF /RST ; REGISTERS 
FOR I := 1 TO 128 DO ;CLOCK 8 DATA BITS 
BEGIN CLOCKF CLK ;THROUGH TO R3 & RX 
END ;OUTPUTS. 
SETF SER DO ;SET SERIAL MODE VIA 
FOR I := 1 TO 32 DO ;SERIAL CONTROL PIN. 
BEGIN CLOCKF CLK ;CLOCK SYSTEM TO SELECT 
END ;SERIAL OPERATION. 
TRACE OFF ; 
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Peripherals are integral to any computer system. They range 
from basic keyboards and printers to mass storage disks/tapes. 
In this section we will focus on mass storage systems such as disk 
drives and tape drives. 


Atypical mass storage subsystem can be thought of in terms of 
two major sections: the controller and the storage device. The 
controller provides an interface between the host and the storage 
device. It may control one or several storage devices at a time. 
Common storage devices are disk drives (winchester and floppy) 
and tape drives (reel and cartridge). 


The discussion that follows will be oriented toward the storage 
device being adisk drive although it could just as easily have been 
done using tape drives. 


Figure 1 shows two configurations of disk storage subsystems. In 
Figure 1A the storage device is connected to the host via a string 
controller. Communication between the storage device and the 
controller is accomplished through standardized interface proto- 
cols. Examples of common disk drive interfaces in use today are 
SMD (Storage Module Device), ESMD (Enhanced SMD), ESDI 
(Enhanced Small Disk Interface), ST506, and SA465. An appli- 
cation example of an ESDl interface controller is shown on page 
2-509. An intelligent storage device is created when the string 
controller function is embedded into the storage device (Figure 
1B). Intelligent storage devices typically communicate to the host 
through a host adapter using either SCSI (Small Computer 
System Interface) or IPI (Intelligent Peripheral Interface) interface 
protocols. 






HOST 












STORAGE 
DEVICE 


STRING 
CONTROLLER 





a. String Controller With a Storage Device 


IPI 











HOST 










HOST 
ADAPTER 


INTELLIGENT 
STORAGE DEVICE 





b. Host Adapter With an Intelligent Storage Device 


412 01 
Figure 1. 


Controller 


The controller function can be conceptually divided into several 
sections: the Host Bus Interface, the Data Buffer, the Kernel and 
the Device Interface. 







KERNEL 





HOST BUS 
INTERFACE 


DEVICE 
INTERFACE 
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Figure 2. 


The Host Bus Interface provides the connections to the host CPU 
and main memory. It can serve as either aslave or a master on 
the host bus. When the controller is a bus slave, the host CPU 
must be able to access the controller, pass commands, and 
obtain subsystem status using established host bus protocol. 
The controller behaves as a bus master while transferring data in 
either direction, or posting status. Abus master must be able to 
arbitrate for the host bus and execute the necessary handshake 
protocols. 


The Data Buffer stores the data being read from or written to the 
disk and checks for data integrity. The physical buffer can serve 
several purposes: data transfer rate adaptor between the host 
and the disk, a temporary storage for data going to the host until 
it can be verified error free, and a cache. Depending on the 
complexity of the data buffer it may be as small as one byte or as 
large as a complete track of data. 


Data integrity can be monitored for datatransfers from the host as 
well as from the disk drive. The sophistication used to monitor 
data integrity varies significantly from simple single bit error 
detection (parity) to large multiple bit error detection and correc- 
tion. The host may elect to transfer data with a parity bit for each 
byte. Data error detection and correction schemes are employed 
between the controller and the disk drive. As storage densities 
have increased, so have the size of the errors. The demand for 
more capable error detection and correction codes that can more 
accurately detect and correct larger number of bits in error has 
increased accordingly. 
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Such error detection codes as CRC code are satisfactory if the 
error occurs inthe data transmission channel and the datacan be 
recovered by retransmission. Data errors caused by media 
defects require the capability of both error detection and correc- 
tion. As the nature of media errors has changed, so have the error 
detection and correction codes being used. Fire Codes and 
computer generated codes have been the most popular codes. 
Symbol oriented codes, such as Reed-Solomon codes, are 
emerging as the solutions for the future. 


The Kernel is the brain of the controller and the manager of the 
subsystem. It translates commands from the hostinto commands 
for the storage device. It receives requests from the operating 
system through the host interface and notifies the host when the 
request is completed. To execute the host's request for data, the 
kernel may have to access the storage device or it may only have 
to access acache. Whenaccessing the storage device the kernel 
sends commands according to the device interface protocols. 


The Device Interface communicates directly with the storage 
device. Using the particular interface protocols, the device inter- 
face sends commands, retrieves status, and transfers data 
between the controller and the storage device. The device inter- 
faces shown in Figure 3 are predominantly disk and would have 
to be changed to accommodate tape drives. An application 
example of a tape drive interface controller is shown on 
page 2-519. 


Storage Device 


The storage device can also be divided into several sections: the 
Interface & Control, the Servo and the Read/Write. 










INTERFACE 
& CONTROL 


READ / WRITE 
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Figure 3. 


The Interface & Control is the brain of the storage device. 
It interprets the commands from the controller. It supervises the 
servo and the read/write and maintains the device status. 


The Servo is responsible for the movement of the media and the 
positioning of the read/write heads. For disk drives D.C. brush- 
less motors are predominantly used to drive the spindle. Head 
positioning is done either with a stepper motor or a voice coil 
actuator. In the case of tape drives, the Servo is only responsible 
for capstan motor control because the position of the read/write 
heads is fixed. 


The Read/Write is responsible for storing and retrieving datafrom 
the media. The data encoding method must balance data density 
and ease of data recovery. Several Run-Length-Limited (RLL) 
coding techniques are in use today.. Winchester disk drives have 
used frequency modulation (FM), modified FM (MFM), modified 
MFM (MMF), and are currently using 2,7 to obtain present day 
densities. Tape drives employ different RLL codes than disk 
drives. The two common codes in use for tape storage are phase 
encoding (PE) and group-coded recording (GCR). The applica- 
tion note on page 2-541 shows an example of a GCR encoder im- 
plemented with PAL devices. 


When the storage device is configured with an embedded control- 
ler (Figure 4) the Device Interface section of the controller and the 
Interface & Control section of the storage device are not required. 
The Kernel will directly control the storage device. 


IPI KERNEL | SERVO 
SCSI 
BUS 
¢ —__} INTERFACE 





DATA 
BUFFER READ / WRITE 
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Figure 4. 


PLDs have been used in designs throughout mass storage 
subsystems. Applications range from consolidation of random 
logic to address decoding, bus arbitration logic to protocol! state 
machines, and complex encoding and decoding. Application 
notes and technical articles are included here to aid designers in 
understanding the benefits of implementing different designs with 
programmable logic. 
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The ESDI Translator 


ESDI is a low-cost, high-performance interface standard suita- 
ble for the smaller, high-performance Winchester disk drives 
currently being produced. The ESDI interface consists of a 
control cable and a data cable. The control cable allows for a 
daisy chain connection of up to seven devices (disk or optical 
drives) with only the last device being terminated. In our de- 
sign, we assumed that the device is a disk drive. The data 
cable is attached in a radial configuration (See Figure 1). 


HOST 


CONTROL 





CONTROLLER 






L u2 DRIVENO. 2 
|i tA, 


DC VOLTAGES 
(RADIAL) 
FRAME GROUND 
TERMINATOR 
‘ i 2 LAST DRIVE 
Note: Maximum number of drives = 7 ONLY 


Figure 1. Connection Between the Controller and Multiple 
Drives 


MSB 






CMD FUNCTION CMD MODIFIER 


REGISTER A 


a ee eee 
CMD FUNCTION CMD PARAMETER | P| 
Se i el 


The ESDI Translator handshakes serial commands from a disk 
controller, deserializes the commands and passes the com- 
mands to a microcontroller. The command data word structure 
is shown in Figure 2. 


The Command Function bits define functions to be executed by 
the disk drive. These functions are seek, recalibrate, request 
status, request configuration, select head group, control, data 
strobe offset, track offset, initiate diagnostics, set bytes per 
sector and set configuration. Some of these functions, such 
as the control function, have modifiers for more detailed func- 
tional description. Other commands have parameters that con- 
tain numbers. For the seek command, the parameter specifies 
the cylinder number that the drive will seek to. The request 
status and request configuration commands require data from 
the disk drive to be transferred back to the disk controller. In 
our current design, internal registers A and B in the LCA device 
represent the upper and lower bytes of the command respec- 
tively. 


Figure 3 illustrates the relationships between the disk control- 
ler, the ESDI Translator, and the microcontroller. The PROM is 
used to store the configuration data for the LCA device. The 
Done/Program (D/P) output is driven LOW when the device is 
being configured. Configuration data is read from the PROM 
device during configuration. After configuration is complete, 
the LCA device drives the D/P pin HIGH to deselect the PROM. 
Drive selection, write protection, command completion and 
fault detection are also handled by the ESD! Translator. 


ODD 
PARITY 


LSB 





REGISTER B 


Figure 2. Command Data Word Structure 


Logic Cell™ Array and XACT™ are trademarks of X!LINX, Inc. 
P-SILOS™ is a trademark of SimuCad Corp. 
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RESET 
CLOCK 


COMMAND DATA 
CONFIG/STATUS DATA 
TRANSFER REQUEST 
TRANSFER ACKNOWLEDGE 
ATTENTION 
DISK COMMAND COMPLETE 
CONTROLLER 


WRITE GATE IN 
READ GATE IN 
DRIVE SELECT 0 
DRIVE SELECT 1 
DRIVE SELECT 2 
DRIVE SELECTED 








ADDRESS 
BUS 
= 2Kx8 
DATA BUS 
A(1-0)/CS MICRO 
CONTROLLER 
RD 
WR 
INT 
LCA 
mene? O DRIVE SWITCH 0 
O DRIVE SWITCH 1 
0 DRIVE SWITCH 2 
0 WRITE PROTECT 


EXT. CHANGE OF STS. 
READ GATE OUT 
WRITE GATE OUT 


Figure 3. An ESD! Translator Implemented on the LCA Device 


Why Use an LCA Device in an ESDI 
Translator 


The ESD! interface standard requires more logic functions to 
be built into the disk drive than some other interface stan- 
dards, such as the ST506 standard. However, the external di- 
mensions of a disk drive usually have to conform to an industri- 
al standard form factor. Thus, the use of high-density semicus- 
tom chips is the logical solution to increase functionality with- 
out increasing the external dimensions. 


The LCA device is a high-density CMOS integrated circuit 
available from Monolithic Memories. Its high gate density al- 
lows the implementation of an ESDI Translator in a single chip. 
Fifteen standard SSI and MSI chips would be necessary for the 
same application. If PLDs are used to implement the ESDI 
Translator, more than one would be necessary because a large 
amount of logic is required, thus occupying more board space. 


A major advantage in using the LCA device is that it can speed 
up the design cycle, enabling the manufacturer to have a 
shorter time-to-market. Also, many peripheral products are 
produced in relatively small quantities aiming at very special- 
ized markets. The LCA device, which has no NRE cost, makes 
the production of small quantities more economical than the 
gate array. Another advantage of the LCA device over other 
semicustom chips, such as the gate array, is its reprogramma- 
bility feature. The LCA device is RAM-based which can easily 
be reprogrammed by the user in the final system. This feature 
is especially important in the peripheral products market, 
where many products have short life spans. 


Design Implementation 


The ESDI Translator is responsible for all contro! interfaces 
between the disk controller and the disk drive. An internal 
block diagram of the ESDI Translator is shown in Figure 4. It 
consists of five major logic building blocks: 


° Drive selection 

e Read gate/write gate 

e¢ Counter/controller 

e Shift register and parity generator/checker 
e Internal register address decoder 


Drive selection on ESDI-compatible drives involves three 
signals from the disk controller, Drive Select 0-2. These three 
drive select lines are encoded so that up to seven drives may 
be connected to the same ESDI port, as shown in Table 1. 


On the LCA device, the drive number is selected by connecting 
the drive switch pins to either VCC or GND. When the code on 
the drive select lines, DSX, equals the code on the drive switch 
pins, DSWX, where X may be 0, 1, or 2, the drive is selected 
and the Drive Selected signal, DSELD, is asserted. Once the 
drive has been selected, serial commands output by the disk 
controller will be read by the LCA device. The actual implemen- 
tation is shown in Figure 5, using two CLBs and seven IOBs. 
BDSO and BDS1 are the names of the CLBs in the current de- 
sign. In our design, names that begin with the letter B or P are 
used to designate a CLB or an IOB respectively. SCLK is the 
system clock. 
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TRANSFER REQUEST 
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COMMAND COMPLETE 
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OF STATUS 
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WRITE GATE IN 
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Figure 4. ESDI Translator Internal Block Diagram 









DSW2 DSW1 DSWwo 
ee a a a ee 
| oo | oft | Select rvs 
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ae 
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Poa | Select Drive 6 
Select Drive 7 


Figure 5. Configuration of the BDSO and BDS1 CLBs to 
Table 1. Drive Selection Provide the Drive Selected Signal 
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Figure 6 shows the configurations of the two CLBs as dis- 
played on the computer screen by the XACT development soft- 
ware. In Figure 6a, the CLB is configured as one function of 
four variables. The D flip-flop is not used. The logic representa- 
tion, truth table, Karnaugh map, signal names, block name and 
Boolean equation are shown. 


In Figure 6b, the CLB is configured as one function of three 
variables, the output of which is connected to the D flip-flop. 
The Q output of the D flip-flop becomes the output of this CLB. 





F = ~(D@B+A@C) 


Figure 6a. Drive Selection Logic Implemented in a 
CLB (BDS0) 


A: DS2 
B: DSELDA 


Cc: 
D: DSW2 
K: SCLK 


xX: 
Y: DSELD 


F = ~(D@A)'B 





Figure 6b. Drive Selection Logic Implemented in a 
CLB (BDS1) 


The LCA device also performs logic functions for the Read 
Gate and Write Gate logic blocks. The Read Gate signal allows 
data to be read from the disk, and the Write Gate signal allows 
data to be written on the disk. These signals from the disk 
controller are input to the LCA device as Read Gate In and 
Write Gate In. Under normal operating conditions, Read Gate 
Out is asserted when Read Gate In is asserted and Write Gate 
Out is asserted when Write Gate In is asserted. When both 
Read Gate In and Write Gate In are asserted, a write error 
condition results and the Attention line is asserted, signalling 
the disk controller that an error has occurred. Also, the LCA 
device may be used to provide write protection to a disk drive. 
When the Write Protect signal is asserted, the Write Gate Out 
signal will not be asserted when the Write Gate In signal is 
asserted, preventing the write circuitry from being activated. 
These logic functions are implemented in two CLBs, BRWGO 
and BRWG1, as shown in Figure 7. 


DSELD 
WRGIN 


RDGIN 
WRPRT 


WRGOUT 





DSELD 


RDGOUT 
RDGIN 


Figure 7. Read Gate/Write Gate Logic Implementation 


The Counter/Controller handshakes commands from the disk 
controller. It also handshakes status/configuration data to the 
disk controller. It is also responsible for generating the Inter- 
rupt, Attention and Command Complete signals. Seventeen bit 
commands (one bit is parity) are transferred from the disk con- 
troller to the LCA device via the Command Data line. The serial 
bit transfer is performed using a pair of handshaking signals, 
Transfer Request (TREQ) and Transfer Acknowledge (TACK). 
TREQ is asserted by the disk controller when a bit is valid on 
the Command Data line, and TACK is asserted by the LCA de- 
vice when the command bit has been read. The handshaking 
action is shown in Figure 8. 


CMD VALID DATA 
TACK Ne a 


Figure 8. Command Data Transfer 


A 17-state counter counts the number of command bits shifted 
in or shifted out of the data registers. Its implementation is 
shown in Figure 9. The SHIFT signal is asserted, thus 
incrementing the counter, whenever there is a transfer request 
and the LCA device is selected. CNT_QQ0 is the lowest bit of the 
shift register counter. This bit is inverted whenever SHIFT is 
asserted unless sixteen bits have already been shifted into the 
LCA device (CNT16 asserted, or CKCMD asserted when all 
seventeen bits have been shifted in). The DATAOUT signal is 
asserted after a Request Status command or a Request 
Configuration command has been transferred and the internal 
data registers have been loaded with data to be serially shifted 
out. Hence, when DATAOUT is asserted, CKCMD is negated 
and the counter is enabled. 
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Figure 9. 17-State Counter Implemented in six CLBs 


The logic generation of the Attention signal, ATTEN, is shown 
in Figure 10. Whenever there is a write fault, WRFLT, a parity 
error, PARERR, an interface fault, INTFLT, or an external error, 
CSTS(Change of Status), the ATTEN signal is asserted. When 
both WRGIN and RDGIN signals are asserted and the LCA 
device is selected, the WRFLT signal is asserted. The WRFLT 
signal is negated when the command transferred to the LCA 
device Register A is the Reset command defined by the ESDI 
standard, which has a Command Function of 0101 (Control) 
and a Command Modifier of 0000 (Reset Attention and 
Standard Status). The Command Function and Command 
Modifier formats are shown in Figure 2. 


Three CLBs, BPGO, BPG1, and BPG2, are responsible for 
generating the PARERR signal. BPGO is a multiplexer that 
selects the source of the input to the parity generator/checker 
(BPG1). If data is being shifted into the LCA device (parity 
checker mode), the CMDBITA input is chosen. If data is being 
shifted out of the LCA device (parity generator mode), the 
RA_Q7 output is chosen. These two signals are also shown in 


Figure 11. BPG1 is an odd parity generator/checker. In the 
parity checker mode, whenever an odd number of ones are 
passed through this CLB, the output is one. This output signal 
is connected to BPG2, which inverts the signal and asserts or 
negates the PARERR signal accordingly. If parity is correct, an 
interrupt is asserted to the microcontroller informing the 
microcontroller that a command has been received and is 
ready to be read. !n the parity generator mode, the output of 
BPG1 is the parity bit. BPG1 is clocked by the SHIFT input, 
which is asserted whenever there is a transfer request and the 
LCA device is selected. BPG1 is reset by the Reset Parity 
Generator input, RSTPGEN. This signal is asserted when ei- 
ther INT is asserted or an interface fault is detected. 


The Interface Fault signal is asserted when the LCA device is 
selected and CNTR is negated before seventeen bits have 
been transferred. CNTR is asserted after the first command bit 
is shifted in and is negated after the seventeenth bit is shifted 
out. BINTFLTO is clocked by the system clock, SCLK, and 
reset by the RSTCOS signal. The IOB PCSTS is configured as 
a buffered input to signal external error conditions. 


The microcontroller is able to address four register locations in 
the LCA: two data registers, BRGA and BRGB, one error 
register, BDMX, and one command complete register, BRC7. 
Only three bits in the error register are used. They are bit 0 for 
parity error, bit 4 for interface fault and bit 7 for write fault. Only 
one bit in the command complete register is used. This is bit 0, 
which has a value of zero when the command is completed. 
The addresses and contents of the registers are shown in 


Table 2. 
BIT NUMBER 

Fat a0 | AEGISIES 76543210 
poo] recstena | | | | 
pos | recsters | | | | | | | 

ERROR 

REGISTER 

COMMAND 

COMPLETE REGISTER 


Table 2. Register Addresses and Contents 












The LCA device implementation of the registers and multiplexer 
is shown in Figure 11. When the interrupt signal is asserted, 
the microcontroller reads the two data registers by setting the 
Ai and AO address lines appropriately and asserting the RD 
signal to the LCA device. These data registers contain the 
command that is transferred from the disk controller through 
the CMDBITA input. If the command is a request data 
command, configuration or status data is written to these two 
data registers by the microcontroller. These two bytes, plus a 
parity bit that is generated by the parity generator in the LCA 
device, are serially transferred to the disk controller over the 
Config/Status Data line through RA_Q7. After all seventeen 
bits have been transferred, the Command Complete signal is 
asserted. If the command is not a request data command, the 
microcontroller executes the command and upon completion, 
writes a byte of zeros to the command complete register of the 
LCA. When the command complete register is written with all 
zeros, the Command Complete signal is asserted by the LCA 
device.The command complete register may only be written, 
not read. The status register contains error bits that are set 
when errors are detected. This register may only be read and 
not written. 
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Figure 10. Generation of the Attention Signal by the Four Possible Error Conditions 
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Design Considerations 


The circuit diagram of the programmed LCA device is shown in 
Figure 12. This design implementation uses sixty-three of the 
sixty-four CLBs within the LCA. This translates into a 98% 
usage. The rightmost column of CLBs contains the multiplexer, 
which selects between registers A, B, or the error register to 
be placed on the data bus. Registers A and B are placed on the 
third and second column from the right end, respectively. The 
registers and multiplexer are placed physically close to each 
other to simplify routing. The CLBs which execute a particular 
function are placed physically next to each other. These 
functions include the parity generator/checker, 17-state 
counter, read gate/write gate logic, drive selection logic and 
error detection logic. The read gate/write gate logic and the 
drive selection logic, which require much I/O activity, are 
placed in CLBs near the edges of the device. 


Another consideration in implementing an LCA design is 
routing. Long lines are available for signals that have to travel 
a long distance within the LCA device. These lines can also be 
used for signals that must have minimal skew between differ- 
ent destinations. An automatic routing program is available in 
the XACT software package. 


Routing requires careful consideration when a design uses a 
high percentage of the available CLBs (above 95%). Although 
automatic routing is available, designs with high CLB usage 
may require point-to-point routing (EDITNET command in 
XACT). Some of the techniques that can be used are swapping 
input pins, swapping CLBs, and implementing buffers in 
unused sections of the CLBs. When swapping input pins, the 
designer must be careful because certain functions, such as 
the clock input to the flip-flop, may only be input on certain 
pins. Swapping CLBs is very simple because the XACT 
software provides a command for swapping CLBs, but 


sometimes not all of the signals can be successfully rerouted. 
Passing a signal through a CLB presents a routing channel that 
is not otherwise available. However, a delay is added to the 
signal. Usually, a combination of these techniques can be 
used to successfully route a high-density design. 


The high number of IOBs in the LCA device gives the designer 
much flexibility in designing the pinout of the device. In this 
design, thirty-five of the IOBs are used. The IOBs that are not 
being used for the actual design are not left unused, but are 
configured as outputs and are connected to various signals 
within the LCA device to provide test points for the LCA device. 
This greatly increases the testability of the design once it is 
placed on the board. 


XACT is used by the designer to define the CLBs and IOBs, 
and to perform EDITNET. Alternately, the design may be input 
using the schematic capture software offered by Daisy and 
Futurenet. In these methods, an automatic place and route 
software package divides the design into blocks that may be 
implemented in CLBs and IOBs, and routes the CLBs and 
{OBs automatically. Once the design has been completed, it may 
be simulated using the software package P-SILOS. 


Conclusion 


The LCA device provides many advantages to the user. Its 
high gate count and I/O capability could potentially replace 
several PLDs in many applications, hence reducing board 
space. The LCA device is preferred by many customers over 
gate arrays because it is reprogrammable ‘on-the-fly' and there 
are no long design cycles and initial NRE cost of a gate array. 
The current design file, XDES15.LCA, is available upon 
request. The bit pattern and the .LCA file will be provided for 
programming the LCA device in an EPROM. 
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Figure 12. ESDI Translator LCA Design 
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Introduction 


The increasing complexity of today’s computers is requiring in- 
creased use of distributed control design techniques. Such a 
system might include a 32-bit microprocessor as the Central 
Processing Unit (CPU), a 16- or 8-bit microcontroller as a periph- 
eral controller, and a complex sequencer as a peripheral drive 
controller. The new PROSE (PROgrammable SEquencer) de- 
vice from Monolithic Memories offers a simple, fast, and effective 
method for implementation of peripheral drive controllers. 


This article will describe the design of a QIC-02 compatible tape 
drive sequencer in the PROSE device. The design requirements 
will be outlined, and then the design process will be described in 
detail for a section of the design. 


QIC-02 Tape Drive 


The peripheral to be controlled by our circuit is a 1/4” streaming 
tape drive that operates according to the QIC-02 interface stan- 
dard. This standard specifies the command set and resulting tape 
drive control signals, as shown in Figure 1. 






QIiC-02 COMMAND* 


D7 D6 D5 D4 D3 D2 D1 DO 


Read Status 0 0 1 1 4 4 1 ~41 


Beginning of Tape 1101 14 4 #1 «90 


Erase Entire Tape 











initialize Tape 


Write File Mark 


*The 8 bit commands shown in this table are inverted from the 
QIC-02 specification. This is because the PAL device has 
been programmed to accept active low signal inputs as 
defined by the QIC-02 specification. 


Figure 1. QiIC-02 Commands 
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Each command requires a certain sequence of events for execu- 
tion. For example, the Erase Tape command requires rewinding 
the tape, asserting the Erase and Initialize signals, moving the 
tape forward to the end, removing Erase and Initialize, and then 
rewinding the tape again. This type of function requires feedback 
of current state information, and additional inputs, for next state 
selection. This is the basic definition of a sequencer, or state 
machine (Figure 2). 


Sequencers are a common design element; they include any 
digital device which traverses through a sequence of states in an 
orderly fashion. Astate is a set of values (acount) measured at 
different parts of the circuit. State machines are often used for 
timing delays, control signal generation, arbitration, event moni- 
toring, and multiple condition testing. 


Our tape drive sequencer must provide the proper contro! signals 
for the tape drive and then hold them until executed, monitor the 
status of the tape drive, and watch for new commands from the 
microcontroller. The block diagram of the system is shown in 
Figure 3. 


The peripheral microcontroller generates the 8-bit QIC-02 com- 
mands in response to the central 32-bit microprocessor. The 
sequencer decodes the command, determining which of the 
commands has been sent, and then begins execution of the 
command. 


Handshaking is performed by a Request signal from the micro- 
controller, signifying that a valid command is available, and a 
Ready signal from the sequencer, signifying that the command 
has been decoded or executed and anew command can be sent 
(Figure 4). Upon reset, the PROSE device asserts READY, 
informing the peripheral controller that the tape drive is ready to 
accept a command. The peripheral controller asserts REQUEST 
after issuing a command on the eight-bit data bus. When the 
PROSE device detects REQUEST asserted, READY is negated 
and the command is decoded. After the command is decoded, 
READY is asserted by the PROSE device, at which time RE- 
QUEST is negated by the peripheralcontroller. When REQUEST 
is negated, READY is negated by the PROSE device. 


INPUTS OUTPUTS 






REGISTER 
COMBINATORIAL 
LOGIC 


ce REGISTER a) 


Figure 2. General State Machine. 
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After the PROSE device decodes the command, the command is 
executed by outputting the proper drive codes and asserting the 
proper signals. After the command has been completed, READY 
is asserted by the PROSE device, informing the peripheral 
controller that the next command may be issued. 


Command Execution 


The PROSE device must be reset after power up before any 
command execution takes place. This is accomplished by 
pulsing the Preset pin low. This places the PROSE device into the 
initialize state. The first command that is issued by the peripheral 
controller must be a Select Drive command. If the correct drive 
is selected, the PROSE device will enter a Drive Selected state. 
A Select Drive command that selects an incorrect drive or any 
other command willbe ignored. Once inthe Drive Selected state, 
any command issued will be executed. The drive is unselected 
and returned to the Initialize state by issuing a Select Drive 
command for a different drive. 


The following text describes the execution of each of the QIC-02 
commands (see QIC-02 specification for timing diagrams): 


Beginning of Tape Command 


The Beginning of Tape command code directs the PROSE device 
to output the Rewind drive code. This causes the tape drive to 
rewind the tape. When BOT (Beginning Of Tape) is asserted by 
the tape drive, the Rewind drive code is negated and READY is 
asserted. BOT is asserted when the Beginning of Tape hole is 
detected by the tape drive. 


ONLINE 


COMMAND 
a 
REQUEST 


PERIPHERAL 
CONTROLLER 


Erase Command 


The Erase command code directs the PROSE device to output 
the Rewind drive code until BOT is asserted, then assert ERASE 
and INITIALIZE. When ERASE is asserted, the tape drive 
activates the erase head and moves the tape forward until EOT 
(End Of Tape) is asserted by the tape drive. EOTis asserted when 
the End of Tape hole is detected by the tape drive. When EOT is 
asserted, ERASE and INITIALIZE are negated and the Rewind 
drive codeis output until BOT is asserted. When BOTis asserted, 
the Rewind drive code is negated and READY is asserted. 


Initialize Command 


The Initialize command code directs the PROSE device to assert 
INITIALIZE and to output the Rewind drive code. When BOT is 
asserted, the Rewind drive code is negated and the Fast Forward 
drive code is output. When EOT is asserted, the Fast Forward 
drive code is negated and the Rewind drive code is output. 
Finally, when BOT is asserted, the Rewind drive code and 
INITIALIZE are negated and READY is asserted. 


Write File Mark Command 
The Write File Mark command directs the PROSE device to 
output the Write File Mark drive code. When FMis asserted, the 


Write File Mark drive code is negated and READY is asserted. 
FM is asserted after the file mark has been written. 


EXECUTION 


peters 
sequencen | status | BANE 


READY S| 














416 03 
Figure 3. Peripheral Controller System Including Tape Drive Sequencer. 
DATA BUS VALID COMMAND 
REQUEST 
READY 
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Figure 4. Command Transfer Protocol. 
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Read Command 


Priorto issuing the Read command, ONLINE mustbe asserted by 
the peripheral controller. If ONLINE is not asserted when the 
Read command code is decoded, the Invalid Command drive 
code will be output and the PROSE device will wait for the next 
command. When the Read command is issued and ONLINE is 
asserted, the PROSE device outputs the Read drive code. Once 
the Read drive code has been output, the Read command may be 
terminated by either the tape drive or the peripheral controller. 
The tape drive terminates the Read command by asserting FM. 
FM is asserted when a file mark is detected. When this occurs, 
the Read command code is negated and READY is asserted. The 
peripheral controller may terminate the Read command by either 
negating ONLINE or by issuing a Read File Mark command. 
When ONLINE is negated, the Read drive code is negated and 
the Rewind drive code is output until BOT is detected, at which 
time READY is asserted. When the Read File Mark command is 
issued, the Read drive code is negated and the Read File Mark 
command is executed. 


Read File Mark Command 


The Read File Mark command directs the PROSE device to 
output the Read File Mark drive code. The Read File Mark code 
causes the tape drive to move to the next file mark. When FMis 
asserted by the tape drive, the Read File Mark drive code is 
negated and READY is asserted. FMis asserted when afile mark 
is detected. 


Read Status Command 


The Read Status command directs the PROSE device to transfer 
six bytes of status data to the peripheral controller. After the 
command is decoded, the PROSE enables the first byte by 
outputting the Status Byte 1 drive code and asserts READY. The 
peripheral controller responds by reading the status byte and 
asserting REQUEST. When the PROSE device detects RE- 
QUEST asserted, READY is negated and the next status byte is 
enabled. READY is not asserted until after the peripheral 
controller negates REQUEST. The remaining five status bytes 
are transferred in a similar manner. After all six bytes are 
transferred, READY is asserted. 


Write Command 


We will look at the Write command in detail as an example of the 
design process. The Write command allows the peripheral 
controller to write data onto the tape. Data is transferred via the 
same eight-bit bus that provides the command code. 


Prior to issuing the Write command, Online must be asserted by 
the peripheral controller. If Online is not asserted when the Write 
command code is decoded, the sequencer will assert the Invalid 
Command signal and wait for the next command. 


If Online is asserted when the Write command is decoded, the 
sequencer will assert the Write signal for the tape drive. Datais 
sent from the peripheral controller to the tape drive on the eight- 
bit bus, and the data is written onto the tape. 


The Write command may be terminated by either the tape drive 
or the peripheral controller. The tape drive may cause the Write 
command to be terminated ifthe end ofthe tapeis reached. When 
the early warning hole is detected by the tape drive, it asserts 
EWH. When this happens, the sequencer must negate the Write 
signa! and then assert the End of Media signal. It then waits to 
decode the next command. 


The peripheral controller may terminate the Write command 
when all of the data has been written. This may be noted by 
negating the Online signal, or by asserting the Request signal. 
When Online is negated, the sequencer must negate the Write 
signal. Itthen outputs the Write File Mark signal until the file mark 
is activated by the tape drive. Once the file mark is activated, the 
sequencer must rewind the tape. 


When the Request signal is asserted by the peripheral controller, 
the sequencer negates the Write signal. It then decodes the new 
command, which usually would be a Write File Mark command. 


State Diagram Representation 


All of this complex interaction can be described in a simple state 
diagram. State diagrams are the universal language for describ- 
ing state machines. The diagrams depict the two major opera- 
tions in a state machine: control sequencing and output genera- 
tion. Control sequencing is changing from present state to next 
state. It can be direct, based only upon present state, or 
conditional, based upon present state and input conditions. 
Similarly, output generation can be direct, based upon present 
state only, or conditional, based upon present state and input 
conditions. The former is called a Moore state machine, while the 
latter is a Mealy state machine. 


Transitions for our tape drive sequencer depend onthe state and 
conditional inputs, as with the Request and Online inputs during 
execution of the Write command. Outputs, however, will depend 
only on the present state; thus, our sequencer will be a Moore 
state machine. 


The state diagram for a Moore machine is simple to understand. 
Each state is represented by a bubble, and each transition is 
represented by an arrow. Conditions required for transitions, if 
any, are listed next to the arrows. Outputs associated with each 
state are listed with the state name in the bubble. 


Write Command State Diagram 


The state diagram for the Write command is shown below. 
Thirteen states are shown. The diagram is entered from the top, 
from the part of the state diagram that performs command 
decoding. WRCMD isthe state entered whenthe Write command 
is decoded. In this state the sequencer asserts the Ready signal, 
telling the peripheral controller that it has decoded the signal. The 
sequencer then waits for the peripheral controller to remove the 
Request signal before continuing to state WRCMDA, or A for 
short. 


In state A the sequencer immediately checks to make sure that 
Online is asserted before beginning the Write sequence. If it is 
not, the Invalid Command signal is output and the sequencer 
enters another part of the state diagram (CMDERRA). 
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If Online is asserted, the sequencer moves to state B, asserting 
the Write signal to the tape drive to allow writing to begin. The 
sequencer moves from states B, E, and F and back to B, until the 
Write process is interrupted. The tape drive responds by moving 
forward and writing the information provided on the eight-bit 
data bus. 


(COMMAND DECODING) 






(CMDERRA) 


(DRVSELD) 


(BOTCMDA) 


416 05 
Figure 5. Write Command State Diagram. 


As described, the Write commandcan be terminated by eitherthe 
tape drive or the peripheral controller. State B checks for 
assertion of the EWH (Early Warning Hole) signal from the tape 
drive. lf asserted, the sequencer immediately negates the Write 
signal in state C, and then asserts the EOM (End of Media) signal 
to the peripheral drive controller in state D. The sequencer then 
goes to the Drive Selected state (DRVSELD) and begins the 
command decoding procedure in another part of the state 
diagram. 


State E checks for assertion of the Request signal, signalling the 
end of the data to be written. In this case, the sequencer would 
immediately negate the Write signal and go to the Drive Selected 
state. 


State F checks for removal of the Online signal. If Online is true, 
the sequencer moves back to state B andcontinues writing. If not, 
it will move to state G, removing the Write signal, and then state 
H, outputting the Write File Mark signal. It then waits in state H 
until the File Mark signalis activated by the tape drive, indicating 
that the file mark has been written. When File Mark is activated, 
the Write File Mark signalis removed in state |, and the sequencer 
moves to the Beginning of Tape command sequence to rewind 
the tape. 


Device Selection 


So far, the design has been described and defined without regard 
to the implementation methodology. The complete sequencer 
can be implemented in several medium-size discrete logic de- 
vices, or one large custom circuit. 


The best choice is often a programmable logic device. Program- 
mable logic provides the integration of custom logic with the quick 
time-to-market of standard products. Especially important in this 
application is the ability to adapt to design changes. The QIC-02 
specification can be modified to enhance performance or add 
features. Of course, any design errors can also be quickly fixed, 
even after layout of the PC board. 


Programmable logic devices offer the complexity to implement 
large designs in few devices. At the same time, software support 
tools allow very simple design entry, even of complex designs. 
This design will require almost no additional work to describe for 
PLD software. Device programmers provide an instant custom 
device, and can even test the device immediately after 
programming. 


Selection of PROSE Device 


The PROSE device is selected as the heart of this application. 
The primary reason is the number of states required for implem- 
entation of the sequencer. We saw that the Write command alone 
had thirteen states, although some were transitions to other parts 
of the state diagram. There are eight other commands required, 
totaling over 100 states. Only the PROSE device provides 
enough states in a single programmable chip. 


Another requirement in many applications is high speed. The 
PROSE device offers a maximum frequency of 30 MHz internally. 
At the same time, the PROSE device has enough logic on board 
to require very few surrounding logic chips. 
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PROSE Device Description 


The PROSE device has the architecture shown in Figure 6. Itis 
based on a 128x21 PROM array, providing 128 states of 21 bits 
each. The 128 states will include the states WRCMD and its 
following states from A through I. Eight of the twenty-one bits in 
each state are the outputs of the device, providing signals such 
as the Write and Ready signals. 


The other thirteen bits in each state feed back internally for next 
state selection. Six of the feedback signals feed back into a PAL 
array, which is also fed by eight external inputs. These external 
inputs are the condition inputs, such as the Request and Online 
signals, which determine the next state. The feedback signals 
determine which of these eight inputs will determine the next 
branch. 


The outputs of the PAL array are two address lines to the PROM 
array. Before reaching the PROM array, however, these signals 
go through two Exclusive-OR (XOR) gates controlled by feed- 
back signals. These gates are used by the software to optimize 
the fit of the design within the device, and will be described later. 
The other five address lines to the PROM array are fed back 
directly from the previous state. Thus, two of the lines can be 
changed by input conditions, providing four-way branching capa- 
bility from any state. 


Expanding Inputs for the PROSE 
Device 


The PROSE device has a balanced architecture of eight inputs 
and eight outputs. However, we already know that we need more 
than eight inputs in the design. The eight bits of the command 
code from the peripheral controller need to be decoded, along 


(7-0) 


128X21 
PROM ARRAY 


MODE 
SDI SHADOW & SDO 
DCLK OUTPUT 
CLK REGISTERS 





Q(7-0) 
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Figure 6. PMS14R21 Architecture 


with the Request and Online signals, and the sequencer must 
monitor the tape drive signals, including the Early Warning Hole 
and File Mark signals already mentioned. In addition to these 
signals, the sequencer must monitor the End of Tape and Begin- 
ning of Tape signals from the tape drive, making atotal of fourteen 
inputs. 


Asimple solution is encoding the eight different eight-bit com- 
mands into a three-bit command code before reaching the 
PROSE device. This can be easily done in almost any 20-pin PAL 
device. Witha PALdevice as the solution, wecan easily add even 
more logic surrounding the sequencer, including drive selection. 
Drive selection logic requires that more commands be added to 
the original eight, forcing the use of four bits for the encoded 
commands. Three of the independent status signals from the 
tape drive can also be combined in the PAL device (Beginning of 
Tape, End of Tape, and File Mark), providing one Status signal. 
(Early Warning Hole remains a separate input.) 


The resulting logic block fits easily into a simple PAL20L8 combi- 
natorial 24-pin PAL device (Figure 7). Several versions of this 
device are available to choose from, including a CMOS device 
offering zero standby power, and high-speed devices at 15 
nanoseconds or less propagation delay. 


The encoding of the command codes, including drive selection, 
is shown in Figure 8. Now we must write the equations for 
PALASM 2 software for implementation in a PAL20L8. Writing the 
equations for each output is a simple matter; active-low equations 
are written where each product term is one of the commands in 
which that particular command code bit is zero. For example, the 
equation for CC3 is simply the sum of the products defining the 
Write, Read, Read File Mark, and Write File Mark commands. 
The equation would be: 


D7*/D6* DS* D4* D3* D2* D1* DO (Write) 
/D7* D6* D5* D4* D3* D2* D1* DO (Read) 
/D7* D6*/D5* D4* D3* D2* D1* DO (Read 
File Mark) 

+ D7*/D6*/DS* D4* D3* D2* D1* DO (Write 
File Mark) 


/CC3 


+ + Il 


where / means inversion, * means AND, and + means OR. The 
equations can be minimized either by observation or by running 
them through the program MINIMIZE in the PALASM 2 software 
suite. The full PALASM 2 file can be simulated for design 
verification, and then processed to create a JEDEC file for 
programming. 


Do—-D7 
DSO-DS1 CCO-CC3 
PAL20L8 
FM 
BOT STATUS 
EOT 


416 07 
Figure 7. PAL Encoding Logic Block. 
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Expanding Outputs for the PROSE 
Device 


The section of the design that we looked at in detail required five 
output signals: Ready, Invalid Command, Write, End of Media, 
and Write File Mark. Other required outputs include Read, Read 
File Mark, Fast Forward, Rewind, Erase, Initialize, and Drive 
Selected. These twelve outputs already extend us beyond the 
eight outputs available in the PROSE device. In addition, six 
Enable Status bits are required, making a total of eighteen 
outputs required. 


Asimple solution can again be provided by a programmable logic 
device, to decode outputs from the PROSE device into the 
required output signals. The requirement for many outputs is met 
by two PAL devices, the PAL6L16 and PAL8L14. However, even 
faster solutions are provided by two PLE (PROM as Logic 
Element) devices, the PLE5P16 and PLE6P16. These devices 
are specialized PROMs, both with sixteen outputs. The PLE5P16 
offers the highest speed at 18 ns propagation delay, and provides 
an output enable, which can be useful when testing the tape drive 
independent of the sequencer logic. 





COMMAND 


QIC-02 COMMAND 


Inthe PLE5P16 we want to decode four outputs from the PROSE 
device into fourteen signals. The four signals to be left as direct 
outputs from the PROSE device can be the signals used more 
often, or those that require higher speed. In this case, Ready, 
Drive Selected, Erase, and Initialize are left direct, while the other 
signals are decoded through the PLE device. The equations for 
the decoding are arbitrary, as long as every signal in asserted by 
its own code. (None of these signals are to be asserted at the 
same time.) The decoding is shown below. 


The generation of the equations is similar to that shown previ- 
ously. For example, 


STSO = /CD3*/CD2*/CD1* CDO 

is the decoding for asserting the Enable Status byte 0 signal. 
However, this time PLEASM software is used for design entry. 
The design method is equivalent to that for PAL devices in 


PALASM software, but the software creates a programming file in 
the PROM format. 


PAL COMMAND CODE 



























































D7 D6 D5 D4 D3 D2 D1 Do cc3) CC2 CC1 cco 
Select Drive 2* 1 1 1 1 1 1 0 1 1 1 1 0 
Select Drive 3* 1 1 1 1 1 0 1 1 1 1 j 0 
Select Drive 4" 1 1 1 1 0 1 1 1 1 1 1 0 
Read Status 0 0 1 1 1 1 1 1 ai 1 0 1 1 
Beginning of Tape 1 1 0 1 1 1 1 0 = 1 0 1 0 
1 1 0 1 j j 0 1 1 0 0 1 
Initialize Tape 1 1 0 1 1 0 1 1 1 0 0 0 
1 0 1 1 1 1 1 1 0 0 1 1 
0 1 1 1 1 1 1 1 0 0 1 0 
Read File Mark 0 1 0 1 1 1 1 1 0 0 0 1 
1 1 1 0 0 0 0 


Write File Mark 1 0 0 1 1 





* If the drive is the incorrect drive number, the four bit command code is 1101 
** Any command code not shown above will result in an illegal command code. 


Figure 8. PAL Command Codes Table. 
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CD3 CD2 CD1 CDpo SIGNAL ASSERTED 


| 


STSO (Enable Status byte 0) 














No Command 











STS1 (Enable Status byte 1) 





STS2 (Enable Status byte 2) 








STS3 (Enable Status byte 3) 








STS4 (Enable Status byte 4) 














—_— = 4 
0 1 1 0 STS5 (Enable Status byte 5) | 
L 0 1 1 EOM (End of Media) 














0 RFM (Read File Mark) 


1 0 0 WFM (Write File Mark) 




















1 0 1 READ (Read Enable) 
1 0 1 WRITE (Write Enable) 





FFWD (Fast Forward) 






RWD (Rewind) 








1 1 1 0 ICMD (Invalid Command) 


a ee a 


No Command 














Figure 9. PLE Device Decoding 


PROSE Design Specification 


We are now ready to discuss the details of the design specifica- 
tion for the PROSE device. The software used is a part of the 
PALASM 2 software suite called PROASM for PROSE Assem- 
bler. It requires almost no additional work on the part of the 
designer; it simply requires a description of the state diagram. 
PROASM has the following file format: 


* Declaration Section 
¢ State Section 
* Conditions Section 


Declaration Section 


The Declaration Section is the location of the general information 
about the design, including the design engineer's name and 
company. It provides for device selection; in this case, we are 
using the PROSE device, which has the part number PMS14Ra21. 
It also allows the definition of all of the inputs and output signal 
names. Remember, four of the inputs are PAL device outputs 
(CCO-CC3), and four of the outputs feed inputs to the PLE device 
(CDO-CD3). 


TITLE QIC-02 COMMAND DECODER 
PATTERN 1 
REVISION B 
AUTHOR KEN WON 
COMPANY MMI 
DATE JANUARY 19, 1987 
CHIP QIC_02_CMD DEC PMS14R21 
;INPUT PIN 1 2 3 4 

CLK DCLK /ONLINE /REQUEST 
;INPUT PIN 5 6 7 8. 9 10 


/cco /cci /Cc2 /CC3 STATUS EWH 
7 INPUT PIN 11 12 
SDI GND 


;OUTPUT PIN 13 14 #15 16 17 #18 
PRESET SDO CD3 CD2 CD1 CDO 


;OUTPUT PIN 19 : 20 21 22 
INITIALIZE ERASE DRSELD READY 
;OUTPUT PIN 23 24 
MODE VCC 


State Section 


The next part of the design file is the State section. This section 
begins with the keyword STATE. The first item required is 
selection of either a Moore or a Mealy type state machine. As 
discussed previously, a Moore type state machine is required. 


STATE 
MOORE_MACHINE 


Next, several general or default values are specified. First is 
selection of the programmable Preset/Enable function pin. Either 
can be used: in this case, Preset is desired, so the keyword 
MASTER_RESET is entered. 


MASTER_RESET 


Next are global default output values. Global default output 
values are the values to be assigned to the outputs when the 
design specification does not specify a particular value. This is 
not used, so no default output function is entered. However, the 
design file can be simplified by specifying a default value for the 
outputs (either High or Low), and then only specifying the output 
value when it differs from the default value. 


Next are global default transitions or branches. Global default 
branches are the branches to be taken when no branch is 
specified in the design file. This is amethod of not only simplifying 
the design file, but also guaranteeing that there is atransition from 
every state for every input condition. In this design, the default is 
chosen to be HOLD_STATE; this means that the sequencer will 
remain in the same state if no other transition is enabled. 


DEFAULT_BRANCH HOLD STATE 


Next is the specification of the state to enter after power-up. The 
PROSE device has power-up preset, whereby all twenty-one flip- 
flops power up into the logic One state. This state is called 
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POWER_UP. The transition from this state (on the first clock 
edge) shows how transition equations are written. The format is 


STATE_X := CONDITION_A -> STATE_Y 


meaning that when in State X, if condition A is true the machine 
will transition to state Y. In this design, we want to transition to 
state INIT on any condition, so the equation is written 


POWER_UP := VCC -> INIT 
where VCC means unconditionally. 


The first transition in the Write command state diagram requires 
two branches. The local default branch is indicated by the + sign, 
which is the “otherwise” operator. Thus, from state WRCMD, 
we have 


WRCMD := REQ -> WRCMD 
+-> WRCMDA 


which means “if REQ (Request) is true, go to state WRCMD (stay 
in the same state), otherwise (if REQ is not true) go to state 
WRCMDA.” 


The remaining transition equations are written similarly. In effect, 
the arrows in the state diagram are directly transcribed into 
arrows in the equations. 


WRCMDA := ONLN ~> WRCMDB 
+-> CMDERRA 


WRCMDB := EW -> WRCMDC 
+-> WRCMDE 


WRCMDC := VCC —-—> WRCMDD 
WRCMDD := VCC -> DRVSELD 


WRCMDE := REQ -> DRVSELD 
+-> WRCMDF 


WRCMDF := ONLN -> WRCMDB 
+-> WRCMDG 


WRCMDG := VCC  -> WRCMDH 


WRCMDH := STS -> WRCMDI 
+-> WRCMDH 


WRCMDI := VCC -> BOTCMDA 


Note that the equation for WRCMDH does not need the second 
line; the global default transition is to stay in the same state, so it 
does not need to be specified here. 


Output Definition 

The next task is to define the eight outputs for each of these 
states. Outputs are defined in the form of an equation. For 
example, 

STATE _X.OUIF := OUT_1 * /OUT_2 

means that in state X, output 1 will be High and output 2 will be 


Low. If other outputs exist, they will be assigned their default 
values. 


Output equations for the Write command section are written 
according to the state diagram. In all of the Write states, Drive 
Selected (DRVSELD) remains active. Thus, in state WRCMD 
only Drive Selected and the Ready signal are active, telling the 
peripheral controller that the Write command has been decoded 
and is ready to be executed. The resulting output equation for 
state WRCMD is 
WRCMD.OUTF := READY*/CD3* /CD2*/CD1*/CDO 
* /ERASE* /INITIALIZE* DRSELD 


The other states have similar output equations, translating the 
required output signal to the appropriate encoding for the PLE 
device to decode. 


WRCMDA.OUTF := /READY*/CD3*/CD2* /CD1*/CDO 
* /ERASE* /INITIALIZE* DRSELD 


WRCMDB.OUTF := /READY* CD3*/CD2* CD1* CDO 
* /ERASE* /INITIALIZE* DRSELD 


WRCMDC.OUTF := /READY*/CD3*/CD2*/CD1*/CDO 
* /ERASE* /INITIALIZE* DRSELD 


WRCMDD.OUTF := /READY*/CD3* CD2* CD1* CDO 
* /ERASE* /INITIALIZE* DRSELD 


WRCMDE.OUTF := /READY* CD3*/CD2* CD1* CDO 
* /ERASE*/INITIALIZE* DRSELD 


WRCMDF.OUTF := /READY* CD3*/CD2* CD1* CDO 
* /ERASE* /INITIALIZE* DRSELD 


WRCMDG.OUTF := /READY*/CD3*/CD2* /CD1*/CDO 
* /ERASE* /INITIALIZE* DRSELD 


WRCMDH.OUTF := /READY* CD3*/CD2*/CD1* CDO 
* /ERASE*/INITIALIZE* DRSELD 


WRCMDI.OUTF := /READY*/CD3*/CD2* /CD1*/CDO 
* /ERASE* /INITIALIZE* DRSELD 


Again, if default output values were used, these equations could 
be extremely simplified. 


Conditions Section 


The final section is the Conditions section, designated by the 
keyword Conditions. This section defines the conditions used in 
the state equations. In this design, the conditions are simply the 
inputs to the PROSE device, and are straightforward: 


CONDITIONS 


ONLN = ONLINE 
STS = STATUS 
EW = EWH 

REQ = REQUEST 


Other conditions used in the design file are also placed in this 
Conditions section. 
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Automatic State Assignment 


The other sections of the design file are completed in a similar 
manner, and the whole design is assembled by PROASM soft- 
ware. Akey feature of PROASM software, in addition to design 
verification, is automatic state assignment. State locations in the 
PROM and conditional product terms in the PAL array are used 
to the greatest possible efficiency, to provide the best fit of the 
logic within the device. 


The automatic fitting is easily seen in Figure 10. Each individual 
state can branch to one of four locations: these four locations form 
a “primary location.” For example, the hex addresses 1A, 3A, 5A, 
and 7A form a primary location, which can be designated by the 
smallest address, 1A. 


The software optimizes use of these primary locations to elimi- 
nate excessive duplication of states in different primary locations. 
For example, the state WRCMD is duplicated in primary location 


1A 19 39 
RFMCMD RDCMDA {| RDCMD 


WFMCMD RDCMD | { 









> 
= 
D 
ro) 
= 
o 






416 10 


Figure 10. State Assignments for the Six Primary PROM 
Locations Used for the Write Command Sequence. 
Arrow Indicate Potential State Transitions, to a Set of 
Circled Next States. All Next States Must be in the Same 
Primary Location. 


1A (hex address 3A) and 19 (hex address 79). This duplication 
is necessary because of the branch from WRCMD to either 
WRCNMD itself or WRCMDA (or A in the shortened form). The 
branch cannot be made to the primary location where WRCMD 
already exists, because the other three locations have already 
beenused. Thus, WRCMD must be duplicated in another primary 
location which has an open location for state A. 


The software then puts the two-way branch from state Ain primary 
location 5. It then makes use of the other half of the primary 
location for the two-way branch from state B. However, when it 
then assigns the two-way branch from state F to states BorG, it 
must duplicate B in primary location 4 since there is no room left 
in its primary location (5) for G. 


Note that the software avoids duplication of state DRVSELD, 
however. The two-way branch from state E to states DRVSELD 
or F is placed in the other half of primary location 4. When 
DRVSELD is branched to again from state D, the same location 
can be used, 


This duplication avoidance is where the XOR gates are used. To 
place a branch in the lower order half of a primary location, the 
branch addresses must be 00 or 01. This is accomplished by 
making the XOR feedback 00 and allowing a PAL product term to 
determine whether the second address bit should be a1. To use 
the higher order half of a primary location (branch addresses 10 
or 11), the XOR feedback would be made 10 and again a PAL 
product term would be used to determine the other bit. XOR 
feedback is also used to determine “right or left side” of a primary 
location, or to use a product term that already exists on one side 
of the PAL array. 


The PROASM software provides detailed implementation re- 
sults, showing how the states and product terms were assigned 
for every state and transition. 


PROSIM Simulation 


The entire sequencer can be simulated by PROSIM software, to 
verify the design. This can be done before any parts are 
programmed. Simulation results can be added to the program- 
ming file to provide automatic testing after programming. 


Completed Design 


The complete design requiredless than aweektoimplement from 
scratch. Two days were required to define the application. Only 
one day was required to write the PROSE design specification. 
Simulation and design checking took an additional day. Only half 
a day each were required for the PAL and PLE device designs. 
This included revisions to the PAL and PROSE devices to fix 
minor design errors. The complete design is shown in Figure 11, 
and the design files are shown on the following pages. 
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Conclusion 


Programmable logic devices were once the glue logic that tied 
together the intelligence of a digital system. Today, advanced 
chips such as the PROSE device are capable of implementing 
distributed control functions and make up the heart of the control 
of the system. With complementary advanced design tools 
available with PALASM 2 software, PLDs are an effective ap- 
proach to even the most complex tasks. 





READY 
CLK 
pease DCLK 
ONLINE 
REQUEST 
ot! 
06 cco 
D5 ea 
D4 
Da Z 
D2 5 § 1p|—_STATUS 
01 8 
va 9 
OSO 
im DS1 
VV 
VY 
EOT SDI 
FM BOT EWH 


Figure 11. QIC-02 Command Sequencer 











The PROSE device has been designed to implement a complex 
tape drive sequencer. Design implementation in the PROSE 
device is made easy for several reasons, including: 


1. Design file requires merely transcription of the state diagram 

2. Software automatically optimizes fit 

3. Programmability of the device allows design errors to be fixed 
quickly 


As aresult, the complete tape drive sequencer required only four 
days to design and implement. 
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INITIALIZE 
STSO 
STS1 
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STS4 
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THIS PAL DEVICE IS PART OF THE QIC-02 COMMAND SEQUENCER 


8 BIT COMMANDS INTO 4 BIT COMMAND CODES. IT IS ALSO USED TO 


; DESIGN. THE PRIMARY PURPOSE OF THIS PAL DEVICE IS TO ENCODE 


ENCODE TAPE DRIVE STATUS SIGNALS AND SELECT THE DRIVE NUMBER. 


TITLE QIC-02 COMMAND DECODER PAL 
PATTERN 

REVISION B 

AUTHOR KEN WON 

COMPANY MONOLITHIC MEMORIES 

DATE JANUARY 26, 1987 

CHIP QIC_02_CMD PAL PAL20L8 


D7 D6 D5 D4 D3 D2 D1 DO DSO DS1 FM GND 
BOT EOT NC /DSELD /CC2A /STATUS CC3 CC2 


cci cco NC vcc 


EQUATIONS 
STATUS = /EOT*/BOT*/FM ;ASSERT STATUS TO PMS14R21 
/CCO D7*D6*/D5*D4*D3*D2*D1*/D0 BOT COMMAND 








+ D7*D6*/D5*D4*D3*/D2*D1*DO ; INITIALIZE 

+ D7*/D6*/D5*D4*D3*D2*D1*DO ; WRITE FILE MARK 

+ /D7*D6*D5*D4*D3*D2*D1*DO ; READ 

+ D7*D6*D5*D4*DSELD ; SELECT DRIVE 
/CC1l = D7*D6*/D5*D4*D3*D2*/D1*DO ; ERASE 

+ D7*D6*/D5*D4*D3*/D2*D1*DO ; INITIALIZE 

+ D7*/D6*/DS*D4*D3*D2*D1*DO ; WRITE FILE MARK 

+ /D7*D6*/D5*D4*D3*D2*D1*DO ; READ PILE MARK 

+ D7*D6*D5*D4*/DSELD ; UNSELECT DRIVE 
/CC2 = D7*D6*/D5*D4*D3*D2*D1*/D0 ; BOT COMMAND 

+ D7*D6*/D5*D4*D3*D2*/D1*D0 ; ERASE 

+ D7*D6*/D5*D4*D3*/D2*D1*DO ; INITIALIZE 

+ D7*/D6*D5*D4*D3*D2*D1*D0 ; WRITE 

+ D7*/D6*/D5*D4*D3*D2*D1*DO ; WRITE FILE MARK 

+ /D7*D6*/D5*D4*D3*D2*D1*DO ; READ FILE MARK 

+ /D7*D6*D5*D4*D3*D2*D1*DO ; READ 

+ /D7*/D6*D5*D4*D3*D2*D1*DO ; READ STATUS 

; (MINIMIZATION REDUCES 
; PRODUCT TERMS) 

/CC3 = D7*/D6*D5*D4*D3*D2*D1*DO ; WRITE 

+ D7*/D6*/D5*D4*D3*D2*D1*DO ; WRITE FILE MARK 

+ /D7*D6*D5*D4*D3*D2*D1*D0 ; READ 

+ /D7*D6*/D5*D4*D3*D2*D1*DO ; READ FILE MARK 
DSELD = /DO*/DS1*/DS0*D1*D2*D3 ; DRIVE 1 SELECT 

+ /D1*/DS1*DS0*DO*D2*D3 ; DRIVE 2 SELECT 

+ /D2*DS1*/DS0*D0*D1*D3 ; DRIVE 3 SELECT 

+ /D3*DS1*DSO*DO*D1*D2 ; DRIVE 4 SELECT 
SIMULATION 


; THIS SIMULATION FILE TESTS THE OUTPUT FOR ALL VALID AND ONE INVALID 
; COMMAND INPUT 


TRACE ON D7 D6 DS D4 D3 D2 D1 DO FM EOT BOT CCO CCl CC2 CC3 /STATUS /DSELD 


SETF D7 D6 /DS5 D4 D3 D2 Dl /DO ;BEGINNING OF TAPE 
SETF D7 D6 /D5 D4 D3 D2 /D1 DO 7; ERASE 

SETF D7 D6 /D5 D4 D3 /D2 D1 DO ; INITIALIZE 

SETF D7 /D6 D5 D4 D3 D2 D1 DO ;WRITE 

SETF D7 /D6 /D5 D4 D3 D2 D1 DO ;WRITE FILE MARK 
SETF /D7 D6 DS D4 D3 D2 D1 DO ;READ 


SETF 
SETF 
SETF 
SETFP 


/D7 D6 /D5 D4 D3 D2 Dl DO 
/D7 /D6 D5 D4 D3 D2 Dl DO 
/D7 D6 D5 D4 /D3 /D2 D1 DO 
EOT 


;READ FILE MARK 
;READ STATUS 

; INVALID COMMAND 
;ASSERT EOT 


SETF /EOT ;NEGATE EOT 

SETF BOT ;ASSERT BOT 

SETF /BOT ;NEGATE BOT 

SETF FM ;ASSERT FM 

SETF /FM ;NEGATE FM 

SETF DS1 /DSO ;SET DRIVE NUMBER TO 3 
SETF D7 D6 D5 D4 D3 /D2 D1 DO ;SELECT DRIVE 3 


SETF 


D7 D6 D5 D4 D3 D2 /D1 DO 


TRACE_OFF 


;UNSELECT DRIVE 3 
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;THIS IS A SECOND REVISION OF AN APPLICATION FOR THE PROSE DEVICE. 
;THE ORIGINAL APPLICATION IS CALLED 14R21.APP. THIS APPLICATION 

; IMPLEMENTS A QIC-02 COMMAND DECODER USING A PROSE DEVICE, A PAL 
;DEVICE, AND A PLE DEVICE. THE PROSE DEVICE FILE IS CALLED 
;CMD_PAL.APP AND THE PLE DEVICE FILE IS CALLED CMD PLE.APP. 


TITLE QIC-02 COMMAND DECODER 


PATTERN 

REVISION B 

AUTHOR KEN WON 

COMPANY MMI 

DATE JANUARY 19, 1987 


CHIP QICO2_CMD_DEC PMS14R21 


CLK DCLK /ONLINE /REQUEST /CCO /CCl /CC2 /CC3 STATUS EWH SDI GND 
PRESET SDO CD3 CD2 CDl CDO INITIALIZE ERASE DRSELD READY MODE VCC 
STATE 

MOORE_MACHINE 

MASTER_RESET 

DEFAULT BRANCH HOLD_STATE 

POWER_UP ;=VCC->INIT- 


7@@@@@@ EQUATIONS @@eeGe 
peakKKK DRIVE SELECTION *&*4e% 


INIT s= REQ -> DRVSELDA ;IF /REQUEST IS TRUE, CHECK 
; FOR DRIVE SELECTION 
+-> INIT ;OTHERWISE GOTO INIT 
DRVSELDA := CONDO -> DRVSELDB 
+-> INIT 
DRVSELDB := COND1 -> INIT 
+-> DRVSELDC 
DRVSELDC := COND2 -> INIT 
+-> DRVSELDD 
DRVSELDD := COND3 -> INIT 
+-> DRVSELDE ;DRIVE IS SELECTED 
DRVSELDE := REQ -> DRVSELDE ;WAIT FOR REQ DEACTIVATED 
+-> DRVSELDF ;THEN GOTO DRVSELDF 
DRVSELDF := VCC -> DRVSELD ;DRIVE SELECTION COMPLETE 


INIT.OUTF:= /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*/DRSELD 
DRVSELDA.OUTF:= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE* /DRSELD 
DRVSELDB.OUTF:= READY*/CD3*/CD2*/CD1* /CD0*/ERASE* /INITIALIZE*/DRSELD 
DRVSELDC.OUTF:= READY*/CD3*/CD2*/CD1*/CD0* /ERASE*/INITIALIZE*/DRSELD 
DRVSELDD.OUTF:= READY*/CD3*/CD2*/CD1*/CD0*/ERASE* /INITIALIZE* /DRSELD 
DRVSELDE.OUTF:= /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/ INITIALIZE* /DRSELD 
DRVSELDF.OUTF:= READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*/DRSELD 
DRVSELD.OUTF;= /READY*/CD3*/CD2*/CD1*/CD0*/ERASE* /INITIALIZE*DRSELD 
;****** COMMAND DECODING *#***% 


DRVSELD := REQ -> CMDDECA ;IF /REQUEST IS TRUE, 
; DECODE COMMAND 
++> DRVSELD ;OTHERWISE GOTO DRVSELD 
CMDDECA := COND3 -> RDWRCMD ;IF CC3 IS TRUE, GOTO 
; READ/WRITE DECODING 
+-> CMDDECB ;OTHERWISE CONTINUE 
; COMMAND DECODING 
RDWRCMD := COND2 -> RDWRCMDA ;IF CC2 IS TRUE, GOTO 
; READ/WRITE DECODING 
+~-> CMDERR ;OTHERWISE GOTO CMDERR 
RDWRCMDA := COND1 -> FMCMD ;IF CCl IS TRUE, COMMAND 
; IS A FILE MARK COMMAND 
+-> RDWRCMDB ;OTHERWISE READ OR WRITE 
RDWRCMDB := CONDO -> RDCMD ;IF CCO TRUE, COMMAND READ 
+-> WRCMD ;OTHERWISE COMMAND IS WRITE 
FMCMD := CONDO -> WFMCMD ;IF CCO IS TRUE, COMMAND IS 
; WRITE FILE MARK 
+-> RFMCMD ;OTHERWISE COMMAND IS READ 
; FILE MARK 
CMDDECB := COND2 -> CMDDECC ;IF CC2 IS TRUE, CONTINUE 
3; COMMAND DECODING 
+-> DESELDA ;OTHERWISE CHECK FOR 
; DESELECTING DRIVE 
CMDDECC := COND1 -> CMDDECD ;CONTINUE DECODING COMMAND 
+-> CMDDECE ;CONTINUE DECODING COMMAND 
CMDDECD := CONDO -> INITCMD ;IF CCO TRUE, COMMAND INIT 
+-> ERASECMD ;OTHERWISE COMMAND IS ERASE 
CMDDECE := CONDO -> BOTCMD ;IF CCO TRUE, COMMAND BOT 
+-> RDSTSCMD ;OTHERWISE COMMAND IS READ 
; STATUS 
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CMDDECA.OUTF: 
CMDDECB.OUTF: 
CMDDECC.OUTF: 
CMDDECD.OUTF: 


READY */CD3*/CD2*/CD1*/CD0* /ERASE* /INI TIALIZE* DRSELD 
READY * /CD3*/CD2*/CD1* /CD0* /ERASE* / INITIALIZE*DRSELD 
READY */CD3*/CD2*/CD1* /CD0* /ERASE* / INITIALIZE*DRSELD 
READY */CD3*/CD2*/CD1*/CD0* /ERASE* / INITIALIZE*DRSELD 
CMDDECE.OUTF:= READY*/CD3*/CD2*/CD1*/CD0* /ERASE* /INITIALIZE*DRSELD 
RDWRCMD.OUTF:= READY*/CD3*/CD2* /CD1* /CD0* /ERASE* / INITIALIZE*DRSELD 
RDWRCMDA.OUTF:= READY*/CD3*/CD2*/CD1* /CD0* /ERASE* / INITIALI ZE*DRSELD 
RDWRCMDB.OUTF:= READY*/CD3*/CD2*/CD1*/CD0* /ERASE* /INITIALIZE*DRSELD 
FMCMD.OUTF:= READY*/CD3*/CD2*/CD1*/CD0*/ERASE* /INITIALIZE*DRSELD 


;***k*kKK DESELECT DRIVE ***ee% 


DESELDA >= COND1 -> DESELDB ;CONTINUE CHECKING FOR 
; DESELECT COMMAND 
+-> DESELDC ;OTHERWISE GOTO DESELDA 
DESELDB := CONDO -> CMDERR ;IF CCO IS TRUE, COMMAND 
; IS INVALID 
+-> DESELDD ;OTHERWISE DESELECT DRIVE 
DESELDC := CONDO -> DRVSELDE ;IF CCO IS TRUE, GOTO DRIVE 
3 SELECTION 
+-> CMDERR ;OTHERWISE COMMAND INVALID 
DESELDD := REQ -> DESELDD ;WAIT FOR REQ DEACTIVATED 
+-> DESELDE ;THEN GOTO DESELDE 
DESELDE 2= vce -> INIT ;GOTO DRIVE SELECTION 


DESELDA.OUTF: 
DESELDB.OUTF: 
DESELDC.OUTF: 
DESELDD.OUTF: 
DESELDE.OUTF: 


/READY* /CD3*/CD2* /CD1* /CD0* /ERASE* /INITIALIZE*DRSELD 
/READY*/CD3*/CD2*/CD1* /CD0*/ERASE* /INITIALIZE*DRSELD 
/READY*/CD3*/CD2*/CD1* /CD0* /ERASE* /INITIALIZE*DRSELD 
/READY* /CD3*/CD2* /CD1* /CD0* /ERASE* /INITIALIZE*DRSELD 
READY* /CD3* /CD2*/CD1*/CD0* /ERASE*/INITIALIZE*DRSELD 





;***k*A TNVALID COMMAND *4 #444 


CMDERR := REQ -> CMDERR ;WAIT FOR REQ DEACTIVATED 
+-> CMDERRA ;THEN OUTPUT INVALID 
; COMMAND CODE 
CMDERRA 2= VCC ~> CMDERRB ; INVALID COMMAND CODE 
CMDERRB := vcc -> CMDERRC ;NEGATE CODE 
CMDERRC 7= VCC -> DRVSELD ;GOTO COMMAND DECODING 


CMDERR.OUTF:= /READY*/CD3*/CD2* /CD1*/CD0* /ERASE*/INITIALIZE*DRSELD 
CMDERRA. OUTF:= READY*CD3*CD2*CD1*/CD0O* /ERASE* /INITIALIZE*DRSELD 
CMDERRB.OUTP:= READY*CD3*CD2*CD1*/CD0*/ERASE* /INITIALIZE*DRSELD 
CMDERRC.OUTF:= READY*/CD3*/CD2*/CD1*/CDO0* /ERASE* /INITIALIZE*DRSELD 


pxxk*KAK BOT COMMAND *4**44% 


BOTCMD := REQ -> BOTCMD ;WAIT FOR REQ DEACTIVATED 
+-> BOTCMDA ;THEN REWIND COMMAND CODE 
BOTCMDA := STS -> BOTCMP ;WHEN STS IS TRUE, NEGATE 
; OUTPUT CODE 
BOTCMP := Vcc -> DRVSELD ;GOTO DRVSELD 


BOTCMD.OUTF:= /READY*/CD3*/CD2*/CD1*/CD0*/ERASE*/INITIALIZE*DRSELD 
BOTCMDA.OUTF:= READY*CD3*CD2*/CD1*CD0*/ERASE*/INITIALIZE*DRSELD 
BOTCMP.OUTF:= READY*/CD3*/CD2*/CD1*/CD0*/ERASE* /INITIALIZE*DRSELD 


;**A*KEK TNITIALIZE COMMAND ****e* 


INITCMD := REQ -> INITCMD ;WAIT FOR REQ DEACTIVATED 
+-> INITCMDA ;THEN OUTPUT REWIND COMMAND 
+ CODE & ASSERT INITIALIZE 
INITCMDA = STS -> INITCMDB ;WHEN STS IS TRUE, NEGATE 
; COMMAND CODE 
INITCMDB := vec -> INITCMDC 7ASSERT FFWD COMMAND CODE 
INITCMDC := STS -> INITCMDD ;WHEN STS IS TRUE, NEGATE 
; COMMAND CODE 
INITCMDD s= vcc -> INITCMDE +ASSERT REWIND COMMAND CODE 


INITCMDE := STS -> INITCMP ;WHEN STS IS TRUE, NEGATE 
; ; COMMAND CODE & INITIALIZE 


"t 


INITCMP := vcc -> DRVSELD 


INITCMD.OUTF:= /READY*/CD3*/CD2*/CD1*/CD0* /ERASE* /INITIALIZE*DRSELD 
INITCMDA.OUTF:= READY*CD3*CD2*/CD1*CDO*/ERASE* INITIALIZE*DRSELD 
INITCMDB.OUTF:= READY*/CD3*/CD2* /CD1*/CD0* /ERASE* INITIALI ZE*DRSELD 
INITCMDC.OUTF:= READY*CD3*CD2*/CD1*/CDO*/ERASE* INITIALIZE*DRSELD 
INITCMDD.OUTF:= READY*/CD3*/CD2*/CD1*/CD0* /ERASE* INITIALIZE*DRSELD 
INITCMDE.OUTF:= READY*CD3*CD2*/CD1*CDO* /ERASE* INITIALI ZE*DRSELD 
INITCMP.OUTF:= READY*/CD3*/CD2*/CD1*/CD0* /ERASE* /INITIALIZE*DRSELD 


tow anon 
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;*#*e*** ERASE COMMAND **4**** 


ERASECMD 
ERASECMDA 


ERASECMDB 
ERASECMDC 


ERASECMDD 
ERASECMDE 


ERASECMDF 


ERASECMD.OUTF:= 


ERASECMDD.OUTF: 


= REQ 


STs 


vcc 
STS 


vcc 
sts 


vec 


+-> 
-> 


Vv 


-> 
-> 


-> 
-> 


-> 


ERASECMD 
ERASECMDA 
ERASECMDB 


ERASECMDC 
ERASECMDD 


ERASECMDE 
ERASECMDF 


DRVSELD 


;WAIT FOR /REQ DEACTIVATED 
;THEN REWIND COMMAND CODE 
;WHEN STS IS TRUE, NEGATE 
; COMMAND CODE 

;ASSERT ERASE & INITIALIZE 
;WHEN STS IS TRUE, NEGATE 
; ERASE AND INITIALIZE 
;OUTPUT REWIND COMMAND CODE 
;WHEN STS IS TRUE, NEGATE 
; COMMAND CODE 

;GOTO DRVSELD 


/READY* /CD3*/CD2*/CD1*/CD0*/ERASE* /INITIALIZE*DRSELD 
ERASECMDA.OUTF:= READY*CD3*CD2*/CD1*CD0*/ERASE* /INITIALIZE*DRSELD 
ERASECMDB.OUTF:= READY*/CD3*/CD2*/CD1*/CD0*/ERASE* /INITIALIZE*DRSELD 
ERASECMDC.OUTF:= READY*/CD3*/CD2*/CD1*/CD0*ERASE* INITIALIZE*DRSELD 
READY* /CD3*/CD2*/CD1*/CD0* /ERASE* /INITIALIZE*DRSELD 
ERASECMDE.OUTF:= READY*CD3*CD2*/CD1*CD0*/ERASE* /INITIALIZE*DRSELD 
ERASECMDF.OUTF:= READY*/CD3*/CD2*/CD1* /CD0*/ERASE* /INITIALIZE*DRSELD 


p****** READ STATUS COMMAND ****4*% 


RDSTSCMD 


RDSTSAA 
RDSTSA 
RDSTSB 


RDSTSC 


RDSTSD 
RDSTSE 


RDSTSF 


RDSTSG 
RDSTSH 


RDSTSI 


RDSTSJ 
RDSTSK 


RDSTSL 


RDSTSM 
RDSTSN 


RDSTSO 


RDSTSP 
RDSTSQ 


RDSTSR 


RDSTSCMD.OUTF: = 


RDSTSA.OUTF: 
RDSTSB.OUTF: 
RDSTSC.OUTF: 
RDSTSD.OUTF: 
RDSTSE.OUTF: 
RDSTSF .OUTF: 
RDSTSG.OUTF: 
RDSTSH.OUTF: 
RDSTSI.OUTF: 
RDSTSJ.OUTF: 
RDSTSK.OUTF: 
RDSTSL.OUTF: 
RDSTSM.OUTF: 
RDSTSN.OUTF: 
RDSTSO.OUTF: 
RDSTSP.OUTF: 
RDSTSQ.OUTP: 
RDSTSR.OUTF: 


Hou 


= REQ 


vcc 
vec 
REQ 


REQ 


vcc 
REQ 


REQ 


vcc 
REQ 


REQ 


vcc 
REQ 


REQ 


vcc 
REQ 


REQ 


vcc 
REQ 


REQ 


-> 
+-> 
-> 
-> 
-> 


-> 


+-> 
-> 
-> 


-> 
+-> 
-> 
-> 


-> 
+-> 
-> 
-> 


-> 
+-> 
-> 
-> 


—2 
+-> 
<-> 
=>. 


-> 
+-> 


RDSTSCMD 
RDSTSAA 
RDSTSA 
RDSTSB 
RDSTSC 


RDSTSC 


RDSTSD 
RDSTSE 
RDSTSF 


RDSTSF 
RDSTSG 
RDSTSH 
RDSTSI 


RDSTSI 
RDSTSJ 
RDSTSK 
RDSTSL 


RDSTSL 
RDSTSM 
RDSTSN 
RDSTSO 


RDSTSO 
RDSTSP 
RDSTSQ 
RDSTSR 


RDSTSR 
DRVSELD 


;WAIT FOR REQ DEACTIVATED 
;THEN NEGATE READY 

;ENABLE STATUS BYTE 0 
;ASSERT READY 

;WHEN REQUEST IS ACTIVATED, 
3 NEGATE READY 

;WAIT FOR REQUEST TO BE 

; DEACTIVATED 

:THEN ENABLE STATUS BYTE 1 
;ASSERT READY 

;WHEN REQUEST IS ACTIVATED, 
3; NEGATE READY 

;WAIT FOR REQ DEACTIVATED 
;THEN ENABLE STATUS BYTE 2 
; ASSERT READY 

;WHEN REQUEST IS ACTIVATED, 
; NEGATE READY 

;WAIT FOR REQ DEACTIVATED 
;THEN ENABLE STATUS BYTE 3 
;ASSERT READY 

;WHEN REQUEST IS ACTIVATED, 
; NEGATE READY 

;WAIT FOR REQ DEACTIVATED 
;THEN ENABLE STATUS BYTE 4 
;ASSERT READY 

;WHEN REQUEST IS ACTIVATED, 
; NEGATE READY 

;WAIT FOR REQ DEACTIVATED 
;THEN ENABLE STATUS BYTE 5 
;ASSERT READY 

;WHEN REQUEST ACTIVATED, 

; NEGATE READY & COMMAND 
;WAIT FOR REQ DEACTIVATED 
;THEN GOTO DRVSELD 


/ READY * /CD3*/CD2* /CD1 * /CD0* /ERASE* / INITIALIZE*DRSELD 
RDSTSAA.OUTF:= READY*/CD3*/CD2*/CD1* /CD0* /ERASE* /INITIALIZE*DRSELD 


READY * /CD3* /CD2* /CD1*CD0* /ERASE* / INITIALIZE*DRSELD 
/READY*/CD3* /CD2*/CD1*CD0*/ERASE* /INITIALIZE*DRSELD 
READY* /CD3* /CD2* /CD1*CD0* /ERASE* / INITIALIZE*DRSELD 
READY* /CD3*/CD2*CD1* /CD0* /ERASE* / INITIALIZE*DRSELD 
/READY * /CD3* /CD2*CD1*/CD0* /ERASE* /INITIALIZE*DRSELD 
READY * /CD3*/CD2*CD1*/CD0* /ERASE* / INITIALIZE*DRSELD 
READY*/CD3* /CD2*CD1*CD0*/ERASE* /INITIALIZE*DRSELD 
/READY*/CD3* /CD2*CD1*CD0* /ERASE* / INITIALIZE*DRSELD 
READY* /CD3*/CD2*CD1*CD0*/ERASE* / INITIALIZE*DRSELD 
READY */CD3*CD2* /CD1*/CD0*/ERASE* /INITIALIZE*DRSELD 
/READY* /CD3*CD2* /CD1*/CD0* /ERASE* / INITIALIZE*DRSELD 
READY * /CD3*CD2*/CD1*/CD0* /ERASE* /INITIALIZE*DRSELD 
READY */CD3*CD2* /CD1*CD0* /ERASE* /INITIALIZE*DRSELD 
/READY */CD3*CD2* /CD1*CD0* /ERASE* /INITIALIZE*DRSELD 
READY */CD3*CD2*/CD1*CD0 */ERASE*/ INITIALIZE*DRSELD 
READY * /CD3*CD2*CD1*/CD0*/ERASE* /INITIALIZE*DRSELD 
/READY* /CD3*CD2*CD1*/CD0* /ERASE* / INITIALIZE*DRSELD 
READY */CD3* /CD2*/CD1*/CD0* /ERASE* / INITIALIZE*DRSELD 
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pRaaAKK WRITE FILE 


WPMCMD 


WFMCMDA 
WFMCMDB 


WFMCMDC 





:= REQ 


vcc 
:= STS 


vcc 


it 


MARK COMMAND ****#% 


-> WFMCMD 
WFMCMDA 
-> WFMCMDB 
-> WFMCMDC 


-> DRVSELD 








;WHEN REQUEST DEACTIVATED, 
; NEGATE READY & COMMAND 
;OTHERWISE GOTO WFMCMDA 
;OUTPUT WFM COMMAND CODE 
;WHEN STS IS ACTIVATED, 

; NEGATE COMMAND CODE 
;GOTO DRVSELD 


WFMCMD.OUTF:= /READY*/CD3*/CD2*/CD1* /CD0* /ERASE* /INITIALIZE*DRSELD 


WFMCMDA.OUTF: 
WFMCMDB.OUTF: 
WFMCMDC.OUTF: 


pkkkeee READ 
RFMCMD 


REMCMDA 
RFMCMDB 


RFMCMDC 


RFMCMD.OUTF:= 
RFMCMDA.OUTPF: 
RFMCMDB.OUTF: 
RFMCMDC.OUTF: 


“ou at 


FILE MARK COMMAND ****4% 


:= REQ 


-> RFMCMD 
+-> RFMCMDA 
-> RFMCMDB 
-> RFMCMDC 


-> DRVSELD 


READY* /CD3*/CD2*/CD1* /CD0* /ERASE* /INITIALIZE*DRSELD 
READY *CD3* /CD2* /CD1*CD0O* /ERASE* /INITIALIZE*DRSELD 
READY * /CD3* /CD2*/CD1* /CD0* /ERASE* /INITIALIZE*DRSELD 


;WHEN REQUEST DEACTIVATED, 
; NEGATE READY & COMMAND 
;OTHERWISE GOTO RFMCMDA 
;OUTPUT RFM COMMAND CODE 
;WHEN STS IS TRUE, NEGATE 
; COMMAND CODE 

;GOTO DRVSELD 


/READY* /CD3*/CD2*/CD1*/CD0*/ERASE*/ INITIALIZE*DRSELD 


;’*kkK® READ COMMAND *#*#*% 


RDCMD 


RDCMDA 


RDCMDB 


RDCMDD 


RDCMDF 
RDCMDG 
RDCMDE 


RDCMDC 


REQ -> 
+-> 

ONLN -> 
+-> 

sts -> 
+-> 

ONLN -> 
+-> 

vcc -> 
sts -> 
REQ -> 
+-> 

:= VCC -> 


RDCMD 


RDCMDA 
RDCMDB 


CMDERRA 
RDCMDC 


RDCMDD 
RDCMDE 


RDCMDF 
RDCMDG 
RDCMDC 
DRVSELD 


RDCMDB 
DRVSELD 


READY* /CD3*/CD2*/CD1* /CD0* /ERASE*/INITIALIZE*DRSELD 
READY *CD3*/CD2* /CD1*/CDO* /ERASE* / INITIALIZE*DRSELD 
READY*/CD3* /CD2*/CD1* /CD0* /ERASE* /INITIALIZE*DRSELD 





;WHEN REQUEST DEACTIVATED, 
; NEGATE READY 

;OTHERWISE GOTO RDCMD 

;IF ONLINEB IS TRUE, OUTPUT 
; READ COMMAND CODE 

;IF ONLINEB FALSE, OUTPUT 

; INVALID COMMAND CODE 

;IF FILE MARK IS FOUND, 

; NEGATE COMMAND CODE 
;OTHERWISE GOTO RDCMDD 

;IF ONLINE IS TRUE, GOTO 

; RDCMDE 

;OTHERWISE NEGATE COMMAND 

; CODE 

;OUTPUT REWIND COMMAND CODE 
;NEGATE COMMAND CONF 

;IF REQUEST IS TRUE, GOTO 

; DRVSELD 

;OTHERWISE GOTO RDCMDB 
;GOTO DRVSELD 


RDCMD.OUTF:= READY*/CD3*/CD2*/CD1*/CD0* /ERASE* /INITIALIZE*DRSELD 
/READY*/CD3* /CD2*/CD1* /CD0* /ERASE* /INITIALIZE*DRSELD 
/READY *CD3* /CD2*CD1*/CD0* /ERASE* / INITIALIZE*DRSELD 
/ READY */CD3* /CD2*/CD1*/CD0* /ERASE*/ INITIALIZE*DRSELD 
/READY*CD3* /CD2*CD1 */CD0* /ERASE* / INITIALIZE*DRSELD 
/READY *CD3*/CD2*CD1*/CD0* /ERASE* /INITIALIZE*DRSELD 
/READY */CD3* /CD2*/CD1* /CD0*/ERASE* / INITIALIZE*DRSELD 
/READY *CD3*CD2*/CD1 *CD0* /ERASE* / INITIALIZE*DRSELD 


RDCMDA.OUTF: 
RDCMDB.OUTF: 
RDCMDC.OUTF: 
RDCMDD.OUTF: 
RDCMDE.OUTF: 
RDCMDF.OUTF: 
RDCMDG.OUTF: 


peakaaK WRITE 
WRCMD 


WRCMDA 


WRCMDB 
WRCMDC 
WRCMDD 


WRCMDE 


WRCMDF 


COMMAND ***&&* 


:= REQ -> 
+-> 

ONLN -> 
+-> 

EW -> 
+-> 

vcc -> 
vcc -> 
REQ -> 
+-> 

ONLN -> 
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WRCMD 


WRCMDA 
WRCMDB 


CMDERRA 
WRCMDC 


WRCMDE 
WRCMDD 


DRVSELD 
DRVSELD 


WRCMDF 
WRCMDB 


;WHEN REQUEST DEACTIVATED, 
; NEGATE READY 

;OTHERWISE GOTO WRCMD 

;IF ONLINE IS TRUE, OUTPUT 
WRITE COMMAND CODE 

IF ONLINE FALSE, OUTPUT 
INVALID COMMAND CODE 

IF EWH IS DETECTED, 
NEGATE COMMAND CODE 
;OTHERWISE GOTO WRCMDE 
;OUTPUT END OF MEDIA 
; COMMAND CODE 

;GOTO DRVSELD 

;IF REQUEST IS TRUE, 
7 DRVSELD 

;OTHERWISE GOTO WRCMDF 
;IF ONLINEB IS TRUE, GOTO 
; WRCMDB 


me we te we te 8 


GOTO 
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+-> WRCMDG ;NEGATE COMMAND CODE 
WRCMDG s= vcc -> WRCMDH ;OUTPUT WFM COMMAND CODE 
WRCMDH := STS -> WRCMDI ;IF FM IS ACTIVATED, GOTO 
; WRCMDI 
+-> WRCMDH ;OTHERWISE GOTO WRCMDH 
WRCMDI := VCC -> BOTCMDA ;REWIND TAPE 
WRCMD.OUTF:= READY*/CD3*/CD2*/CD1*/CD0* /ERASE* /INITIALIZE*DRSELD 


/READY* /CD3*/CD2*/CD1*/CD0* /ERASE*/ INITIALIZE*DRSELD 
/ READY *CD3* /CD2*CD1 *CD0* /ERASE* / INITIALIZE*DRSELD 
/READY* /CD3*/CD2*/CD1*/CD0* /ERASE* / INITIALIZE*DRSELD 
/READY* /CD3*CD2*CD1 *CD0* /ERASE* /INITIALIZE*DRSELD 
/READY *CD3* /CD2*CD1 *CD0* /ERASE*/ INITIALIZE*DRSELD 

/ READY *CD3* /CD2*CD1*CD0*/ERASE* /INITIALIZE*DRSELD 

/ READY * /CD3*/CD2*/CD1*/CD0* /ERASE*/ INITIALIZE*DRSELD 
/READY *CD3* /CD2*/CD1*CD0* /ERASE* / INITIALIZE*DRSELD 
/READY* /CD3*/CD2*/CD1* /CD0* /ERASE* / INITIALIZE*DRSELD 


WRCMDB.OUTF 

WRCMDC.OUTF: 
WRCMDD.OUTF: 
WRCMDE.OUTF: 
WRCMDF.OUTF: 
WRCMDG.OUTF: 
WRCMDH.OUTF: 
WRCMDI.OUTF: 


WRCMDA.OUTF: 


unui dt tun wo 


7@@@@@@ CONDITIONS @@@eee 


CONDITION 

CONDO = cco 
COND1 = ccl 
COND2 = CC2 
COND3 = CC3 
ONLN = ONLINE 
STS = STATUS 
EW = EWH 
REQ = REQUEST 


:@@@@@@ SIMULATION INPUT @@@@@@ 


SIMULATION 

TRACE ON /CCO /CCl /CC2 /CC3 /REQUEST /ONLINE CD3 CD2 CD1 CDO STATUS ERASE 
INITIALIZE READY DRSELD 

SETF /CC3 /CC2 /CC1 /CCO /REQUEST /ONLINE /STATUS /EWH /PRESET /CLK 

SETF PRESET ;RESET DEVICE 

CLOCKF CLK ;SELECT DRIVE 

SETF /CC3 /CC2 /CC1 CCO 

SETF REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF /REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF /CC3 /CC2 /CCl /CCO ; INVALID COMMAND 

SETF REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF /REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF /CC3 CC2 /CC1 CCO ;BOT COMMAND 
SETF REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF /REQUEST 

CLOCKF CLK 

CLOCKF CLK 

SETF STATUS ;ASSERT STATUS, BOT 
CLOCKF CLK 

SETF /STATUS 

CLOCKF CLK 

CLOCKF CLK 
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SEVE /CCS CUZ CCL /CCO ;ERASE COMMAND 
SETF REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 


SETF /REQUEST 

CLOCKF CLK 

CLOCKF CLK 

SETF STATUS ; ASSERT STATUS, BOT 
CLOCKF CLK 

SETF /STATUS 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF STATUS ;ASSERT STATUS, EOT 
CLOCKF CLK 

SETF /STATUS 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF STATUS ;ASSERT STATUS, BOT 
CLOCKF CLK 

SETF /STATUS 

CLOCKF CLK 

CLOCKF CLK 

SETF /CC3 CC2 CCl cco ; INITIALIZE COMMAND 
SETF REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF /REQUEST 

CLOCKF CLK 

CLOCKF CLK 

SETF STATUS ;ASSERT STATUS, BOT 
CLOCKF CLK 

SETF /STATUS 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF STATUS ;ASSERT STATUS, EOT 
CLOCKF CLK 

SETF /STATUS 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF STATUS ;ASSERT STATUS, BOT 
CLOCKF CLK 

SETF /STATUS 

CLOCKF CLK 

CLOCKF CLK 

SETF /CC3 CC2 /CCl /cCO ;READ STATUS 


SETF REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF /REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK. 
SETF REQUEST 
CLOCKF CLK 
CLOCKF CLK 
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SETP /REQUEST ;READ STATUS BYTE 1 
CLOCKF CLK 

CLOCKF CLK 

SETF REQUEST 

CLOCKF CLK 

CLOCKF CLK 

SETF /REQUEST ;READ STATUS BYTE 1 
CLOCKF CLK 

CLOCKF CLK 

SETF REQUEST 

CLOCKF CLK 

CLOCKF CLK 

SETF /REQUEST ;READ STATUS BYTE 2 
CLOCKF CLK 

CLOCKF CLK 

SETF REQUEST 

CLOCKF CLK 

CLOCKF CLK 

SETF /REQUEST ;READ STATUS BYTE 3 
CLOCKF CLK 

CLOCKF CLK 

SETF REQUEST 

CLOCKF CLK 

CLOCKF CLK 

SETF /REQUEST ;READ STATUS BYTE 4 
CLOCKF CLK 

CLOCKF CLK 

SETF REQUEST 

CLOCKF CLK 

CLOCKF CLK 

SETF /REQUEST ;READ STATUS BYTE 5 
CLOCKF CLK 

CLOCKF CLK 

SETF CC3 CC2 /CC1 cco ;READ COMMAND 
SETF REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF /REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF /ONLINE ;NEGATE ONLINEB 
CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF STATUS 

CLOCKF CLK 

SETF /STATUS 

CLOCKF CLK 

CLOCKF CLK 

SETF CC3 CC2 /CC1 CCOo ;READ COMMAND 
SETF ONLINE 

SETF REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

SETF /REQUEST 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 

CLOCKF CLK 
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SETF STATUS ;PILE MARK FOUND 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF /STATUS 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETP CC3 CC2 /CCl CCO ;READ COMMAND 
SETP REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF /REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF CC3 CC2 CCl /CCO ;READ FILE MARK 
SETF REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF /REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF STATUS 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF /STATUS 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF CC3 CC2 /CC1 cco ;READ COMMAND 
SETF /ONLINE 
SETF REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF /REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF CC3 CC2 /CC1l /CCcO ;WRITE COMMAND 
SETF ONLINE 
CLOCKF CLK 
SETF REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF /REQUEST 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF /ONLINE ;NEGATE ONLINEB 
CLOCKF CLK 
CLOCKF CLK 
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CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF STATUS 
CLOCKF CLK 
CLOCKF CLK 
SETF /STATUS 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
CLOCKF CLK 
SETF STATUS ;ASSERT STATUS, BOT 
CLOCKF CLK 
SETF /STATUS 
CLOCKF CLK 
CLOCKF CLK 
TRACE_OFF 
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PLE5P16 PLE DESIGN SPECIFICATION 
12345 KEN WON 1/27/87 

COMMAND CODE DECODER 

MMI, SANTA CLARA, CALIFORNIA 


STSO = /CD3*/CD2*/CD1*CDO ; ENABLE STATUS BYTE 0 
sTsl = /CD3*/CD2*CD1*/CDO ; ENABLE STATUS BYTE 1 
STS2 = /CD3*/CD2*CD1*CDO ; ENABLE STATUS BYTE 2 
STS3 = /CD3*CD2*/CD1*/CDO ; ENABLE STATUS BYTE 3 
STS4 = /CD3*CD2*/CD1*CDO ; ENABLE STATUS BYTE 4 
STS5 = /CD3*CD2*CD1*/CDO ; ENABLE STATUS BYTE 5 
EOM = /CD3*CD2*CD1*CDO 7 END OF MEDIA 

RFM = CD3*/CD2*/CD1*/CDO ; READ FILE MARK 

WFM = CD3*/CD2*/CD1*CDO ; WRITE FILE MARK 

READ = CD3*/CD2*CD1*/CDO ; READ 

WRITE = CD3*/CD2*CD1*CDO ; WRITE 

FFWD = CD3*CD2*/CD1*/CDO ; FAST FORWARD 

RWD = CD3*CD2*/CD1*CDO ; REWIND 

ICMD = CD3*CD2*CD1*/CDO ; INVALID COMMAND 


FUNCTION TABLE 


NC CD3 CD2 CD1 CDO EOM ICMD RWD FFWD WRITE READ WFM RFM 
STSS STS4 STS3 STS2 STS1 STSO 





;-»+eINPUT...... Te ee ee Eee Oe eee OUTPUT. cc ccc cece cc ccc cece recccen 
LLLLL LLLLLULLLULLLLuinLu + NO COMMAND 

LLLLH LLLLLLLLULLULULLUE ; ENABLE STATUS BYTE 0 
LLLHL LLLLLLLULLLiLuuHtL ; ENABLE STATUS BYTE 1 
LLLHH LLLLLLULLLLUAHLLE ; ENABLE STATUS BYTE 2 
LLHLL LLLULLLULLLHYHLLULE ; ENABLE STATUS BYTE 3 
LLHLH LLLLULLULLLALLULLE ; ENABLE STATUS BYTE 4 
LLHHL LLULLLULLLAHLULLULLE ; ENABLE STATUS BYTE 5 
LLHHH HLLLLLULLULLLLE + END OF MEDIA 

LHELLL LLLLLLLAHLLLLLte ; READ FILE MARK 

LHLLH LLLULLULALLLELLULLE ; WRITE FILE MARK 
LHLHL LLLLULALLLULLiuLe ; READ 

LHLHH LLLLAELLLLULELLL ; WRITE 

LHHLL LLLHLLLLLULLULLE ; FAST FORWARD 

LHHLH LLHLLLULLLLLULLL ; REWIND 

LHHHL LHLLLLLLULLLLLE ; INVALID COMMAND 
DESCRIPTION 


THIS PLE DEVICE IS ONE OF THREE DEVICES USED TO IMPLEMENT A QIC-02 
COMMAND SEQUENCER. THIS DEVICE DECODES THE OUTPUT CODE FROM THE 
PROSE DEVICE TO GENERATE CONTROL SIGNALS. THE OTHER TWO DEVICES ARE 
A 20L8 PAL DEVICE AND A 14R21 PROSE DEVICE. 
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GCR (4B-5B) Encoder/Decoder 





One of the more common logic functions performed on serial 
data is the data encode/decode function. Usually it is desir- 
able to map (encode) the logical bit stream to a physical bit 
stream, adjusting for the peculiarities of the particular trans- 
mission or storage media. 


Noise, bandwidth, and reliability considerations may mean 
that a different data format would be desirable when data is 
sent along to or stored on a given media. For example, group- 
coded recording (GCR) formats take a given number of data 
bits and encode them with a larger number of bits. A 4B-5B 


GCR code would take 4 data bits and encode them into 16 
states with 5 new bits. A particular 4B-5B code is shown in 
Figure 1. 


This mapping allows at most two zeros to occur in succes- 
sion. Also note that data combinations with more than one 
zero at the beginning and end of the word are excluded. This 
is necessary to insure that when data words are serialized, no 
more than two zeros occur in succession at any point in the 
bit stream. Finally, the data combination 11111 is reserved as 
a synchronization mark. In tape systems, this results in in- 
creased bit density and eases clock synchronization. 


4B-5B Code 
4-Bit Data 5-Bit Data 
0000 11001 
0001 1011 
0010 10010 
0011 10011 
0100 Io 
0101 1.0404 
0110 10110 
014141 1014141 
1000 11010 
1001 01001 
1010 01010 
1011 01011 
1100 111410 
1101 01101 
1110 01110 
174171 0114141 
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Figure 1. 4B-5B Code 
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GCR (4B-5B) Encoder/ Decoder 





The system diagram in Figure 2 shows how the GCR Encod- 
er/Decoder (GCR E/D) interfaces to a tape drive and tape con- 
troller. Parallel input data is given to the GCR E/D, converted to 
the 5-bit format, serialized, and written to the tape. On a read, 
the serial data from the tape is parallelized, converted back to 
the 4-bit format and output to the output data bus. Additionally, 
during a read, two status signals are developed. The first signal, 
INV, indicates the presence of an invalid input, i.e., too many 
zeros in succession. The second status signal, H, indicates the 
detection of the synchronization mark (11111). 


The operation modes for the GCR E/D are shown in the Data- 
Flow Diagrams of Figure 3. The contro! signal and operation 
functions are indicated for each operation mode. In particular, the 
data flow between each bit of the output register is indicated 
schematically. 


The first mode of operation of the GCR E/D is the HOLD 
mode. When ENABLE is HIGH, all data operations on the out- 
put register are disabled, independent of the two mode con- 
trols, M; and Mo. The output data is simply fed-back to the 
register inputs. Thus the register content is retained after the 
clock transition. 


When the ENABLE input is LOW, the operations indicated by 
the M, and My mode bits are executed on the clock transition. 
When M, and Mgare both LOW, the SERIAL SHIFT IN modeis 
selected. In this mode the output register is configured as a 
serial shift register. The serial input is consecutively shifted 
into the register until all 5 bits from the tape have been stored, 
MSB at Y3 and LSB at SERIAL OUT. 


The CONVERT SERIAL INPUT AND LOAD operation is se- 
lected when ENABLE is LOW, M, is HIGH and Mo is LOW. 
After the 5 bits of data have been serialized by the SERIAL 
SHIFT IN instruction, the 5B code must be converted to a 4B 
code. This is accomplished by taking the outputs of the 5 
register bits and converting them to 4 bits with combinatorial 
logic. On the clock transition, the result is loaded into the Y 
register. On the same clock transition that loads the con- 
verted data into the Y register, the serial input is loaded into 
the serial output register. Because the serial data is being 
read continuously, one data bit per clock transition, the con- 
version must be done without missing a serial data bit. 


The CONVERT PARALLEL INPUT AND LOAD operation is 
selected when ENABLE is LOW, M, is HIGH and Mo is HIGH. 
This mode takes the 4 input data bits and converts them to 
the 5 bit representation. The result is loaded into the output 
register on the clock transition. The LSB of the 5B representa- 
tion is loaded into the Y3 bit of the output register and the 
MSB is loaded into the serial output bit. This configuration, in 
conjunction with the next instruction, allows the serial data 
to be written to the tape drive one bit per clock transition. 


The final operation, SERIAL SHIFT OUT, is selected when 
ENABLE is LOW, M, is LOW and Mg is HIGH. After the CON- 
VERT PARALLEL INPUT AND LOAD operation is executed, 
the SERIAL SHIFT OUT operation outputs the converted data 
to the tape drive. A series of one convert operation followed 
by 4 shift operations will transfer a sequence of 5-bits to the 
tape drive, one bit per clock cycle. 


INPUT DATA BUS 


ENCODER/DECODER 


OUTPUT DATA BUS 
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Figure 2. Typical Tape Storage System 
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GCR (4B-5B) Encoder/Decoder 





OPERATION DATA-FLOW DIAGRAM 
- [= [= 
Y3 Y2 1 Yo SERIAL 
OUTPUT 
SERIAL 
INPUT 
SERIAL 
SHIFT IN 


“Y¥3 Ye Yi Yo SERIAL Ej 











OUTPUT 
CONVERT 
SERIAL INPUT CONVERT 5B TO 4B 
AND LOAD 
Y3 Y2 v1 Yo SERIAL 
OUTPUT 
CONVERT 
PARALLEL INPUT CONVERT 48 TO 5B 
AND LOAD 
B2 
Y3 Y2 Y1 Yo SERIAL 
OUTPUT 
SERIAL 
SHIFT OUT 
Y3 Y2 Ys Yo SERIAL 
OUTPUT 
Figure 3. GCR E/D Mode Definitions 03862A-95 
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Design Approach (AmPAL16R6) 


The PAL device implementation of the GCR Encoder/Decoder 
takes advantage of the multiplexerlike structure of the AND-OR 
array. Each valid combination of ENABLE, M, and Mo selects a 
different set of AND terms. In some cases, only one term is 
selected (in data steering operations for example). In other 
cases, multiple AND terms are selected to implement a com- 
binatorial logic function (the 5B-to-4B conversion for example). 
This concept, using the control inputs to enable one or more 
AND terms, allows the direct implementation of the PAL device 
design from the mode Data-Flow Diagrams (with a little Kar- 
naugh map help). The K-Maps for the 5B-to-4B conversion logic 
and the 4B-to-5B conversion logic for the Y3 output are shown in 
Figures 4 and 5. Given these maps and the flow diagrams in 
Figure 3, the Boolean equations can be constructed for the Y3 
output. The resulting equation, in PALASM software format, is 
shown in Figure 6. 


It is important to note that the equation in Figure 6 is written for 
the inverse of the Y3 output (Y3). This is necessary if true data is 
desired on the output pin because of the inverting nature of the 
output buffer on the PAL device. The inverted form of the equa- 
tion is easily implemented by selecting the negative version of 
the data (Y3 in the hold operation for example) or by grouping 
zeros in a combinatorial logic function (see Figures 4 and 5). 
Notice that the multiplexer strategy works equally well for active- 
LOW or active-HIGH logic functions. 


Once the transformation of the Data-Flow Diagrams and 
K-Maps to Boolean equations is understood, the interested 





A3 = Y3*Y2 + Y3*Sout 
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Figure 4. 5B-to-4B Conversion K-Map for Y3 Output 


+ EN* Mi * Mo* Sour 


+ EN* M1 * Mo * Y3 * Sout 


+ EN * My * Mo * Y3 * Y2 
+ EN* Mi * Mo * D3* Do 
+ EN «Mi * Mo * D1 * Do 


reader should be able to construct K-Maps for the other Y out- 
puts and, in conjunction with the Data-Flow Diagrams of 


Figure 3, write the PALASM software equations for the resulting 
logic functions. This exercise will help the reader to fully appreci- 
ate the advantages of the Data-Flow Diagram/Multiplexer 
method of PAL design. Consult the full PALASM software listing 
(Figure 10) for the complete solutions. 


It is important to note that the diagrams and equations in Figures 
7, 8, and 9 specify the true output for invalid signal (INV), rather 
than INV which appears in the system diagram of Figure 2. This 
is necessary if the correct data is desired on the output pin and 
is due to the inverting nature of the output buffer on the PAL 
device. 


The INV signal is registered and held until the clear INV flag 
input (CIF) is brought LOW, deactivating the flag. Only during a 
5B-4B conversion operation (M; = HIGH, Mo = LOW) is the 
INV flag activated. Figures 7 and 8 show the INV flag mode 
definitions and the intermediate INVALID logic equation 
respectively. 


In this case, an active-LOW output is desired so the active-HIGH 
form of the INV signal is developed internally. Ones are grouped 
in the intermediate combinatorial logic function (INVALID) and 
the true version of the data is selected. The complete PALASM 
software equation for INV is given in Figure 9. 


D1iDo 
D3D2 00 0111 10 





Bo = D3*Do +D1*Do 
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Figure 5. 4B-to-5B Conversion K-Map for Y, Output 


;HOLD 

;SERIAL SHIFT IN 
;SERIAL SHIFT OUT 
;CONVERT SERIAL 
;INPUT AND LOAD 
s;CONVERT PARALLEL 
;INPUT AND LOAD 
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Figure 6. PALASM Equation for Y3 
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CIF M1 Mo OPERATION DATA-FLOW DIAGRAM 
“oO” 
0 xX x CLEAR INV 
FLAG 
INV 
1 0 0 HOLD INV 
0 1 FLAG 
. 1 1 
INV 
1 1 0 SET INV Y3 Y2 Y1 Yo Sout 
FLAG 





INVALID 


Figure 7. INV Flag Mode Definitions 





INVALID = Y3 « Y2 + 
Y2* ¥i* Yo + 
Yo « Sout + 
Y3 * Y2* Y4* Yo* Sour 03862A-100 


Figure 8. Intermediate Equation for INVALID 


INV:= CIF* INV +  ;HOLDINV FLAG 

IF * Mi * Mo * Y3* Ya +  ;SETINV FLAG IF INVALID IS TRUE 
+ 
+ 


Ids 


IF * My * Mo * Yo * Sout 
IF + M1 * Mo * Y2 * Y1* Yo 
1F * My * Mo * Y3 * Y2* Y1 * Yo* Sout 


| (2) 


fe) 


03862A-101 
Figure 9. PALASM Equation for INV 
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TITLE 4B-5B ENCODER/DECODER 
PATTERN PATO003 

REVISION 01 

AUTHOR WARREN MILLER 

COMPANY ADVANCED MICRO DEVICES 
DATE 11/06/87 


CHIP ENCO_DECO PAL16R6 


CK Ml MO D3 D2 Dl DO /EN /CIF’ GND 
/E+ SIN /INV YO Yl Y2 Y3 SOUT /H vcc 


EQUATIONS 
/SOUT := EN*/SOUT ; HOLD 
+ /EN*/M1*/M0*/SIN ;SERIAL SHIFT IN 
+ /EN*/M1* MO*/YO ;SERIAL SHIFT OUT 
+ /EN* M1*/M0*/SIN ;CONVERT SERIAL INPUT AND LOAD 
+ /EN* M1* MO* D3* D1 ;CONVERT PARALLEL INPUT AND LOAD 
+ /EN* M1* MO* D3* DO 
/X0 := EN*/yY0 
+ /EN*/M1*/M0*/SOUT 
+ /EN*/M1* MO*/Y1 
+ /EN* M1*/M0*/SOUT 
+ /EN* M1*/MO* Y3* Y2*/yO 
+ /EN* Ml* MO*/D3* Dl 
+ /EN* M1* MO*/D3* D2* DO 
/Y1  =:= EN*/y1 
+ /EN*/M1*/M0*/Y0 
+ /EN*/M1* MO*/Y2 
+ /EN* M1*/MO*/YO 
+ /EN* M1*/MO* ¥3* y2 
+ /EN* M1* MO*/D2 
/¥2  o:= EN*/Y2 
+ /EN*/M1*/M0*/Y1 
+ /EN*/M1* MO*/Y3 
+ /EN* M1*/M0*/Y1 
+ /EN* M1* MO*/D3*/D1*/D0 
+ /EN* M1* MO*/D3* D2*/D1 
+ /EN* Ml* MO* D3*/D1* DO 
/X3 := EN*/y3 
+ /EN*/M1*/MO*/Y2 
+ /EN*/M1* MO*/SOUT 
+ /EN* M1*/MO* Y3* SOUT 
+ /EN* M1*/MO* y3*/y2 
+ /EN* Ml* MO* D3*/DO 
+ /EN* M1* MO* D1*/DO 
INV := /CIF* INV ;HOLD INV FLAG 
+ /CIF* M1*/M0*/Y3*/Y2 ;SET INV FLAG IF INVALID TRUE 
+ /CIF* M1*/M0*/Y2*/Y1*/Y0 : 
+ /CIF* M1*/MO0*/Y0*/SOUT 
+ /CIF* M1*/MO* Y3* Y2* y1l* yO* SOUT 
H = ¥3* Y2* yl* yo* sour 


; SIMULATION NOT INCLUDED 
Figure 10. PALASM Listing 
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Industrial control can be defined as simply as a basic state 
machine: process information is gathered and analyzed, and 
outputs feed back to the process to modify it based upon the 
analysis. Thus, PAL device applications are often state 
machines. 


A very basic industrial control system is a heater thermostat. 
Temperature is sensed, compared to a maximum upper or lower 
limit, and the heater is either turned on or off. A small state 
machine PLD can implement this function. 


Much of the circuitry in industrial control applications is analog or 
linear. Process information is gathered by sensors, and then the 
information is conditioned to eliminate noise. The conditioned 
signal is then converted to a digital signal, and processed by a 
digital processor. The control outputs are then converted to 
appropriate analog signals to modify the process (Figure 1). 


Data Acquisition 


The sensing part of the system is data acquisition. Data acquisi- 
tionis important not only in industrial control applications, but also 
in scientific monitoring applications. Data acquisition requires the 
capture of analog signals and conversion to useful! information 
that can be analyzed. This first requires conditioning of the input, 
to filter out noise. The signal can then be analyzed with analog 
circuitry, but the power of digital microprocessors makes conver- 
sion to digital logic more efficient. 


More logic is being incorporated into the acquisition circuitry. The 
analog circuitry for signal conditioning is a natural addition to a 
sensor. Analog-to-digital conversion (ADC) can even be per- 
formed in the same unit. Advanced sensors may even incorpo- 
rate a microprocessor to provide immediate local analysis of the 
data, offloading the central processor. In these applications, PAL 
devices can form the glue logic for the microprocessor. 


Data Analysis 


Data analysis has been performed in the past by a central 
computer. The central computer would have the task of polling all 
of the sensors and analyzing the incoming data. However, ina 
system where thousands or millions of sensors are active, or 
where immediate response to process changes is required, this 
system is inefficient. 


CONDITIONING 


ANALYSIS 


410 01 





The type of system more commonly used today includes distrib- 
uted control and processing (Figure 2). Microcomputers or 
programmable controllers can perform low-level processing and 
react with control signals immediately. Often single-board com- 
puters with added analog conversion circuitry are used. 


ALAN connection between a number of intelligent sensors and 
processors provides a very efficient system. Simple RS-232 links 
connect sensors. Buses such as Multibus or VME bus are 
commonly used to connect the boards together. Standards such 
as MAP (Manufacturing Automation Protocol) are being devel- 
oped for future LAN-based systems, allowing dissimilar devices 
to be connected in a network. Fiber-optic links will provide the 
data reliability over long distances in confined areas required for 
industrial control. 


CONTROLLED 


PROCESS 





410 02 
Figure 2. 


In these areas, standard computer applications for PAL devices 
are numerous. Bus interface is another key application area for 
communication within distributed systems. 


Control 

The most critical part of the system is control. The system must 
respond to the analyzed information with signals that modify the 
process under observation. In a system such as a nuclear 


reactor, immediate and accurate response is the most important 
objective. 


SENSOR 


CONTROL 


CONTROLLED 
PROCESS 


Figure 1. 
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PID Algorithm 


For smooth control, the PID algorithm for analysis and control is 
often used. P stands for Proportional: the incoming data is 
compared to a reference number, and then the process is 
modified in proportion to the difference between the desired and 
measured values. For example, a heater thermostat that meas- 
ures air temperature of 55 degrees, when its reference value was 
set at 65 degrees, would turn on the heater, and reduce the 
amount of heat as the temperature got closer to the reference 
value. 


Proportional analysis alone often causes undershoot or over- 
shoot. If the heater works quickly, it can heat the air well past the 
reference value. And if it works slowly, it may take a long time to 
heat the air back to the reference value. 


For these reasons, the Integral (I) part of the algorithm is added. 
The integral is taken of the difference between the reference 
value and measured value; in effect, this measures how long the 
measured value has been off from the reference value. Using the 
integral prevents undershoot and overshoot. 


However, the result can be a control output that causes the 
measured value to continue oscillating around the reference 
value. For example, if the heater is turned on at 64 degrees and 
an air conditioner is turned on at 66 degrees, the temperature will 
oscillate rapidly around the reference point of 65 degrees. 


For this reason, a Derivative (D) input is used. The derivative 
looks at the rate of change of the measured value, and will slow 
down the response if the measured value is rapidly approaching 
the reference value. 


The complete PID algorithm is 


OUT = Kp x ERR + Ki x JERR + Kd x dERR/dt, 


combining present, past, and future input considerations. Selec- 
tionof the proper constants allows the control response tobe both 
rapid and smooth. 


The PID algorithm requires mathematical processing capabili- 
ties, similar to those used in digital signal processing (see the 
section on Digital Signal Processing, page 2-283). 


Control circuitry is also combining analog and digital logic. Smart 
power ICs can perform logic functions while receiving and send- 
ing high-power signals. These ICs can perform simple functions 
like overvoltage protection, cutting off a circuit if the powersurges. 


Applications 


Applications for industrial control systems are varied, and include 
the following: 


« Factory Automation 
Automated distribution centers 
Power grid monitoring and control 
* Motor Control 
Robots 
Engine testing 
Torque control 


¢ Transportation 
Air traffic control 
Subway system control 
¢ Automotive Electronics 
¢ Satellite Communication 
* Simulators 


Performance Requirements 


The key requirement of industrial control systems is the need for 
real-time processing. An air-traffic control system cannot be 
allowed to slow down if overloaded. During overloading in real 
systems, the information provided on display screens is reduced, 
instead of completely losing all information on a plane. 


An interrupt handling system is more efficient than a standard 
polling system. Many I/O are required, with high DAC and ADC 
accuracy. For data analysis, a large memory and good memory 
management is important. Workstations with a high 1/O speed 
are often used for central industrial controllers. 


PAL Device Usage 


Most of the circuitry in an industrial control application is analog 
or linear. Only the analyzer is primarily digital logic. The analyzer, 
whether local or central, uses a standard computer architecture. 


Programmable logic devices, especially CMOS PLDs, are used 
in industrial control much as they are used in other computer 
applications. High speed is important in real-time control applica- 
tions involving a large amount of data. 


CMOS PAL Devices 


Because of the harsh environment often found in factories or 
around motors, CMOS technology is required. Specifications for 
the industrial range CMOS ZPAL family are guaranteed over a 
wider temperature range, and a wider supply voltage range. In 
addition, CMOS inputs are less sensitive to noise fluctuations. 
The lower power consumed by all CMOS PAL devices allows a 
smaller power supply, important in the small areas controllers 
often must be placed. Also, it is easier for a battery to back upthe 
power supply for critical operations. The reduced heat is good for 
enclosed areas, and allows CMOS devices to be placed closer 
together, even in surface-mounted devices, to save room. 


The erasability of CMOS PAL devices allows changes to refer- 
ence values, PID control constants, or other process control 
changes. Erasability also allows easy field service upgrades or 
repair. 


PAL Device Applications 

PAL devices find applications in all levels of industrial control, 
from the low-level sensing equipment to the mainframe analysis 
computers. 


Low-Level Applications 


Low-level data acquisition and response circuitry can be based 
on an individual state machine PLD. For example, stepper 
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motors are often used for mechanical valves. Asingle PAL device 
can be programmed to convert enable, direction, and stepping 
information into the appropriate signals to hold or step the motor 
(page 2-550). 


Another example is shaft encoding (page 2-558). Shaft encoders 
convert the position of a rotating shaft into a digital signal. The 
position is sensed optically, and the digital information is encoded 
into position, speed, and direction signals. 


More complex applications may be handled by dedicated state 
machine PLDs, including the PROSE device or the Am29PL141. 


Medium-Range Applications 


Dedicated microcontrollers can be used in medium-range appli- 
cations, such as small controllers linked to a central computer. 
These applications require the intelligence of a microcontroller 
but not a complete microprocessor-based computer system. In 
these applications, PAL devices form the glue logic between the 
microcontroller and its surrounding chips. 


High-End Applications 


Dedicated local computers and central data processing comput- 
ers are the high-end applications for PLDs. Here, microproces- 
sors are the brains of the system and PLDs perform many of the 
support functions. 


Microprocessor support is a key area of application (see the 
section titled Microprocessor-Based Systems, page 2-131). This 
is especially true of distributed control systems, where several 
microprocessors perform loca! processing. Astandard architec- 
ture canbe modified for different areas of ihe process by program- 
ming the PAL devices differently. PAL devices also allow the local 
controllers to take up less space. 


Another key application area is bus interface for distributed 
systems (see the section titled Bus Interface, page 2-325). Either 
synchronous or asynchronous buses may be used. 


Since industrial controllers are usually interrupt-driven for effi- 
cient response, interrupt logic forms another potential application 
area for PAL devices. 


Digital signal processing is often performed in the analysis 
section of the industrial controller, usually in a central processor. 
The applications would not require the speed or complexity of 
most DSP applications, but would use similar circuitry. See the 
section titled Digital Signal Processing. 
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Functional Description 


Stepper motors and linear actuators are used in a variety of 
applications requiring precise rotational and/or linear move- 
ment. Examples are printers, floppy disk drives, mechanical 
valves, etc. Stepper motors are two-phase permanent magnet 
motors which provide discrete angular movement every time the 
polarity of a winding is changed. In linear actuators the angular 
movement is converted to linear movement via a load screw. In 
essence, they are dc motors without brushes, where the user 
provides commutation with external logic. 


In these examples, the control and drive circuitry for such a 
motor is implemented digitally using PAL devices. 


Circuit Operation 


One type of drive circuit, unipolar drive, is shown in Figure 1. Two 
drive sequences are given in Tables 1a and 1b. Angular rotation 
is achieved by saturating the transistor drivers in the sequence 
shown in the appropriate table (full or half step). Now, assume 
the circuit of Figure 1 is connected to a stepper motor designed Figure 1 
for 7.5° steps. By following the step sequence of Table 1a (full 

step), the shaft will rotate 7.5° each time the state is changed. If 

the sequence of Table 1b is followed, a3.75° (half-step) rotation 

will result for each change of state. For both step sequences, the 

direction can be reversed by stepping backwards through the 

table (step 4-3-2-1-4-etc.). 





Table 1a 
FULL STEP SEQUENCE 





CLOCKWISE COUNTER- 

ROTATION CLOCKWISE 
ROTATION 

Table 1b 
HALF STEP SEQUENCE 

CLOCKWISE COUNTER- 

ROTATION CLOCKWISE 
ROTATION 


1 
2 
3 
4 
5 
6 
7 
8 
1 


- oO 0 000; — + 
oo +++ + 000 0 
—-~ 32 32 OOoO0ncd70o = 
oo o 0-1. + —- O C8 
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PAL Device Implementation e Enable/Disable inputs to enable stepping in either section. 
In this application, one PAL16R4 can be used to provide the logic (E inputs). 

levels required to drive two stepper motors in the full step mode. © Select clockwise or counter-clockwise rotation. 

Due to the high current drive required (100-400 mA/phase), * Set the motor to logic state step 1. 


external inverting high current buffers must be used (ULN 2001 
or equivalent). In the design, the following features are provided 
within the PAL device: 


A block diagram/pinout is shown in Figure 2. 


MOTORA 






EXT CLOCK 
(DETERMINES 





STEP RATE) 

ETA 
E2A 

MOTOR A 

CONTROLS SA vcc 
DA 
E1B 
E2B 

MOTOR B 

CONTROLS 


MOTOR B 





Figure 2 


A function table for each motor control section is given below. 


| cock | Ei | & | s | D | FUNCTION 
Hold motor in current position 
Hold motor in current position 
1 


ee ae ee ie Set outputs to step 1 levels 
fo | oO | 0 | 0 [ Step motor clockwise 
po | 8 fT 8 tT Step motor counter-clockwise | 





The full step sequence (Table 1a) can be simplified from 4 
outputs to 2 outputs since Q1 = QO and Q3 = Q2. The sequence 
can then be expressed as follows: 


Q0n+4 = 1, Q14+4 = 0, Q2n+4 = 1, Q3n+4 = 0 
when E1 = 1 or E2 = 1: 
Q0n+4 = Q0p, Q1n+4 = Qi, Q2n+4 = Q2py, Q3q+4 = Q3 
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The step sequences can be converted to equations by use of a 
Karnaugh map. 


pes, 
' Pl [c 
- eh 
—- 7 
Q0,4+4 = Q2,:5+02,-D 
Q0, 
Ft 
“ Bid 
D 
Q2,+4 = Q0,°D+Q0,°D 


Factor in E1 and E2: 
Q0,+4 = Ei bs E2 : Q2, 
Q2n+1 = ET: E2-Q0, 


-D+E1-E2-Q2,-D 
-D+E1-E2-Q0,:D 


Express the set function as an equation: 
Q0,+4 = E1-E2°S Q2,4+1 = E1-E2-S 


Express the hold function (when E1 or E2 = 1) 
Q0,+4 = QO, + E1 + Q0, + E2 
Q2n+1 = Q2p,°E1 + Q2,° E2 


Combining all the above: 
Q0n+4 = E1+ E2-S+Q0,-E1+ Q0,+E2+E1-E2-Q2 
Qn = Wy 44 


These equations make up the first design file. 


Conclusion 


Although this example could be used “as is” in a stepper motor 
application, the programmability of PAL devices could allow for 
any desired modifications. Changes to the circuit might include: 


1. Drive only one stepper motor, using a PAL16R6. The other 
flip-flops could be used as a programmable counter, allowing 
for different speed settings. 


2. Drive only one stepper motor, using the extra inputs and 
outputs to handle other circuit functions. 

3. Drive only one stepper motor, using a PAL16R6. The other 
flip-flops could be used as a 4-bit position counter. 

4. The substitution of a PAL16R8, and another inverting buffer 
would allow the driving and control of four stepper motors. 

5. Reprogram for half-step operation. This is done in the second 
design example. 


n'D+E1-E2-Q2,-D 


Q2y+1 = E+ E2+S+ Q2,-E1+ Q29-E2+E1-E2-Q0,-D+E1-E2-Q0,°D 


Q3q44 = Q2n44 
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TITLE STEPPER MOTOR CONTROLLER 
PATTERN P7015 

REVISION 1.00 

AUTHOR DAVE SACKETT 

COMPANY DEVOE COMPANY, INDIANAPOLIS, INDIANA 
DATE 02/23/81 


CHIP SMC PAL16R4 


CLK /E1A /E2A SA DA /E1B /E2B SB DB GND /OC 
/Q3B /Q1B /Q2B /Q0B /Q2A /QO0A /Q3A /Q1A VCC 





EQUATIONS 
QOA := QOA*/EI1A ;HOLD IF NOT El 
+ QOA* /E2A ;HOLD IF NOT E2 
+ SA * ELA* E2A ;STEP 1 IF SET 
+ /Q2A* ElA* E2A* DA ;LOAD /Q2A IF COUNTER-CLOCKWISE 
+ Q2A* ElA* E2A*/DA ;LOAD Q2A IF CLOCKWISE 
QlA = /QOA 
Q1LA.TRST = VCC 
Q2A := Q2A*/E1A ;HOLD IF NOT El 
+ Q2A* /E2A ;HOLD IF NOT E2 
+ SA * ElA* E2A ;STEP 1 IF SET 
+ QOA* ElA* E2A* DA ;LOAD QOA IF COUNTER-CLOCKWISE 
+ /QOA* ELA* E2A*/DA ;LOAD /QOA IF CLOCKWISE 
Q3A = /Q2A 
Q3A.TRST = VCC 
Q0B := QOB*/E1B ;HOLD IF NOT El 
+ QOB* /E2B ;HOLD IF NOT E2 
+ SB * E1B* E2B ;STEP 1 IF SET 
+ /Q2B* E1B* E2B* DB ;LOAD /Q2B IF COUNTER-CLOCKWISE 
+ Q2B* E1B* E2B*/DB ;LOAD Q2B IF CLOCKWISE 
Q1B = /QOB 
Q1B.TRST = VCC 
Q2B := Q2B*/E1B ;HOLD IF NOT El 
+ Q2B* /E2B ;HOLD IF NOT E2 
+ SB * E1B* E2B ;STEP 1 IF SET 
+ QOB* E1B* E2B* DB ;LOAD QOB IF COUNTER-CLOCKWISE 
+ /QOB* El1B* E2B*/DB ;LOAD /Q0B IF CLOCKWISE 
Q3B = /Q2B 


Q3B.TRST = VCC 
SIMULATION 


SETF OC E1A E2A SA E1B E2B SB 
CLOCKF CLK 
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CHECK QOA /Q1A Q2A /Q3A QO0B /Q1B Q2B /Q3B 


SETF /E1LA /E2A /E1B /E2B 
CLOCKF CLK 
CHECK QOA /Q1A Q2A /Q3A Q0B /Q1B Q2B /Q3B 


SETF El1A E2A /SA /DA E1B E2B /SB DB 
CLOCKF CLK 
CHECK QOA /QI1A /Q2A Q3A /Q0B Q1B Q2B /Q3B 


CLOCKF CLK 
CHECK /QOA Q1A /Q2A Q3A /Q0B Q1B /Q2B Q3B 


CLOCKF CLK 
CHECK /QOA Q1A Q2A /Q3A QOB /Q1B /Q2B Q3B 


CLOCKF CLK 
CHECK QOA /Q1A Q2A /Q3A QOB /Q1B Q2B /Q3B 


CLOCKF CLK 
CHECK QOA /Q1A /Q2A Q3A /Q0B Q1B Q2B /Q3B 


CLOCKF CLK 
CHECK /Q0A Q1A /Q2A Q3A /Q0B Q1B /Q2B Q3B 


SETF DA /DB 
CLOCKF CLK 
CHECK QOA /Q1A /Q2A Q3A /Q0B Q1B Q2B /Q3B 


SETF /E1LA /E1B 
CLOCKF CLK 
CHECK QOA /Q1A /Q2A Q3A /Q0B Q1B Q2B /Q3B 


SETF ElA /E2A E1B /E2B 
CLOCKF CLK 
CHECK Q0A /Q1A /Q2A Q3A /Q0B Q1B Q2B /Q3B 


SETF SA SB 
CLOCKF CLK 
CHECK QOA /Q1A /Q2A Q3A /Q0B Q1B Q2B /Q3B 


7 DESCRIPTION 


7;THIS PAL16R4 PROVIDES THE LOGIC LEVELS REQUIRED TO DRIVE TWO 
;STEPPER MOTORS IN THE FULL STEP MODE. 


7THE FOLLOWING OPERATIONS MAY BE PERFORMED FOR EACH STEPPER 
7;MOTOR CONTROLLER INDIVIDUALLY: 


Q 
Ei 
a 
™y 
53) 
hk 
~~ 
ie) 
NO 
Nn 
Oo 


OPERATION 
; XxX H X X X HOLD MOTOR IN CURRENT POSITION 
; X X H Xx X HOLD MOTOR IN CURRENT POSITION 
r oC L L H X SET OUTPUTS TO STEP 1 LEVELS 
Po L L L L STEP MOTOR CLOCKWISE 
7; C L L L H STEP MOTOR COUNTER-CLOCKWISE 


. 
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TITLE DUAL STEPPER MOTOR CONTROLLER 
PATTERN P7094 

REVISION 1.00 

AUTHOR COLI/SACKETTE 

COMPANY MONOLITHIC MEMORIES 

DATE 12/07/82 


CHIP DUAL SMC PAL16R8 


CLK /ENA SETA ROTA MODEA /ENB SETB ROTB MODEB GND 
JOC /SW1B /SW2B /SW3B /SW4B /SW4A /SW3A /SW2A /SW1A VCC 


EQUATIONS 
SW1A:= SWI1A * /ENA ;HOLD SW1A:DISABLE 
+ /SW2A * /SW4A* ENA* ROTA* MODEA ;HALF-STEP MTR CW 
+ SW3A * ENA* ROTA*/MODEA ;FULL-STEP MTR CW 
+ /SW2A*/SW3A * ENA*/ROTA* MODEA ;HALF-STEP MTR CCW 
+ SW4A* ENA* /ROTA* /MODEA ;FULL-STEP MTR CCW 
+ ENA * SETA ;SET A TO STEP 1 
SW2A:= SW2A * /ENA ;HOLD SW2A:DISABLE 
+/SW1A * /SW3A * ENA* ROTA* MODEA*/SETA ;HALF-STEP MTR CW 
+ SW4A* ENA* ROTA*/MODEA*/SETA ;FULL-STEP MTR CW 
+/SW1A * /SW4A* ENA*/ROTA* MODEA*/SETA ;HALF-STEP MTR CCW 
+ SW3A * ENA*/ROTA* /MODEA*/SETA ;FULL-STEP MTR CCW 
SW3A:= SW3A * /ENA ;HOLD SW3A:DISABLE 
+/SW1A */SW4A* ENA* ROTA* MODEA ;HALF-STEP MTR CW 
+ SW2A * ENA* ROTA*/MODEA ;FULL~STEP MTR CW 
+ /SW2A */SW4A* ENA*/ROTA* MODEA ;HALF-STEP MTR CCW 
+ SWI1A * ENA* /ROTA* /MODEA ;FULL-STEP MTR CCW 
+ ENA * SETA ;SET A TO STEP 1 
SW4A:= SW4A*/ENA ;HOLD SW4A:DISABLE 
+ /SW2A* /SW3A * ENA* ROTA* MODEA*/SETA ;HALF-STEP MTR CW 
+ SWI1A * ENA* ROTA*/MODEA*/SETA ;FULL-STEP MTR CW 
+/SW1A * /SW3A * ENA*/ROTA* MODEA*/SETA ;HALF-STEP MTR CCW 
+ SW2A * ENA* /ROTA* /MODEA*/SETA ;FULL-STEP MTR CCW 
SW1B:= SW1B * /ENB ;HOLD SW1B:DISABLE 
+ /SW2B */SW4B* ENB* ROTB* MODEB ;HALF-STEP MTR CW 
+ SW3B * ENB* ROTB*/MODEB ;FULL-STEP MTR CW 
+ /SW2B* /SW3B * ENB*/ROTB* MODEB ;HALF-STEP MTR CCW 
+ SW4B* ENB*/ROTB* /MODEB ;FULL-STEP MTR CCW 
+ ENB * SETB ;SET B TO STEP 1 
SW2B:= SW2B * /ENB ;HOLD SW2B:DISABLE 
+/SW1B * /SW3B x ENB* ROTB* MODEB*/SETB ;HALF-STEP MTR CW 
+ SW4B* ENB* ROTB*/MODEB*/SETB ;FULL-STEP MTR CW 
+/SW1B * /SW4B* ENB*/ROTB* MODEB*/SETB ;HALF-STEP MTR CCW 
+ SW3B * ENB* /ROTB* /MODEB*/SETB ;FULL-STEP MTR CCW 
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SW3B:= 
+/SW1B 


SIMULATION 


SETF OC ENA 
CLOCKF CLK 
CHECK SW1A 


SETF /SETA 
CLOCKF CLK 
CHECK SW1A 


CLOCKF CLK 
CHECK SW1A 


SETF /ENA / 
CLOCKF CLK 
CHECK SW1A 


SETF ENA EN 
CLOCKF CLK 


Stepper Motor Controllers 


SW3B * /ENB 
*/SW4B* ENB* ROTB* MODEB 
* ENB* ROTB*/MODEB 
/SW2B */SW4B* ENB*/ROTB* MODEB 
* ENB* /ROTB* /MODEB 
ENB * SETB 


SW4B* /ENB 
/SW2B* /SW3B * ENB* ROTB* MODEB*/SETB 
ENB* ROTB*/MODEB*/SETB 
ENB* /ROTB* MODEB*/SETB 
ENB* /ROTB* /MODEB* /SETB 


+ FF 


* /SW3B 
SW2B 


SETA ENB SETB 
/SW2A SW3A /SW4A SW1B-/SW2B SW3B /SW4B 
ROTA MODEA /SETB /ROTB MODEB 


/SW2A /SW3A /SW4A /SW1B /SW2B SW3B /SW4B 


/SW2A /SW3A SW4A /SW1B SW2B SW3B /SW4B 
ENB 
/SW2A /SW3A SW4A /SW1B SW2B SW3B /SW4B 


B 


CHECK /SW1A /SW2A /SW3A SW4A /SW1B SW2B /SW3B /SW4B 


CLOCKF CLK 


CHECK /SW1A SW2A /SW3A SW4A /SW1B SW2B /SW3B SW4B 


CLOCKF CLK 


CHECK /SW1A SW2A /SW3A /SW4A /SW1B /SW2B /SW3B SW4B 


CLOCKF CLK 


CHECK /SW1A SW2A SW3A /SW4A SW1B /SW2B /SW3B SW4B 


CLOCKF CLK 


CHECK /SW1A /SW2A SW3A /SW4A SW1B /SW2B /SW3B /SW4B 


CLOCKF CLK 
CHECK SW1A 


/SW2A SW3A /SW4A SW1B /SW2B SW3B /SW4B 


SETF /ENA /ROTA /MODEA /ENB ROTB /MODEB 


CLOCKF CLK 
CHECK SW1A 


/SW2A SW3A /SW4A SW1B /SW2B SW3B /SW4B 


SETF ENA ENB 
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;HOLD SW3B 


; HALF-STEP 
; FULL-STEP 
; HALF-STEP 
; FULL-STEP 


: DISABLE 


MTR CW 
MTR CW 
MTR CCW 
MTR CCW 


;SET B TO STEP 1 


sHOLD SW4B: 


;HALF-STEP 
; FULL-STEP 
;HALF-STEP 
; FULL-STEP 


DISABLE 
MTR CW 
MTR CW 
MTR CCW 
MTR CCW 


Stepper Motor Controllers 





CLOCKF CLK 
CHECK /SW1A SW2A SW3A /SW4A SW1B /SW2B /SW3B SW4B 


CLOCKF CLK 
CHECK /SW1A SW2A /SW3A SW4A /SW1B SW2B /SW3B SW4B 


CLOCKF CLK 
CHECK SW1A /SW2A /SW3A SW4A /SW1B SW2B SW3B /SW4B 


CLOCKF CLK 
CHECK SW1A /SW2A SW3A /SW4A SW1B /SW2B SW3B /SW4B 





; DESCRIPTION 

;CLK /OC /EN SET ROT MODE SW1-SW4 COMMENTS 

; X H xX X X X vA HI-Z 

eS L H X X X HOLD HOLD MOTOR POSITION 

aa L L H X Xx 1 SET MOTOR POSITION TO STEP 1 
26 L L L H H SW PLUS 1 HALF-STEP MOTOR CLOCKWISE 

3 € L L L H L SW PLUS 2 FULL-STEP MOTOR CLOCKWISE 

ae L L L L H SW MINUS 1 HALF-STEP MOTOR COUNTERCLOCKWI 
2€ L L L L L SW MINUS 2 FULL-STEP MOTOR COUNTERCLOCKWI 


~ 
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The trend away from analog systems to digital numerical control 
systems focuses new attention on instruments that convert the 
analog output of position-sensing devices into digital format. 


The recording of the position of moveable parts of a machine 
requires high accuracy and noise immunity which can be 
attained through shaft encoder circuits of the type used in speed 
controllers and optical position-sensing devices. 


Principles of Shaft Encoding 


Most commonly used are opto-electrical encoders. The advan- 
tages of this type of circuit are direct shaft-to-digital encoding, a 
minimum number of power supplies, low power requirements, 
low cost and high speed. 


ONE CODE 
SEGMENT 





L OPTICAL DISC 
LIGHT —»> 


SOURCES —> 


<— LIGHT 
~*—- SENSORS 


Optical encoders measure shaft rotation by detecting the light 
which passes through a rotating code disc and a fixed slit. 


It consists of anumber of light sources and sensors whose paths 
are interrupted by a disc that has transparent and opaque areas. 


By using various concentric patterns, the shaft position can be 
defined. Light shines through the disc onto sensors. A sensor 
turns on when the light passes through its corresponding trans- 
parent part of the disc giving an output that depends on the 
opacity of the segment. The combined output of all the segments 
is digital information representing the disc and the shaft position. 


~— Anabsolute encoder has anumber of concentric tracks on the 
code disc, which provide paralle! readout of shaft angle with- 
out counting pulses. The code patterns on the disc are a kind 
of storage. 


Thus, the readout is also present after a power interruption. 


— Incremental encoders, which will be described here, have a 
single code track on the disc. Angular position is determined 
by counting pulses produced by the modulated light falling 
onto the photodetectors. 


Direction sensing is obtained by the use of quadrature signals 
which are provided by appropriate phasing of the paths. 





Vcc 


CIRCUIT WHICH PRODUCES MODULATED 
OUTPUT SIGNALS USING LIGHT SENSORS 
A (PHOTO DIODES) 


LIGHT SENSOR 


90° 


PHIO PHI90 


1 ~~ 


Thus, if power is interrupted an incremental encoder must have 
its zero position reestablished. 


The two signals from the light sensors have to be amplified and 
converted into digital format using Schmitt triggers. 


ANALOG ——>| AMPLIFIER [|_| ear 


The two signals are out of phase by a quarter of a period. The 
direction of movement can be determined by which signal leads 
or Jags the other. 


Out of this phase relationship the shaft encoder produces 
information about direction and speed for a following up/ 
down-counter. 


To avoid random discrepancies during switching operations 
shaft encoders should be built using synchronous clocked 
circuitry. 


The diagram below shows a typical circuit diagram for shaft 
encoding. 


CLK PHIO 


UP 


DOWN 
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The phase relationship of the two signals PHIO and PHI90 is 
determined by the four registers, and their outputs are encoded. 


The underlying principle of the above circuit is the time delay 
between the clock pulses for the two signals PHIO and PHI90. 


S1= PHIO at CLK $2 = PHIO at CLK + 1 
S1 = PHI90 at CLK S4 = PHI9O at CLK + 1 


A transition in PHIO or PHI90 from L to H or vice versa causes a 
change in the data stored in the registers, which is encoded for 
direction sensing. 


The logic equations for the encoder circuitry are given below 


PHIO leads PHI90: S1* S2* S3*/S4 
+/S1 * /S2* /S3* S4 
+ $1 * /S2* /S3 * /S4 
+/S1* S2* S3* S4 


/S1* /S2* S3* /S4 
+S1* S2*/S3* S4 
+S1*/S2* S3* S4 
+/S1* S2* /S3* /S4 


PHIO lags PHI90: 


cex-LLLLit tit tt iti 


PHIO LEADS PHI90 BY 90° 





PHIO LAGS PHI90 BY 90° 


pital es eed 
pie Le 


UP = HIGH 


DOWN T_T LLL 


When the shaft rotates in a clockwise direction, the input PHIO 
leads the input PHI90 by 90° and the logic will generate pulses 
only at the UP-output. 

~ Onthe other hand, when the shaft’s rotation is counterclockwise 
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the input PHIO /ags the input PHIS90 by 90°. In this case, four 
pulses per square wave are presented at the DOWN-output. Note 
that both the UP and DOWN outputs of the counter are normally 
held high. 


To ensure that no shaft encoder transition is missed, the clock 
frequency should be at jeast 8xNxS, where N is the number of 
pulses produced by the encoder for each shaft revolution and S 
is the maximum speed in revolutions per second to be expected. 


Most commonly CLK frequencies in the range of 1 MHz are used 
for optimum circuit operation. 


Synchronous two-channel shaft encoders as described above 
are relatively insensitive to encoder phase errors. They use no 
temperamental Mono-Flops and can detect any illegal transition 
states generated by the circuit. 


Interference on the input lines is ignored if it occurs between two 
clock cycles. Random noise on both input lines results in one UP 
count for each DOWN count so that in the end the digital infor- 
mation remains unchanged. Hence synchronous shaft encoders 
are extremely useful in electrically noisy environments. 


The PAL Device Applications are examples of such circuits 
using a single PAL device each. 


SHAFT ENCODER #1 









74193 


UP/DOWN c= Q0-03 
COUNTER 








SHAFT ENCODER #2 





74697 
UP/DOWN 
COUNTER 






=> Q0-03 







SHAFT ENCODER #3 
WITH INTERNAL 4-BiT 
UP/DOWN COUNTER 


CLK 


PH!0 ——»> 
PH!Ig90 ——> 

x4 
DATA+LD 
SET 


oc 
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TITLE SHAFT ENCODER No. 1 
PATTERN P7097 

REVISION 1.00 

AUTHOR WILLY VOLDAN 
COMPANY MONOLITHIC MEMORIES 
DATE 09/09/82 


CHIP SE1l PALI16R4 


CLK PHIO PHI90 X4 NC NC NC NC /SSET GND 
/OC DOWN NC S4 S3 S2 S1 NC UP VCC 


EQUATIONS 
/Sl1 := /PHIO 
+ SSET 
/S2 := /S1l 
SSET 
/S3 := /PHI90 
+ SSET 
/S4 := /83 
+ SSET 
/ DOWN S1* S$2* S3*/S4* PHIO* PHI90 


/S1*/S2*/S3* S4*/PHI0*/PHI90 

S1*/S2*/S3*/S4* PHI0*/PHI9I0 
/S1* S2* S3* S4*/PHIO* PHI90 
DOWN.TRST = VCC 


++i 


+ 


/UP /S1*/S2* S3*/S4*/PHIO* PHI90 
S1* S2*/S3* S4* PHIO*/PHI90 
S1*/S2* S3* S4* PHIO* PHI90 

/SL* S2*/S3*/S4*/PHIO*/PHI90 

UP.TRST = VCC 


++ Il. 


+ 


SIMULATION 


SETF OC SSET 
CLOCKF CLK 
CHECK /S4 /S3 /S2 /Sl1 UP DOWN 


SETF /SSET /PHIO /PHI90 
CLOCKF CLK 
CHECK /S4 /S3 /S2 /Sl UP DOWN 


CLOCKF CLK 
CHECK /S4 /S3 /S2 /Sl1 UP DOWN 


SETF PHI90 
CLOCKF CLK 
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;CHECK FOR PHIO 
; INITIALIZE S1=L 


;CHECK FOR Sl 
; INITIALIZE S2=L 


;CHECK FOR PHI90 
; INITIALIZE S3=L 


;CHECK FOR S3 
; INITIALIZE S4=L 


;PHIO LEADS PHI90 
;PHIO LEADS PHI90 
*PHIO LEADS: PHI90 
;PHIO LEADS PHI90 


> PHI90 
> PHI90 
7 PHI90 
; PHI90 


LEADS PHIO 
LEADS PHIO 
LEADS PHIO 
LEADS PHIO 


PAL16R4 
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CHECK /S4 $3 /S2 /Sl /UP DOWN 


CLOCKF CLK 
CHECK S4 S3 /S2 /S1 UP DOWN 


SETF PHIO 
CLOCKF CLK 
CHECK S4 S3 /S2 S1 /UP DOWN 


CLOCKF CLK 
CHECK S4 S3 S2 Si UP DOWN 


SETF /PHI90 
CLOCKF CLK 
CHECK S4 /S3 S2 Sl /UP DOWN 


CLOCKF CLK 
CHECK /S4 /S3 S2 S1 UP DOWN 


SETF /PHIO 
CLOCKF CLK 
CHECK /S4 /S3 $2 /Sl /UP DOWN 





CLOCKF CLK 
CHECK /S4 /S3 /S2 /Sl1 UP DOWN 


SETF PHI90 
CLOCKF CLK 
CHECK /S4 S3 /S2 /S1 /UP DOWN 


CLOCKF CLK 
CHECK S4 S3 /S2 /S1 UP DOWN 


SETF PHIO 
CLOCKF CLK 
CHECK $4 $3 /S2 Sl /UP DOWN 


CLOCKF CLK 
CHECK S4 S3 S2 Sl UP DOWN 


SETF /PHI90 
CLOCKF CLK 
CHECK S4 /S3 $2 Sl /UP DOWN 


CLOCKF CLK 
CHECK /S4 /S3 S2 Sl UP DOWN 


SETF /PHIO 
CLOCKF CLK 
CHECK /S4 /S3 S2 /S1 /UP DOWN 


CLOCKF CLK 
CHECK /S4 /S3 /S2 /S1 UP DOWN 
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SETF PHI90 
CLOCKF CLK 
CHECK /S4 $3 /S2 /S1 /UP DOWN 


CLOCKF CLK 
CHECK S4 S3 /S2 /S1 UP DOWN 


SETF PHIO 
CLOCKF CLK 
CHECK $4 $3 /S2 Sl /UP DOWN 


CLOCKF CLK 
CHECK S4 S3 S2 Sl UP DOWN 


SETF /PHI90 
CLOCKF CLK 
CHECK S4 /S3 S2 Sl /UP DOWN 


CLOCKF CLK 
CHECK /S4 /S3 $2 Sl UP DOWN 


SETF /PHIO 
CLOCKF CLK 
CHECK /S4 /S3 S2 /Sl /UP DOWN 


CLOCKF CLK 
CHECK /S4 /S3 /S2 /Sl UP DOWN 


SETF SSET 
CLOCKF CLK 
CHECK /S4 /S3 /S2 /Sl UP DOWN 


SETF /SSET 
CLOCKF CLK 
CHECK /S4 /S3 /S2 /Sl UP DOWN 


CLOCKF CLK 
CHECK /S4 /S3 /S2 /Sl UP DOWN 


SETF PHIO 
CLOCKF CLK 
CHECK /S4 /S3 /S2 Sl UP /DOWN 


CLOCKF CLK 
CHECK /S4 /S3 S2 Sl UP DOWN 


SETF PHI90 
CLOCKF CLK 
CHECK /S4 $3 S2 Sl UP /DOWN 


CLOCKF CLK 
CHECK S4 S3 S2 Sl UP DOWN 


SETF /PHIO 
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CLOCKF CLK 
CHECK S4 $3 $2 /Sl1 UP /DOWN 


CLOCKF CLK. 
CHECK $4 $3 /S2 /S1 UP DOWN 


SETF /PHI90 
CLOCKF CLK 
CHECK S4 /S3 /S2 /Sl UP /DOWN 


CLOCKF CLK 
CHECK /S4 /S3 /S2 /Sl UP DOWN 


SETF PHIO 
CLOCKF CLK 
CHECK /S4 /S3 /S2 Sl UP /DOWN 


CLOCKF CLK 
CHECK /S4 /S3 S2 Sl UP DOWN 





SETF PHI90 
CLOCKF CLK 
CHECK /S4 S3 S2 S1 UP /DOWN 


CLOCKF CLK 
CHECK S4 S3 S2 S1 UP DOWN 


SETF /PHIO 
CLOCKF CLK 
CHECK S4 S3 S2 /Sl UP /DOWN 


CLOCKF CLK 
CHECK S4 S3 /S2-/S1 UP DOWN 


SETF /PHI90 
CLOCKF CLK 
CHECK S4 /S3 /S2 /Sl1 UP /DOWN 


CLOCKF CLK 
CHECK /S4 /S3 /S2 /Sl1 UP DOWN 


SETF PHIO 
CLOCKF CLK 
CHECK /S4 /S3 /S2 S1 UP /DOWN 


CLOCKF CLK 
CHECK /S4 /S3 S2 Sl UP DOWN 


SETF PHI90 
CLOCKF CLK 
CHECK /S4 S3 S2 Sl UP /DOWN 


CLOCKF CLK 
CHECK S4 S3 S2 S1 UP DOWN 
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SETF /PHIO 
CLOCKF CLK 
CHECK S4 $3 S2 /S1 UP /DOWN 


CLOCKF CLK 
CHECK S4 S3 /S2 /S1 UP DOWN 


SETF /PHI90 
CLOCKF CLK 
CHECK S4 /S3 /S2 /Sl UP /DOWN 


CLOCKF CLK 
CHECK /S4 /S3 /S2 /S1 UP DOWN 


; DESCRIPTION: 


;THIS PAL16R4 IMPLEMENTS A TWO CHANNEL SHAFT ENCODER OF THE 
;TYPE USED IN SPEED CONTROLLERS AND OPTICAL DEVICES. 

;BOTH THE "UP" AND "DOWN" OUTPUTS OF THE PAL DEVICE ARE 
;NORMALLY HIGH. 


;WHEN THE SIGNAL AT THE "PHIO" INPUT LEADS THE SIGNAL AT THE 
;"PHI90" INPUT, THE "DOWN" OUTPUT ALTERNATES BETWEEN HIGH AND 
;LOW LEVELS AT HALF THE "CLK" FREQUENCY RATE. ALSO, WHEN THE 
;SIGNAL AT THE "PHIO" INPUT LAGS THE SIGNAL AT THE "PHI90" 

; INPUT, THE "UP" OUTPUT ALTERNATES BETWEEN HIGH AND LOW 
;LEVELS AT HALF THE "CLK" FREQUENCY RATE. 


;THE SHAFT ENCODER FEATURES THE CONFIGURATION AND OUTPUT 
;POLARITY TO DRIVE AN 748193 TYPE UP/DOWN COUNTER. 


;THIS DESIGN WITH GLITCH-FREE OUTPUTS WILL BE EXTREMELY 
;USEFUL IN ELECTRICALLY NOISY ENVIRONMENTS. THE PINOUT IS 
;GIVEN AS A FIRST PROPOSAL AND CAN BE CHANGED ACCORDING TO 
;THE PC BOARD LAYOUT. 
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TITLE SHAFT ENCODER No. 2 
PATTERN P7098 

REVISION 1.00 

AUTHOR WILLY VOLDAN 
COMPANY MONOLITHIC MEMORIES 
DATE 09/09/82 


CHIP SE2 PAL16R8 


CLK PHIO PHI90 X4 NC NC NC NC /SSET GND 
/OC UD NC S4 S3 S2 S1 NC COUNT VCC 


EQUATIONS 
/S1 —-:= +/PHIO 
+ SSET 
/S3 := /PHI90 
+ SSET 
/S82 := Sl 
+ SSET 
/84 := §3 
+ SSET 
/COUNT : S1* S2*/S3* S4 


/S1*/S2* S$3*/S4 
/S1* S2*/S3*/S4* X4 
S1*/S2* S3* S4* x4 
Sl* S2* $3*/S4 
/S1*/S2*/S3* S4 
/S1* S2* S3* S4* X4 
S1*/S2*/S3*/S4* X4 


t+tetetteseil 


/UD := /S1* $2*/S3* S4 
/S1* S2* S3* S4 
/S1* S2* $3*/S4 
S1* S2* $3*/S4 
S1*/S2* $3*/S4 
S1*/S2*/S3*/S4 
S1*/S2*/S3* S4 
/S1*/S2*/S3* S4 


++ etteetil 


SIMULATION 


SETF OC SSET 
CLOCKF CLK 
CHECK /S1 /S2 /S3 /S4 


SETF /SSET /PHIO /PHI90O /X4 
CLOCKF CLK 
CHECK /Sl1 S2 /S3 S4 COUNT UD 


;CHECK FOR PHIO 
; INITIALIZE S1=L 


;CHECK FOR PHI90 
; INITIALIZE S3=L 


;CHECK FOR /S1l 
; INITIALIZE S2=L 


;CHECK FOR /S3 
; INITIALIZE S4=L 


;THIS OUTPUT ALTERNATES 





;BETWEEN HIGH AND LOW WITH 


;HALF OR QUARTER THE 
;CLK FREQUENCY 


:;THIS OUTPUT DETERMINES 
;IF SIGNAL PHIO LEADS 
;OR LAGS SIGNAL PHI90 


PAL16R8 
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SETF PHIO 
CLOCKF CLK 
CHECK Sl S2 /S3 $4 COUNT /UD 


CLOCKF CLK 
CHECK Sl /S2 /S3 $4 /COUNT UD 


SETF PHI90 
CLOCKF CLK 
CHECK Sl /S2 S3 S4 COUNT /UD 


CLOCKF CLK 
CHECK Sl /S2 S3 /S4 COUNT UD 


SETF /PHIO 
CLOCKF CLK 
CHECK /S1 /S2 S3 /S4 COUNT /UD 


CLOCKF CLK 
CHECK /S1 S2 $3 /S4 /COUNT UD 


SETF /PHI90 
CLOCKF CLK 
CHECK /°l S2 /S3 /S4 COUNT /UD 


CLOCKF CLK 
CHECK /S1 S2 /S3 S4 COUNT UD 


SETF PHIO 
CLOCKF CLK 
CHECK S1 S2 /S3 S4 COUNT /UD 


CLOCKF CLK 
CHECK Sl /S2 /S3 S4 /COUNT UD 


SETF PHI90 
CLOCKF CLK 
CHECK Sl /S2 S3 S4 COUNT /UD 


CLOCKF CLK 
CHECK S1 /S2 S3 /S4 COUNT UD 


SETF /PHIO 
CLOCKF CLK 
CHECK /Sl /S2 $3 /S4 COUNT /UD 


CLOCKF CLK 
CHECK /S1 S2 S3 /S4 /COUNT UD 


SETF /PHI90 X4 
CLOCKF CLK 
CHECK /Sl1 S2 /S3 /S4 COUNT /UD 
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CLOCKF CLK 
CHECK /S1 S2 /S3 S4 /COUNT UD 


SETF PHIO 
CLOCKF CLK 
CHECK Sl S2 /S3 S4 COUNT /UD 


CLOCKF CLK 
CHECK Sl /S2 /S3 S4 /COUNT UD 


SETF PHI90 
CLOCKF CLK 
CHECK Sl /S2 $3 S4 COUNT /UD 


CLOCKF CLK 
CHECK Sl /S2 $3 /S4 /COUNT UD 


SETF /PHIO 
CLOCKF CLK 
CHECK /Sl1 /S2 $3 /S4 COUNT /UD 





CLOCKF CLK 
CHECK /S1 $2 S3 /S4 /COUNT UD 


SETF /PH190 
CLOCKF CLK 
CHECK /S1 S2 /S3 /S4 COUNT /UD 


CLOCKF CLK 
CHECK /S1 S2 /S3 S4 /COUNT UD 


SETF PHIO 
CLOCKF CLK 
CHECK §1 S2 /S3 S4 COUNT /UD 


CLOCKF CLK 
CHECK Sl /S2 /S3 S4 /COUNT UD 


SETF PHI90 
CLOCKF CLK 
CHECK Sl /S2 S3 S4 COUNT /UD 


CLOCKF CLK 
CHECK Sl /S2 $3 /S4 /COUNT UD 


SETF SSET 
CLOCKF CLK 
CHECK /S1 /S2 /S3 /S4 COUNT /UD 


SETF /SSET /PHIO /PHI90 /X4 
CLOCKF CLK 
CHECK /S1 S2 /S3 $4 COUNT UD 


SETF PHI90 
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CLOCKF CLK 
CHECK /S1 S2 $3 $4 COUNT /UD 


CLOCKF CLK 
CHECK /S1 S2 $3 /S4 COUNT /UD 


SETF PHIO 
CLOCKF CLK 
CHECK Sl S2 S3 /S4 COUNT /UD 


CLOCKF CLK 
CHECK Sl /S2 S3 /S4 /COUNT /UD 


SETF /PHI90 
CLOCKF CLK 
CHECK Sl /S2 /S3 /S4 COUNT /UD 


CLOCKF CLK . 
CHECK Sl /S2 /S3 $4 COUNT /UD 


SETF /PHIO 
CLOCKF CLK 
CHECK /S1 /S2 /S3 S4 COUNT /UD 


CLOCKF CLK 
CHECK /Sl1 $2 /S3 S4 /COUNT /UD 


SETF PHI90 
CLOCKF CLK 
CHECK /S1 S2 S3 S4 COUNT /UD 


CLOCKF CLK 
CHECK /Sl1 S2 $3 /S4 COUNT /UD 


SETF PHIO 
CLOCKF CLK 
CHECK S1 S2 S3 /S4 COUNT /UD 


CLOCKF CLK 
CHECK Sl /S2 $3 /S4 /COUNT /UD 


SETF /PHI90 
CLOCKF CLK 
CHECK S1 /S2 /S3 /S4 COUNT /UD 


CLOCKF CLK 
CHECK Sl /S2 /S3 S4 COUNT /UD 


SETF /PHIO X4 
CLOCKF CLK 
CHECK /Sl1 /S2 /S3 S4 COUNT /UD 


CLOCKF CLK 
CHECK /S1 S2 /S3 S4 /COUNT /UD 
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SETF PHI90 
CLOCKF CLK 
CHECK /S1 S2 S3 S4 COUNT /UD 


CLOCKF CLK 
CHECK /Sl S2 S3 /S4 /COUNT /UD 


SETF PHIO 
CLOCKF CLK 
CHECK S1 S2 S3 /S4 COUNT /UD 


CLOCKF CLK . 
CHECK Sl /S2 $3 /S4 /COUNT /UD 


SETF /PHI90 
CLOCKF CLK 
CHECK Sl /S2 /S3 /S4 COUNT /UD 


CLOCKF CLK 
CHECK Sl /S2 /S3 S4 /COUNT /UD 





SETF /PHIO 
CLOCKF CLK 
CHECK /S1 /S2 /S3 S4 COUNT /UD 


CLOCKF CLK 
CHECK /Sl $2 /S3 S4 /COUNT /UD 


SETF PHI90 
CLOCKF CLK 
CHECK /S1 S2 S3 S4 COUNT /UD 


CLOCKF CLK 
CHECK /S1 S2 S3 /S4 /COUNT /UD 


SETF PHIO 
CLOCKF CLK 
CHECK S1 S2 S3 /S4 COUNT /UD 


CLOCKF CLK 
CHECK S1 /S2 S3 /S4 /COUNT /UD 


; DESCRIPTION 


;THIS PAL16R8 IMPLEMENTS A TWO CHANNEL SHAFT ENCODER OF THE 
;TYPE USED IN SPEED CONTROLLERS AND OPTICAL DEVICES. 


’ 


;THE "COUNT" OUTPUT OF THE PAL DEVICE IS NORMALLY HIGH. DURING 
;SHAFT ENCODING THIS OUTPUT ALTERNATES BETWEEN HIGH AND LOW. 

; INPUT "X4" SELECTS BETWEEN HALF (X4=H) OR QUARTER (X4=L) CLK 

; FREQUENCY OF THE "COUNTER" OUTPUT. 


. 
, 
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;OUTPUT "UD" DETERMINES WHETHER SIGNAL PHIO LEADS (UD=H) OR 
;LAGS (UD=L) SIGNAL PHI90. 

;THE SHAFT ENCODER FEATURES THE CONFIGURATION AND OUTPUT 
;POLARITY TO DRIVE A 74S697 TYPE UP/DOWN COUNTER. 

;THIS DESIGN WITH GLITCH-FREE OUTPUTS WILL BE EXTREMELY USEFUL 
; IN ELECTRICALLY NOISY ENVIRONMENTS. THE PINNING IS GIVEN AS 
;A FIRST PROPOSAL AND CAN BE CHANGED ACCORDING TO THE PC~BOARD 
; LAYOUT. 
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TITLE SHAFT ENCODER No. 3 (WITH INTERNAL 4-BIT UP/DOWN COUNTER) 
PATTERN P7099 


REVISION 1.00 
AUTHOR WILLY VOLDAN 
COMPANY MONOLITHIC MEMORIES 
DATE 09/09/82 


CHIP SE3 PAL20X10 


CLK PHIO PHI90 X4 /LD NC D3 D2 Dl DO /SSET 
GND /OC DOWN S4 S3 S2 §1 Q3 Q2 Q1 QC UP VCC 





EQUATIONS 
/Sl1 := /PHIO ;CHECK FOR PHIO 
+  SSET ; INITIALIZE S1=L 
/S2 := /Sl ;CHECK FOR Sl 
+  SSET ;INITIALIZE S2=L 
/S3 := /PHI9O ;CHECK FOP PHI90 
+  SSET ; INITIALIZE S3=L 
/84 := /$3 ;CHECK FOR S3 
+  SSET ; INITIALIZE S4=L 
/DOWN := S1* S2* $3*/S4* PHIO* PHI90* X4 ;PHIO LEADS PHI90:COUNT=F/2 
+ /S1*/S2*/S3* S4*/PHIO*/PHI90* X4 ;PHIO LEADS PHI90:COUNT=F/2 
z+: S1*/S2*/S3*/S4* PHIO*/PHIIO ;PHIO LEADS PHI90:COUNT=F/4 
+ /S1* S2* S3* S4*/PHIO* PHI9O ;PHIO LEADS PHI90:COUNT=F/4 
/UP := /S1*/S2* $3*/S4*/PHIO* PHI90 ;PHI90 LEADS PHIO:COUNT=F/4 
+  S1* $2*/S3* S4* PHIO*/PHI9O ;PHI90 LEADS PHIO:COUNT=F/4 
z+: S1*/S2* S3* S4* PHIO* PHT90* X4 ;PHI90 LEADS PHi0:COUNT=F/2 
+ /S1* S2*/S3*/S4*/PHIO*/PHI90* X4 ;PHI90 LEADS PHIO:COUNT=F/2 
/Q0 := /SSET* LD*/DO ;LOAD DO (LSB) 
+ /SSET*/LD*/Q0 ;HOLD QO 
:+: /SSET*/LD* UP*/DOWN ; DECREMENT 
+ /SSET*/LD*/UP* DOWN ; INCREMENT 
/Ql  := /SSET* LD*/D1 ;LOAD D1 
+ /SSET*/LD*/Q1 ;HOLD Ql 
:+: /SSET*/LD* UP*/DOWN*/QO ; DECREMENT 
+ /SSET*/LD*/UP* DOWN* QO ; INCREMENT 
/Q2  := /SSET* LD*/D2 ;LOAD D2 
+ /SSET*/LD*/Q2 ;HOLD Q2 
:+: /SSET*/LD* UP*/DOWN*/Q0*/Q1 ; DECREMENT 
+ /SSET*/LD*/UP* DOWN* QO* Ql ; INCREMENT 
/Q3 := /SSET* LD*/D3 ;LOAD D3 (MSB) 
+ /SSET*/LD*/Q3 ;HOLD Q3 
:+: /SSET*/LD* UP*/DOWN*/Q0*/Q1*/Q2 ;DECREMENT 
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+ /SSET*/LD*/UP* DOWN* QO0* Q1* Q2  ;INCREMENT 


SIMULATION 


SETF OC SSET /PHIO /PHI90 
CLOCKF CLK 
CHECK /Sl1 /S2 /S3 /S4 Q3 Q2 Q1 Q0 


SETF /SSET LD D3 /D2 Dl /DO 
CLOCKF CLK 
CHECK Q3 /Q2 Ql /Q0 


SETF /LD /X4 
CLOCKF CLK 
CHECK /Sl1 /S2 /S3 /S4 UP DOWN Q3 /Q2 Ql /Q0 


SETF PHIO 
CLOCKF CLK 
CHECK Sl /S2 /S3 /S4 UP DOWN Q3 /Q2 Q1 /Q0 


CLOCKF CLK 
CHECK Sl S2 /S3 /S4 UP /DOWN Q3 /Q2 Ql /Q0 


SETF PHI90 
CLOCKF CLK 
CHECK Sl S2 S3 /S4 UP DOWN Q3 /Q2 /Q1 Q0 


CLOCKF CLK 
CHECK Sl S2 S3 S4 UP DOWN Q3 /Q2 /Q1 Q0 


SETF /PHIO 
CLOCKF CLK 
CHECK /S1 S2 $3 S4 UP DOWN Q3 /Q2 /Q1 QO 


CLOCKF CLK PAL20X10 
CHECK /S1 /S2 $3 S4 UP /DOWN Q3 /Q2 /Q1 Q0 


SETF /PHI90 
CLOCKF CLK 
CHECK /S1 /S2 /S3 S4 UP DOWN Q3 /Q2 /Q1 /Q0 


CLOCKF CLK 
CHECK /S1 /S2 /S3 /S4 UP DOWN Q3 /Q2 /Q1 /Q0 


SETF PHIO 
CLOCKF CLK 
CHECK Sl /S2 /S3 /S4 UP DOWN Q3 /Q2 /Q1 /Q0 


CLOCKF CLK 
CHECK Sl S2 /S3 /S4 UP /DOWN Q3 /Q2 /Q1 /Q0 


SETF PHI90 
CLOCKF CLK 
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CHECK S1 S2 S3 /S4 UP DOWN /Q3 Q2 Q1 Q0 


CLOCKF CLK 
CHECK S1 S2 S3 S4 UP DOWN /Q3 Q2 Q1 Q0 


SETF /PHIO 
CLOCKF CLK 
CHECK /Sl S2 $3 S4 UP DOWN /Q3 Q2 Ql Q0 


CLOCKF CLK 
CHECK /S1 /S2 S3 S4 UP /DOWN /Q3 Q2 Ql QO 


SETF X4 /PHI90 
CLOCKF CLK 
CHECK /S1 /S2 /S3 S4 UP DOWN /Q3 Q2 Q1 /Q0 


CLOCKF CLK 
CHECK /S1 /S2 /S3 /S4 UP /DOWN /Q3 Q2 Q1 /Q0 


SETF PHIO 
CLOCKF CLK 
CHECK Sl /S2 /S3 /S4 UP DOWN /Q3 Q2 /Ql Q0 





CLOCKF CLK 
CHECK Sl S2 /S3 /S4 UP /DOWN /Q3 Q2 /Q1 Q0 


SETF PHI90 
- CLOCKF CLK 
CHECK Sl S2 S3 /S4 UP DOWN /Q3 Q2 /Q1 /Q0 


CLOCKF CLK 
CHECK Sl $2 $3 S4 UP /DOWN /Q3 Q2 /Q1 /Q0 


SETF /PHIO 
CLOCKF CLK 
CHECK /Sl1 S2 S3 S4 UP DOWN /Q3 /Q2 Q1 QO 


CLOCKF CLK 
CHECK /S1 /S2 S3 S4 UP /DOWN /Q3 /Q2 Q1 Q0 


SETF /PHI90 
CLOCKF CLK 
CHECK /Sl1 /S2 /S3 S4 UP DOWN /Q3 /Q2 Ql /Q0 


CLOCKF CLK 
CHECK /S1 /S2 /S3 /S4 UP /DOWN /Q3 /Q2 Q1 /Q0 


SETF PHIO 
CLOCKF CLK 
CHECK Sl /S2 /S3 /S4 UP DOWN /Q3 /Q2 /Q1 QO 


CLOCKF CLK 
CHECK Sl S2 /S3 /S4 UP /DOWN /Q3 /Q2 /Q1 Q0 
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SETF PHI90 
CLOCKF CLK 
CHECK Sl S2 S3 /S4 UP DOWN /Q3 /Q2 /Q1 /Q0 


CLOCKF CLK 
CHECK S1 S2 S3 S4 UP /DOWN /Q3 /Q2 /Q1 /Q0 


SETF /PHIO 
CLOCKF CLK 
CHECK /Sl S2 $3 S4 UP DOWN Q3 Q2 Q! Q0 


CLOCKF CLK 
CHECK /S1 /S2 S3 S4 UP /DOWN Q3 Q2 Q2 QO 


SETF /PHI90 
CLOCKF CLK 
CHECK /Sl /S2 /S3 S4 UP DOWN Q3 Q2 Q1 /Q0 


SETF SSET /PHIO /PHI90 
CLOCKF CLK 


SETF /SSET LD /D3 D2 /D1l DO 
CLOCKF CLK 
CHECK /Q3 Q2 /Ql Q0 


SETF /LD /X4 
CLOCKF CLK 
CHECK /S1 /S2 /S3 /S4 UP DOWN /Q3 Q2 /Q2 Q0 


SETF PHI90 
CLOCKF CLK 
CHECK /S1 /S2 $3 /S4 UP DOWN /Q3 Q2 /Q1 Q0 


CLOCKF CLK 
CHECK /Sl1 /S2 $3 S4 /UP DOWN /Q3 Q2 /Q1 Q0 


SETF PHIO 
CLOCKF CLK 
CHECK Sl /S2 S3 S4 UP DOWN /Q3 Q2 Q1 /Q0 


CLOCKF CLK 
CHECK Sl S2 S3 S4 UP DOWN /Q3 Q2 Q1 /Q0 


SETF /PHI90 
CLOCKF CLK 
CHECK Sl S2 /S3 S4 UP DOWN /Q3 Q2 Ql /Q0 


CLOCKF CLK 
CHECK Sl S2 /S3 /S4 /UP DOWN /Q3 Q2 Ql /Q0 


SETF /PHIO 


CLOCKF CLK 
CHECK /Sl S2 /S3 /S4 UP DOWN /Q3 Q2 Q1 QO 
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CLOCKF CLK 
CHECK /S1 /S2 /S3 /S4 UP DOWN /Q3. Q2 Ql G0 


SETF PHI90 
CLOCKF CLK 
CHECK /S1 /S2 S3 /S4 UP DOWN /Q3 Q2 Ql Q0 


CLOCKF CLK 
CHECK /S1 /S2 S3 S4 /UP DOWN /Q3 Q2 Q1 Q0 


SETF PHIO 
CLOCKF CLK 
CHECK Sl /S2 $3 S4 UP DOWN Q3 /Q2 /Q1 /Q0 


CLOCKF CLK 
CHECK S1 S2 S3 S4 UP DOWN Q3 /Q2 /Q1 /Q0 


SETF /PHI90 
CLOCKF CLK | 
CHECK Sl S2 /S3 S4 UP DOWN Q3 /Q2 /Q1 /Q0 





CLOCKF CLK 
CHECK S1 S2 /S3 /S4 /UP DOWN Q3 /Q2 /Q1 /Q0 


SETF X4 /PHIO 
CLOCKF CLK 
CHECK /Sl S2 /S3 /S4 UP DOWN Q3 /Q2 /Q1 Q0 


CLOCKF CLK 
CHECK /Sl /S2 /S3 /S4 /UP DOWN Q3 /Q2 /Q1 Q0 


SETF PHI90 
CLOCKF CLK 
CHECK /S1 /S2 $3 /S4 UP DOWN Q3 /Q2 Q1 /Q0 


CLOCKF CLK 
CHECK /S1 /S2 S3 S4 /UP DOWN Q3 /Q2 Q1 /Q0 


SETF PHIO 
CLOCKF CLK 
CHECK Sl /S2 S3 S4 UP DOWN Q3 /Q2 Q1 Q0 


CLOCKF CLK 
CHECK Sl S2 S3 S4 /UP DOWN 93 /Q2 Q1 Q0 


SETF /PHI90 
CLOCKF CLK 
CHECK Sl S2 /S3 S4 UP DOWN Q3 Q2 /Q1 /Q0 


CLOCKF CLK 
CHECK Sl S2 /S3 /S4 /UP DOWN Q3 Q2 /Q1 /Q0 


SETF /PHIO 
CLOCKF CLK 
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CHECK /S1 S2 /S3 /S4 UP DOWN 93 Q2 /Q1 Q0 


CLOCKF CLK 
CHECK /S1 /S2 /S3 /S4 /UP DOWN Q3 Q2 /Q1 Q0 


SETF PHI90 
CLOCKF CLK 
CHECK /S1 /S2 $3 /S4 UP DOWN Q3 Q2 Q1 /Q0 


CLOCKF CLK 
CHECK /S1 /S2 S3 S4 /UP DOWN Q3 Q2 Q1 /Q0 


SETF PHIO 
CLOCKF CLK 
CHECK Sl /S2 S3 S4 UP DOWN Q3 Q2 Q1 Q0 


CLOCKF CLK 
CHECK Sl S2 S3 S4 /UP DOWN Q3 Q2 Q1 Q0 


SETF /PHI90 
CLOCKF CLK 
CHECK Sl S2 /S3 S4 UP DOWN /Q3 /Q2 /Q1 /Q0 


CLOCKF CLK 
CHECK Sl S2 /S3 /S4 /UP DOWN /Q3 /Q2 /Ql /Q0 


SETF /PHIO 
CLOCKF CLK 
CHECK /Sl1 S2 /S3 /S4 UP DOWN /Q3 /Q2 /Q1 QO 


SETF LD /D3 /D2 /D1l DO 
CLOCKF CLK 
CHECK /Q3 /Q2 /Q1 Q0 


SETF /D3 /D2 Dl DO 
CLOCKF CLK 
CHECK /Q3 /Q2 Q1 QO 


SETF /D3 D2 /D1 DO 
CLOCKF CLK 
CHECK /Q3 Q2 /Q1 Q0 


SETF /D3 D2 Dl DO 
CLOCKF CLK 
CHECK /Q3 Q2 Q1 Q0 


SETF D3 /D2 /D1 DO 
CLOCKF CLK 
CHECK Q3 /Q2 /Q1 Q0 


SETF D3 /D2 Dl DO 
CLOCKF CLK 
CHECK Q3 /Q2 Ql QO 
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SETF D3 D2 /D1l DO 
CLOCKF CLK 
CHECK Q3 Q2 /Q1 90 


SETF D3 D2 Dl DO 
CLOCKF CLK 
CHECK Q3 Q2 Q1 Q0 


; DESCRIPTION 


7;THIS PAL20X10 IMPLEMENTS A TWO CHANNEL SHAFT ENCODER WITE AN 

; INTERNAL 4-BIT UP/DOWN COUNTER. 

;BOTH THE "UP" AND "DOWN" OUTPUTS OF THE PAL DEVICE ARE NORMALS 
7HIGH. 

;WHEN THE SIGNAL AT THE "PHIO" INPUT LEADS THE SIGNAL AT THE 
;"PHI90" INPUT, THE "DOWN" OUTPUT ALTERNATES BETWEEN HIGH AND 
;LOW LEVELS AND THE COUNTER WILL COUNT DOWN. WHEN THE SIGNAL 
;AT THE "PHIO" INPUT LEADS THE SIGNAL AT THE "PHI90" INPUT, 
;THE "UP" OUTPUT ALTERNATES BETWEEN HIGH AND LOW LEVELS AND 
;THE COUNTER WILL COUNT UP. 





;INPUT "X4" SELECTS BETWEEN HALF (X4=H) OR QUARTER (X4=L) CLK 
;FREQUENCY OF THE COUNTER OUTPUTS. 


;THE INTERNAL 4-BIT SYNCHRONOUS COUNTER HAS COUNT UP, COUNT DOWN 
;CAPABILITIES. ALSO, THE COUNTER CAN PARALLEL LOAD AND HOLD DATA 
; INDEPENDENTLY OF THE SHAFT ENCODER SECTION. THE REGISTERS ARE 
;SYNCHRONOUSLY INITIALIZED WHEN /SSET IS HELD LOW. 


; 
;THE CONTROL INPUTS PROVIDE THESE OPERATIONS WHICH OCCUR 
;SYNCHRONOUSLY AT THE RISING EDGE OF THE CLOCK. 
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Monolithic Memories/Advanced Micro Devices is a major partici- 
pant in the Command, Communication and Control (C*) military 
marketplace. Applications include programmable obsolescence 
solutions, surface mount designs, radar systems, missile guid- 
ance, aircraft graphic processors, microprocessor support logic, 
state-machine designs, military computer hardware, redundant 
solutions for backup systems, and reconfigurable hardware using 
Logic Cell Arrays. 


Programmable products offer an excellent solution to military ob- 
solescence difficulties. The dynamics of the semiconductor in- 
dustry has shortened product life cycles, creating a sunset tech- 
nology market. Programmable products offer a quick, low-cost 
replacement for obsolete parts, through direct pin compatibility or 
board emulation. 


MMI/AMD has long been a leader in military bipolar PAL, PROM 


and FIFO products. We will soon be introducing military zero- 
power CMOS PAL devices and military CMOS Logic Cell Arrays. 
With pin and function compatible PAL products offered in both 
bipolar and CMOS technologies, Monolithic Memories offers 


backup systems (redundant solutions) in case of a failure with the 
original design. So, whether the customer needs high-speed 
bipolar or low-power CMOS, we will be the only company offering 
both technologies for the military marketplace. 


Surface mount technology is an assembly technique when circuit 
boards are assembled with components bonded to metal pads on 
the board surface, instead of being inserted into through-holes 
like conventional Dual In-line packages (DIP). Surface mount 
components can be mounted on both sides of a circuit board, 
doubling the functional density. This lowers cost by reducing the 
number or size of boards required. The military need for smaller 
or denser-packed systems make surface mounting a must. We 
meet this challenge offering a full line of products in surface mount 
packaging, including Cerpack (W) and Leadless Chip Carriers 
(LCC). 


For more information on military products and applications, see 
the military datasheet section (page 5-415). 
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20 Inputs 20 Inputs 
PAL20L8 10 Outputs 10 Outputs 
PAL20X10 PAL20R6 Async. Clock PT Sharing 
PAL20X8 PAL20R4 Async. Set/Reset Polarity 
PALI6L8 PAL20X4 Register Bypass Preload 
PAL16R8 Polarity Three-State 
PALI6R6 20 Inputs Preload Security Fuse 
20 Inputs 8 Outputs Three-State 
PAL16R4 10 Outputs 8 PT/Output Security Fuse 
4 PT/Output Three-State 
16 Inputs XOR Security Fuse 
8 Outputs Preload 
8 Product Terms Three-State 
per Output Security Fuse 
Three-State 


Security Fuse 


Note: PT = Product Term 


Figure 1. Military Configurable Architectures 
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Spurred on by America’s Strategic Defense Initiative, military 
equipment manufacturers are looking for radiation hardened 
parts to put into strategic weapon and space systems. Ithas been 
stated that some level of radiation tolerance will be required in up 
to 50% of all military applications by 1990. Due tothis increased 
need for radiation-resistant integrated circuits, MMI and AMD 
have embarked on a program to determine what radiation dose 
rates its products will withstand, and to provide increasingly 
radiation-tolerant products. 


The Galileo probe to Jupiter was bombarded with 100,000 rads 
(Si) within a one hour period. In contrast, the earth’s surface 
absorbs 0.1 rads in a year. Military aircraft must withstand a 
minimum of 2,500 rads total dose. A nuclear explosion 
emits transient- ionizing radiation of at least 10° rads (Si) per 
second. Radiation dosage and type varies greatly with the envi- 
ronment. Table 1 shows radiation dosages and semiconductor 
tolerances. 


Radiation effects can be divided into two major categories; 
natural space effects and man-made nuclear effects. 


Space radiation consists mainly of cosmic rays and charged 
particles in radiation belts (Figure 1). Cosmic rays from the sun, 
made up of protons and nuclei, cause single event upsets (SEU). 
A SEU occurs when a high-energy particle passes through an in- 
tegrated circuit, changing an internal logic state. 


REFERENCES 


MAX. SPACE REQUIREMENT —— 
JUPITER SATELLITE PROBE ——* 


STRATEGIC WEAPONS REQUIREMENT ——* 
AIRCRAFT REQUIREMENT —— 
90% FATAL HUMAN EXPOSURE ——* 





Radiation belts around planets, such as the Van Allen Belt circling 
the earth, carry trapped protons and electrons. These charged 
particles accumulate over time in CMOS devices, causing thresh- 
old voltage shifts. In oxide-isolated bipolar circuits this ionizing 
radiation (total dose) will cause an increase in leakage current. 


A nuclear explosion irradiates a large neutron fluence and high 
energy gamma rays (Figure 2). Neutron radiation damages the 
semiconductor lattice structure, creating recombination sites for 
minority carriers, which decreases bipolar transistor current gain. 
Neutrons induce no change to CMOS devices, which are 
majority-carrier structures. 


Nanoseconds after a nuclear blast, short, high-intensity electro- 
magnetic pulses lash out. Difficult to stop, these gamma rays 
generate huge numbers of holes and electrons, causing large 
currents, destroying both bipolar and CMOS circuits. CMOS is 
especially susceptible due to its inherently low latch-up sensitivity 
of 10° rads per second. 


Emerging technologies and process improvements will make the 
basic military hardness requirements for semiconductor devices 
a reality (Table 2). Gallium Arsenide (GaAs) products now meet 
a no-upset dose rate of 5 X 10" rads per second and a total dose 
of 10° rads. While junction-isolated bipolar devices also meet in- 
herently high dose rates of up to 10'° rads/second and total doses 
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Table 1. Radiation Perspectives 
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of up to 10’ rads, radiation-hardened CMOS circuits are not far 
behind with a typical dose rate of 10'° rads/second and a total 
dose of 10° rads (Table 3). 


Process changes to improve the radiation tolerance of CMOS 
devices are not without penalties. Guard rings reduce latch-up 
sensitivity and leakage paths, but decrease circuit density. Thin- 
ner gate oxides increase radiation resistance but cause reliabil- 
ity problems due to hot electron effects. A dry oxide process 
produces less defects but increases manufacturing time. Done 
correctly, these sacrifices will produce a marketable 


radiation-hardened CMOS process. 
NUCLEAR! SPACE 
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Dose Rate 
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Table 2. Basic Military Hardness Requirements 








Single Event Upset 
involving electrons 
(bits/square 
centimeter) 












Single Event Upset 
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At Monolithic Memories, radiation data has been obtained for 
neutron fluence and dose rate effects. Products representing 
bipolar junction and oxide-isolated processes passed neutron 
irradiation levels of up to 10'* neutrons per square centimeter 
(Table 4). Dose rate data obtained on the junction-isolated 
products showed recovery in fifty to seventy microseconds from 
aone microsecond pulse of 2 X 10° rads (Si) per second (Table 
5). MMI and AMD will move from radiation testing towards new 
radiation-tolerant designs and processes as the need for radia- 
tion-hardened products increases. 
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PASSED PASSED 
DATASHEET FUNCTIONAL UP TO 
PRODUCT DESCRIPTION PROCESS LIMITS UP TO (neutrons/cm?) 
(neutrons/cm?) 


PAL16R4AM 20 Pin, 30 ns Junction Isolated 4.5 X 10% 1x 10" 
PAL Device Diffused Bipolar 
PAL16R4BM 20 Pin, 20 ns Shallow-Junction Isolated 4.5 x 10% 1x 10" 
PAL Device Implanted Bipolar 
PAL16R4DM 20 Pin, 15 ns Oxide Isolated 4.5x 10" 1x 10% 
PAL Device Bipolar 
PAL20R4AM 24 Pin, 30 ns Junction Isolated 8x 10'S 1x 10" 
PAL Device Diffused Bipolar 
PAL20RA10M 24 Pin, Junction Isolated 1x 10" 
Asynchronous Diffused Bipolar 
PAL Device 
PAL32R16M 1500 Gate, 40 Pin Junction Isolated 1x 10" 1x 10" 
MegaPAL™ Device Diffused Bipolar 
§3S1681 16K PROM Junction Isolated 4.5 x 10% 4x10" 
Diffused Bipolar 
53RA1681A 16K Registered Junction Isolated 8 x 10" 1x 10" 
PROM Diffused Bipolar 
53$3281 32K PROM Junction Isolated 1x 10" 1x 10" 


Diffused Bipolar 
Table 4. Neutron Bombardment Data 
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PRODUCT DESCRIPTION PROCESS DOSE RATE PHOTOCURRENT 
(rads/second) (amps) 

PAL12H6EM 20 Pin Small Junction Isolated 2.6 x 10°° 4.2 
PAL Device Diffused Bipolar 

PAL14H4M 20 Pin Small Junction lsolated 2.1 x 10'° 3.9 
PAL Device Diffused Bipolar 

PAL10L8M 20 Pin Small Junction Isolated 2.5 x 10°° 4.2 
PAL Device Diffused Bipolar 

PAL12L6M 20 Pin Small Junction Isolated 2.2 x 10'° 3.8 
PAL Device Diffused Bipolar 

PAL14L8M 24 Pin Small Junction Isolated 1.2 x 10° 3.6 
PAL Device Diffused Bipolar 

PAL16L6M 20 Pin Small Junction Isolated 2.3 x 10'° 4.0 
PAL Device Diffused Bipolar 

PAL16L8M 20 Pin Standard Junction Isolated 2.5 x 10°° 5.3 
PAL Device Diffused Bipolar 

PAL16R8M 20 Pin Standard Junction Isolated 1.9 x 10'° 5.3 
PAL Device Diffused Bipolar 

PAL16R6M 20 Pin Standard Junction Isolated 2.4 x 10°° 5.7 
PAL Device Diffused Bipolar 

PALi6R4M 20 Pin Standard Junction Isolated 2.1 x 10°? 5.6 
PAL Device Diffused Bipolar 

PAL20L10M 24 Pin Exclusive-OR Junction Isolated 2.1 x 10°° 6.7 
PAL Device Diffused Bipolar 

PAL20X10M 24 Pin Exclusive-OR Junction Isolated 1.9 x 10’ 5.9 
PAL Device Diffused Bipolar 

PAL20X8M 24 Pin Exclusive-OR Junction Isolated 2.1 x 10'° 5.8 
PAL Device Diffused Bipolar 

Table 5. Transient Upset Test Data 
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Notes 


Buried registers and 
exclusive-OR gates 
make this PAL device 
a sound foundation 
on which to build 
state machines that 
conserve I/O pins. 


DESIGN APPLICATIONS 


Programmable Logic Device 


Preserves Pins, 


Chris Jay 


Product Terms 


Monolithic Memories Inc., 2475 Mission College Blvd., Santa Clara, CA 95054; (408) 970-9700. 


Programmable-logic devices continue to evolve, re- 
sponding to designers’ demands for improved per- 
formance, functionality, and flexibility. Among the 
newest is the PAL32VX10, a programmable-array 
logic (PAL) device that enhances the architecture 
of the earlier PAL22V 10. The result is a more flexi- 
ble general-purpose device. 

Asa functional and architectural superset of the 
22V10, the 32VX10 can be programmed to carry 
out any function that its predecessor can. It also 
adds two features: regis- 
ters that can be buried 
for designing state ma- 
chines, and exclusive- 
OR (XOR) gates for 
counter and JK func- 
tions (ELECTRONIC 
DESIGN, Jan. 8, p. 45). 

The number of inputs 
to the fuse array of the 
latest programmable- 
array logic (PAL) de- 
vice has been increased to 32 by the addition of ten 
buried feedback paths. A macrocell output pro- 
grammed as an input accesses the chip’s feedback 
path, which is buried within the fuse array. 

As a result, buried state machines can be config- 
ured in the device. Any logic design calling for 
many input pins can be configured with dedicated 
input pins and any of the of macrocell I/O pins 
(Fig. 1). This option leaves pins free for configuring 
outputs as inputs, and other Boolean functions may 
be realized in buried registers. Thus, functionality 
of the macrocell need not be lost. 

The macrocell’s XOR gate (denoted by the X in 
the part number) simplifies state-machine design 
because it promotes gate product-term economy. 
This economy is the ability to create complex state- 
machine designs without exhausting the limited 
number of product terms in each macrocell. 

For example, there are ten D-type registers in the 
32V X10, one in each of its ten macrocells. The 
XOR input to each register and no more than three 





Reprinted with permission from Electronic Design, 
Vol. 35, No. 13 (May 28, 1987). Copyright 1987 Hayden Publishing Co., Inc. 


product terms transform a D-type register toa JK 
register. Once the JK register is configured, subset 
functions like the T (toggle) and an SR (set-reset) 
may be derived. 

The macrocell’s product-term economy can be a 
result of the hold state inherently defined in the JK 
truth table—a function not offered by the D-type 
register alone (see the table). For a D-type register 
to hold its contents in a “‘set”’ state during several 
states, additional product terms must hold that 
condition until it is relaxed. In many tasks, apply- 
ing product terms as holding functions could ex- 
haust the limited supply in a given cell and compro- 
mise a state-machine design. 

A practical task for the hold function is ina line- 
sync generator. The blanking and line-sync pulses 
are programmed as JK functions, and a hold condi- 
tion ts invoked for their active duration. 

The XOR gate also influences the output polari- 
ty of a logic signal. Active-high or -low logic signals 
may be created at the output pins. In PAL devices 
without polarity control, the output can be inverted 
by applying DeMorgan’s Theorem to a sum-of- 
products architecture. This approach, however, 
brings with it the added baggage of more product 
terms. For example, one term containing N literals 
(Boolean variables) is inverted by creating the sum 
of N negated literals, with each negated literal us- 
ing a product term. 

Each of the 32VX10’s ten macrocells is config- 
ured around asum-of-products and XOR input toa 
D-type register, which is clocked from a dedicated 
input pin. Each macrocell can be programmed for a 
combinational or registered output. One product 
term, associated with the control to bypass the reg- 
ister, selects the path dynamically, if needed. 

Typically, the registers are programmed as a re- 
fresh counter for dynamic RAM. When a host mi- 
croprocessor must access the RAM for read and 
write cycles, its address-line outputs bypass the reg- 
isters, which contain the current refresh count for 
the RAMs. Instead, the outputs address the memo- 
ry inputs directiy. Afterwards, the address outputs 
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from the registers are enabled again to refresh the RAM. 

In fast video applications, a similar philosophy is in ef- 
fect. The host microprocessor bypasses the contents of 
the video-memory address counter to update the video 
RAM. It is important, though, that any counter-incre- 
ment signal be disabled when the registers are bypassed; 
an active clock input would overwrite any contents of the 
D-type registers. In a video RAM system, this might not 
be a disadvantage, but for dynamic-RAM arrays, over- 
writing a refresh counter could lead to loss of data. 

One product term might also dynamically influence 
the logic signal’s output polarity. With a registered out- 
put, a dedicated register-output-polarity multiplexer se- 
lects either a Q or Q output, routing the selected signal to 
the output pin. One programmable polarity fuse selects 
the required path. This fuse does not affect the program- 
mable polarity associated with combinational outputs. 

The global product terms from the fuse array are dedi- 
cated to resetting the ten registers synchronously and also 
presetting them asynchronously. The action of these two 
product terms can be selected by the programmable-po- 
larity multiplexer for individual registered cells. 

For instance, for a registered output, the asynchro- 


eo 
Clock input 


Fuse 
array 


Register bypass control 


Distributed product 
terms 8, 10, 12,14, and 16 


Buried register 
feedback 


Asynchronous preset 





4. Each of the ten macrocells in Monolithic Memories’ PAL32VX10 PAL device 
is driven by an exclusive-OR gate, a feature that promotes frugal use of 
product terms. For example, the gates reduce the number of terms needed 
to implement JK registers. The macrocells can also feed output and input 
signals back to buried registers. As a result, designing state machines with 


the device is easier. 








nous-preset gate can be programmed to a synchronous- 
reset gate or vice versa. The single product-term input to 
the XOR gate can select the output polarity for either 
combinational or registered outputs. 

The second added feature of the new PAL device, its 
conversion of an internal D-type register to a JK-register 
cell, is demonstrated by five Boolean equations formed 
from a sum of products: 


*J+Q*K(1) 
eek) 
+:Q*I+Q* K (3) 
4:O*J+Q*K (4) 
o= = Q:+:Q*I+Q* K(5) 


where the Boolean operators are: 


Solace 
HT 
21010010 


~~ for an inversion; 

:-+: for an XOR operation; 

+ for an OR operation; 

* foran AND operation; and 

:= is the state-machine operator ‘‘becomes equal to” 


Note that all five Boolean equations have true or com- 
plement conditions for either J or K; and a mixture of J 
and K or J and K functions. 
Three of the equations require 
the XOR function. Moreover, J 
and K active-high conditions 
can be used in equation 3 only 
with an XOR operator. 

The Boolean operators are 
compatible with the Palasm as- 
sembler-support software. Pa- 
lasm is a computer-aided 
(CAD) tool that generates pro- 
gramming information for 
PAL devices. The Boolean ex- 
pression information describes 
the operation of the device to be 
programmed. 

A good design specification 
done by a CAD tool like Pa- 
lasm should come with a com- 
plete comment field to describe 
the function of the Boolean 
equations. Palasm accepts a 
comment field in the descrip- 
tion of a logic-design file. When 
a semicolon precedes a state- 
ment, Palasm ignores that 
statement during the assembly 
process. 

The functional description of 
the logic circuits is in terms of 
Boolean statements. A later 
version of Palasm will accept 


Register 


polarity 
control 
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state-machine logic descriptions on a higher level. Then, 
designers will be able to deal with logic statements con- 
taining JK functions without describing them as a series 
of a sum of products, as is now the case. 

The ability to create subsets of a JK register gives the 
logic designer more flexibility to design sequential-logic 
circuits. One of the subset functions, for example, is the 
toggle (T) flip-flop, which lends itself to counter applica- 
tions. Binary counters require that the contents of any 
general register, Q,,, remain ina hold state until the prod- 
uct term ofall the lesser significant registers goes to a log- 
ic high. Then the register must change state or toggle after 
the arrival of the next clock pulse. The T function may be 
created by tying the J and K inputs together and treating 
this composite as the T input. A hold state is maintained 
when T = 0, and achange of state results when T = 1. 

The application of the toggle function to binary 
counters can be developed by considering a general regis- 
ter Qy in a binary count sequence, where the lesser signifi- 
cant registers are Qy.;, Qu». - -Q), Qo. Using equation 3, 
the JK description for a general register is given as 


Qn = Qu st: K * QytJ * Qn (6) 


A hold condition must apply when the product term of 
all lesser significant registers is a logic zero, or: 


Qn-1 * Qn-2 *+ + Qi * Q =O 
and a toggle condition must apply when 
Qu-1 * Qn—2 *- --.Q) *Qo= 1 
This develops to 


J=K = Qy-1 * Qu_-2 *- Qi * Q (7) 


which represents the T input to a toggle register. Substi- 
tuting equation 7 into 6 leads to 


Qn 2 Qn iF: Qn * (Qu-1 * Qu—2-- Qu * Qo) 
+ Qk 
* (Qn-1 * Qn—2 *: - -Qy * Qo) (8) 


Substituting equation 7 into 6 for J and K inputs leads 
to 


Qu = Qu :+: (Qu + Qu) 
* Qy-1 * Qu—2* - -Qi * Qo (9) 


Because (Qy + Qx) = 1, equation 9 reduces to 
Qn := Qn (hold Qy) 
+: Qu * Qn—2 *. « Q) * Qo (toggle Qy) (10) 


Equation 10 is a general equation for a register in a bi- 
nary counter. Only two product terms are used because 
the XOR gate makes it possible to invoke the toggle func- 
tion as a JK subset. Without the XOR gate, the general 
equation for register N would be 
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Qn = Q% 

* Qn-1 * Qn-2 *. Qu * Qo (toggle Qn) 
+ Quy * Qy_j (hold Qu) 

+ Qu * Qg_5 (hold Qu) 


+ Q, * Q, (hold Qn) 
+ Qu * Qo (hold Qy) (11) 


Equation 11 requires N product terms as hold func- 
tions. This means that for a 10-bit counter, the most sig- 
nificant register, Qy, would need nine individual product 
terms as hold functions and one additional product term 
for a toggle function. 
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2. Thanks to its easy implementation of the JK func- 
tion and its subsets, the 32VX10 makes a compact 
line-address counter and line-sync generator for 
video tasks. 
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In contrast, equation 10 makes use of the XOR func- 
tion, requiring one product term to hold and one product 
term to toggle. No matter how significant the register in 
the count sequence, the inclusion of an XOR function in 
the equation eliminates large numbers of product terms. 
The advantage of the XOR gate increases for a 10-bit, up- 
down counter because the required number of product 
terms doubles. Q, would normally need 20 product terms 
but with an XOR gate, only three do the job. 

A practical outcome of configurable JK and T registers 
is the design of a line-sync generator circuit for control- 
ling a video RAM, and for synchronizing line-display in- 
formation to a CRT interface (Fig. 2). Memory-address 
outputs MA, through MA,, which are derived from an 
internal counter, drive the address inputs to the video 
RAM. One complete line of video information is dis- 
played and surrounded by line-sync and blanking pulses 
from two macrocells programmed as JK registers. 

The internal memory-address counter is programmed 
to address an 80-character screen and then issue an ac- 
tive-high blanking pulse. The line-sync pulse goes active- 
high when the counter reaches a decimal count of 79 (4F 
hex) and is turned off at a decimal count of 103 (67 hex). 
The blanking pulse is negated and the counter returns to 
zero after reaching a decimal count of 107 (6B hex). 

The JK functions for turning on line-sync and blanking 
pulses come from decoding the count equations. The 
blanking pulse overlaps the line-sync pulse to give front 
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and back porches to the video line-scan signal. Also, be- 
cause the host shares access to the video memory, it can 
update video information. In that case, a Write (WR) is 
driven low to produce three-state video-address outputs. 

When at work, the microprocessor can change the con- 
tents of the video RAM during frame- or line-signal fly- 
back periods to avoid displaying random data during ac- 
tive display periods. To make such changes possible, the 
WR input can be derived from a smaller combinational 
PAL device that decodes the microprocessor’s address 
lines and maps the video RAM into its address space. 

An asynchronous reset input (RST) powers up the 
PAL to a known state. With its Count Enable Input 
(CED) and Output (CEO) signals, the device can cascade 
with other circuitry, such as a frame-sync generator anda 
video-shift register. 

The memory-address counters are programmed as T 
flip-flops; line-sync, blanking and CEO functions as JK 
registers. An Enable-State Input (ENST) enables input 
states 0 through 2 as count enable inputs in preference to 
the CEI signal. When in the low state, ENST qualifies in- 
put states O through 2 as active-high count-enable inputs. 

To complement the line-sync generator design, the 
32VX10 is programmed as a 7-bit video-shift generator. 
Seven registers shift video data and three are pro- 
grammed as a 3-bit counter to control loading and shift- 
ing of video data in the shift register. When all are high, 
counter output states 0 through 2 cause the shift register 
to load and the line-sync generator to increment. 

For the example circuit, another 32VX10 is pro- 
grammed to support the line-sync generator. As with the 
other 32VX 10s, the frame-sync output is programmed as 
a JK function. The internal counter accesses row-charac- 
ter information as well as the contents of the video RAM. 
The 32VX10 video-shift register accepts dot information 
from the character generator and supplies a parallel-to- 
serial output for the video display. Any parallel-to-serial 
output format can be handled by the 32VX10. 

Readers interested in the PAL design specifications 

may contact the author for further details. 
Chris Jay, logic-cell-array applications manager with 
Monolithic Memories, has been with the company since 
1983. Three years were spent as a field-application engi- 
neerin Northern Europe. Before that, he worked for Texas 
Instruments in Bedford, England and Fairchild Camera 
and Instrument in Bristol, England, performing logic de- 
sign and applications engineering, respectively. Jay earned 
a BSc (honors) degree from Essex University in Coulches- 
ter, England in 1977. 














PLD Programmability 
Extends Its Sway Over 
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Electrically erasable CMOS parts, the 
densest 24-pin SLIM PALs yet, reach 
new heights in architectural flexibility 


Om Agrawal, Product Planning Manager 
Fares Mubarak, Senior Test Engineer 
Field-Programmable Logic Division 
Advanced Micro Devices, Inc., Sunnyvale, CA 


wo electrically erasable PAL 

devices being introduced here 

by Advanced Micro Devices 
offer by far the greatest flexibility of 
any programmable array logic 
(PAL) device to date. Several fea- 
tures contribute to the adaptability 
of these CMOS ICs—the highest yet 
number of product terms (188 and 
178) of any 24-pin PAL; the opti- 
mization of one device, the AmPAL- 
29M16, for synchronous designs and 
the other, the AmPAL29MA\16, for 
asynchronous and glue logic applica- 
tions; and the convenience of some 
11,000 electrically programmable 
and erasable interconnecting cells 
(see Figs. 1 and 2). 

But the most distinctive enhance- 
ment is the programmable input and 
output bidirectionality of each de- 
vice’s 16 I/O logic macrocells (1/O- 
LMs). Until now, some program- 
mable logic devices (PLDs) have 
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had output macrocells, but none has 
had input macrocells, let alone the 
I/O type. The I/O pin on each I/O- 
LM can in addition be configured as 
combinatorial, as an edge-triggered 
register, or as a transparent latch. 

What’s more, on 8 of the 16 I/O 
pins, the associated 8 register/ 
latches can serve independently of 
the pins as programmable buried 
state registers. Current 40-pin 
CMOS EPLDs have only half that 
number. These internal registers 
offer designers an extra eight flip- 
flops for such jobs as keeping track 
of flags or generating timing and con- 
trol signals. 

The 11,000 or so interconnecting 
elements are electrically erasable 
(E?) cells that define what each 
PAL device can do. They connect all 
array inputs to the programmable 
AND-OR array, and they configure 
the architecture of the I/OLMs. The 
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user programs these cells by charg- 
ing and discharging them, so that de- 
sign changes can be made quickly— 
far faster than for ultraviolet-light 
erasable PLDs, or EPLDs. 

The record number of product 
terms, 16 macrocells, and horde of 
E? cells make the two newcomers 


the densest 24-pin SLIM PAL de-. 


vices available, whether bipolar or 
CMOS. The ICs are five to six times 
denser than industry-standard, first- 
generation 20- and 24-pin EPLDs 
and k?PLDs and twice as dense as 
second-generation 24-pin parts. In 
fact, these CMOS PALs are denser 
even than 40-pin bipolar PLDs. 
Their 35-ns max input-to-output 
propagation delay and external clock 
frequency of 20 MHz max fixes the 


new CMOS devices in the medium 
speed range expected of bipolar 
units. Maximum active power dissi- 
pation is 120 mA. Each comes in two 
logic levels—CMOS HC and CMOS 
HCT logic for TTL systems. (The 
trademark PAL belongs to Mono- 
lithic Memories.) 

From what follows, it will become 
clear that the basic I/O logic macro- 
cell structure is the same for both the 
synchronous 29M16 and asynchro- 
nous 29MA\16. Differences arise, 
though, in the I/OLM control mech- 
anisms—their clocking, resetting 
and presetting, and output enabling. 
Also, the distribution of the product 
terms between logic and control 
functions is different for the two 
PALs. The testing and debugging 


Fig. 1. The architecture of the synchronous AmPAL29M16 E? programmable 
logic device derives much of its flexibility from its 16 bidirectional input and 
output logic macrocells. The macrocells may use either one or two clocks, 
and each controls the output of on average 11 logic product terms. 
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features, though, are identical. In 
conclusion, an application of the 
29M16 will be outlined—the use of 
nearly a dozen of these packages to 
build a 32-bit direct memory access 
(DMA) controller. 


1/OLM with a difference 

The flexibility of the I/O and stor- 
age elements of these parts lends 
them the look more of a gate array 
than a PAL device (see Figs. 1 and 
2). Eight of the I/OLMs have single 
feedback to the AND/OR array, and 
eight have dual feedback. Single 
feedback comes from either the I/O 
pin or the storage element, dual feed- 
back comes from both. 

It’s when an I/O pin is used for 
input that its associated storage ele- 
ment can be turned to use as a buried 
register or latch. As many as all eight 
of the dual-feedback I/OLMs can 
be devoted to user-programmable 
buried-state register/latches. These 
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are helpful for both synchronous and 
asynchronous state-machine designs. 

The presence, number, and pro- 
grammability of the E? cells within 
the I/OLMs is the most significant 
architectural variable of these PAL 
devices. (Second-generation PLDs 
had three to four times fewer archi- 
tectural cells.) Is the macrocell to be 
an input or an output? An edge-trig- 
gered D flip-flop or a transparent 
latch? What are the macrocell’s 
clocking, feedback, and output- 
enable schemes and output polarity? 
What if it frees up a buried state 
register? 


Programming the E? cells 

All these decisions are made by 
programming the E® cells. Single- 
feedback macrocells have nine E? 
cells, dual-feedback macrocells have 
eight. In its virgin, erased (charged 
or disconnected) state, an E? cell’s 
value is 1. In its programmed (dis- 
charged or connected) state, it isa 0. 


Both synchronous and asynchro- 
nous I/OLMs contain a configurable 
storage-element cell and five multi- 
plexers—two 2:1 multiplexers for se- 
lecting I/O and feedback path and 
three 4:1 multiplexers for selecting 
output path, clock, and output en- 
able (see Fig. 3). 

Whether a macrocell is to be an 
input or output and the nature of 
its storage element (edge-triggered 
register or transparent latch) is set 
by E? cells S, and Ss. In its virgin 
state, the I/OLM is an output cell 
and the storage element is an edge- 
triggered register. It can be changed 
by the user to an input latch or regis- 
ter or output latch by appropriate 
programming of S, and S,. 

The polarity (active high or active 
low) of the cell’s output and its se- 
quential (registered or latched) or 
combinatorial nature are determined 
by Sp) and S,. With both of these 
cells in the erased (charged) state, 
for example, the output is set to be 


Fig. 2. The 16 !/O macrocells in the asynchronous AmPAL29MA16 can each 
be allotted its own product-term-driven clock. The PAL’s need for more 
control terms leaves an average of 7 logic product terms per macrocell. 
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active low and combinatorial. 

By programming, or discharging, 
cell Sp, the designer makes the out- 
put go active high. The OR-gate out- 
put bypasses the storage element 
and feeds the output-path selection 
multiplexer directly. 

On the other hand, by program- 
ming S,, the designer makes the out- 
put sequential (registered/latched) . 
The Q output of the storage element 
passes through the 4:1 multiplexer 
to an inverting output buffer and is 
fed back to the AND array inter- 
nally. This configuration is particu- 
larly useful for state machine de- 
signs. Programming both S, and S, 
results in an active-high, sequential 
output. 

The feedback multiplexer for the 
I/OLMs with single feedback is con- 
trolled by yet another E? cell (S,). 
This separate control of the feedback 
multiplexer makes for the ultimate 
in flexibility in state machine design. 
Each I/OLM, with registered, com- 
binatorial or latched outputs, choice 
of polarity, and dual feedback, has 
12 possible output configurations. 
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The timing needs of synchronous 
state machines are much simpler 
than those of asynchronous and glue 
logic—hence the difference in the 
two PALs’ clocking, output-enable, 
and preset/set arrangements and 
in the number of product terms as- 
signed to logic and control. 

For synchronous state-machine 
design, the flexibility of one or two 
synchronous clocks, preferably with 
polarity control, is often needed for 
individual macrocells. It is impor- 
tant for designing single-, dual-, or 
quadruple-state machines with a var- 
iable number of state registers. Pro- 
grammable clock polarity offers the 
ability to trigger certain groups, or 
banks, of registers on one edge and 
other groups on the opposite edge. 
The result is faster internal state 
machines. 

Early PAL devices had a single 


synchronous clock. Some newer syn- 
chronous CMOS erasable PAL de- 
vices offer dual clocks, but the two 
are selectable for a bank of 8 regis- 
ters at a time, rather than for 16 
macrocells individually. Thus, they 
are limited to, at most, two different 
state machines, each with an eight- 
register maximum. 

In addition, these PAL devices us- 
ually have two separate clock pins. 
So, if the designer needs a single- 
state machine with up to 16 registers, 
the second clock pin is wasted. 


Better clocking 

The 29M 16s alleviate these draw- 
backs. Each I/OLM has a 4:1 pro- 
grammable clock-selection mecha- 
nism selected by two E? cells, S, and 
S,. For addressing synchronous 
logic, the 29M16 has one dedicated 
CLK/LE (latch enable) pin and 


Fig. 3. The 1/0 logic macrocell, in either the synchronous AmPAL29M16 gate 
array (with input gates shaded yellow) or the asynchronous AMPAL29MA16 
(inputs on yellow background at far left), depends for its functionality on how 
the erasable S cells configure the multiplexers and storage cell. 
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one CLK/LE/I (input) pin. For 
macrocells configured as registers, 
these pins are clock pins; for latches, 
they’re latch-enable pins. 

Either the clock or latch-enable 
pin may be selected to trigger each 
storage element on either the rising 
or trailing edge. With this, system 
designers can implement up toa 
quadruple-state machine, with each 
meant to have a different number of 
registers. 

Also, since the device offers 16 
I/OLMs with input register/latches, 
the PAL is applicable to pipelined 
systems with eight input latches and 
eight output registers. The dual- 
clock capability allows the PAL to 
run with synchronous or asynchro- 
nous inputs. The clocks’ program- 
mable polarity allows the internal 
state machine to run at twice the 
speed of the external clock. 

Further, since only one pin is ded- 
icated as a clock/latch enable, a sin- 
gle state machine need not waste the 
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second clock pin. It can be an extra 
input. 

While one or two synchronous 
clocks with programmable polarity 
are sufficient for synchronous state 
machines, asynchronous and glue 
logic applications have more de- 
manding clock/latch requirements. 
They call for more flexible PAL de- 
vices with combinatorial outputs and 
feedback and with edge-triggered 
registers and transparent latches 
having individual clock/latch-enable 
capability (preferably with pro- 
grammable polarity). 

In addition, they require individ- 
ual control of reset and preset and 
output-enable product terms, as well 
as slightly fewer product terms per 
output. Further, for asynchronous/ 
glue logic applications, the clock-se- 
lection and output-enable schemes 
should be independent. 

The 29MA16 addresses these 
needs with up to 16 product-term- 
driven asynchronous clock/latch-en- 
able controls. Each of these controls 
is a product term driven from the 
AND.-OR array as a function of cer- 
tain input signals. These, in turn, 
are controlled by the user according 
to the application. 

Like its synchronous counterpart, 
each I/OLM of the asynchronous 
29MA16 has a 4:1 multiplexer for 
clock selection. Unlike the other, 
however, it allows the user to select 
from either a common synchronous 
clock/latch enable or an individual 
clock driven by a product term. 


Preset and reset 

The synchronous 29M16 uses 
common asynchronous reset and 
preset controls for all registered or 
latched inputs and outputs. When 
the product term for asynchronous 
preset is asserted, all the register/ 
latches are immediately loaded with 
a high, independent of the clock. 
Conversely, when the asynchronous 
reset product term is asserted, all the 
register/latches are immediately 
loaded with a low, independent of 
the clock. Actual output depends on 
the macrocell polarity. 

The asynchronous 29MA16, on 
the other hand, offers individual 
asynchronous reset and preset con- 
trol for each of its storage elements. 


This feature, plus the individual 
clock capability, makes it easy for 
the device to replace a large amount 
of glue logic. 

As in all PAL devices, the size, 
organization, and distribution of the 
internal product terms of the 29M16 
and 29MA16’s AND-OR array de- 
termine its logic capability. Both 
PAL devices can have 29 inputs, of 
which 24 issue from the I/O logic 





macrocells (8 from I/OLMs with 
single feedback and 16 from I/OLMs 
with dual feedback), 1 comes from 
the I/OE pin, and 3 are dedicated. 
The 29MAI16 adds a fourth dedi- 
cated input, but the 29M16 instead 
has the CLK/LE/TI pin. 

Of the 29M16’s 188 product terms, 
all of 176 are logic terms and 12 are 
control terms. The 176 logic product 
terms are distributed among the 16 





PALs move to ever-greater flexibility 


Important gauges of a PAL device’s flexi- 
bility are the number and complexity of 
its input/output logic macrocells. With 
the introduction of its electrically eras- 
able, CMOS-based AmPAL29M16 and 
AmPAL29MA16, Advanced Micro Devices 
has brought more flexibility than ever to 
programmable logic devices. 

First-generation, bipolar fuse-pro- 
grammable PAL devices had at most 
eight outputs fixed at the factory as 
either registered or combinatorial with 
active high or low polarity. The second 
generation improved on this somewhat 
by adding 10 programmable outputs con- 
figurable as registered or combinatorial 
and active high or low. 

Although some second-generation 
CMOS EPROM-based PLDs offer as many 
as 16 logic cells, these can be configured 
only as outputs. Further, these outputs 
lack a transparent-latch capability, a re- 
quirement for both asynchronous and 
synchronous designs. 

Architectural flexibility also extends to 
features such as variable product terms, 
varying the number of product terms per 
output, and programmable buried states. 
Variable product terms, first introduced 
in bipolar PAL devices like the AmPAL- 
22V10, help the device match the appli- 
cation. Programmable buried states use 
internal storage elements for state-ma- 

Chine design; until now, PAL devices 
lacked independent buried states. Such 
elements were used only in conjunction 
with 1/0 pins. Consequently, when an 
1/0 pin was an input, its associated stor- 
age element was wasted. 

At first, PAL devices were produced 


with fuse-based bipolar technology; they 
could not be reprogrammed. Next genera- 
tion EPROM-based PLDs (or EPLDs) are 
reprogrammable, as are the still newer 
electrically erasable PLDs (E?PLDs). 

To be erased, EPLDs must be in ex- 
pensive windowed ceramic packages 
through which they’re exposed to ultra- 
violet light for a lengthy time. Easier to 
reprogram, E?PLDs fit in inexpensive 
plastic packages. 

Complete testability is another sig- 
nificant advantage of E? technology. Un- 
like bipolar and EPLDs, which require 
additional testing circuitry, E? devices 
use only the array and its operating cir- 
cuitry to make DC, AC, and functional 
tests. 

Finally, a PAL device must meet the 
specific demands of combinatorial, syn- 
chronous, and asynchronous applica- 
tions. While some CMOS PLDs have at- 
tempted to address the broad range of 
applications with a “‘one-size-fits-all” 
approach, this has limitations. The device 
tends to have extra features that show 
up in overhead while not providing others 
that are key. The result is expensive and 
less than optimal. 
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I/OLMs, as was shown in Fig. 1. The 
eight I/OLMs with dual feedback 
have eight product terms each, while 
of those with single feedback, four 
have 12 product terms and four have 
16. The average is 11 product terms 
per output. 

This variable distribution of prod- 
uct terms, as well as the number of 
product terms per output, allows 
more complex functions—such as 
specialized counters and more com- 
plex state machines—to be imple- 
mented in a single device. 

Of the 29M16’s 12 control terms, 
two are used as common asynchro- 
nous reset and preset, and eight are 
output-enable product terms—one 
pair of common AND-XOR product 
terms per bank of four registers. The 
remaining two are used for observa- 
bility in testing and for preload of 
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the device’s storage element. 

As for the 29MA16, its smaller 
total of 178 product terms is divided 
into only 112 logic terms and all of 
66 control terms. The logic product 
terms are also distributed in a var- 
iable fashion (see Fig. 2 again); 
the average is 7 product terms per 
output. Of the control terms, four 
are distributed to each of the 16 
I/OLMs, and the remaining two are 
for observability and preload. 


Output enable control 

The I/O pin for both new devices 
may be configured as a dedicated (or 
permanently disabled) input, a dedi- 
cated output, or dynamically con- 
trolled. This flexibility results be- 
cause the connection from the I/O- 
LM to its I/O pin is controlled by a 
4:1 multiplexer. 
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Fig. 4. This multichannel 32-bit DMA controller is built 
from 11 AmPAL29M16s, one AmPAL16L8, a 16 x 16-bit 
RAM, and an 8-bit command register. 


On both devices, three of the in- 
puts to this multiplexer come from 
ground, Veo, and the I/O-enable pin. 
On the synchronous 29M16, the 
fourth input to the multiplexer is 
driven by the two AND-XOR prod- 
uct terms already mentioned as be- 
ing common for a bank of four 
I/OLMs. In synchronous state ma- 
chines, this arrangement allows the 
I/O pins to be controlled in groups 
of four. The AND-XOR product 
term control also allows for program- 
mable polarity, a useful feature for 
bus-control applications. 

On the 29MA16, this fourth input 
is driven by an individual product 
term from the AND-OR array. Such 
individual product-term control 
gives users maximum flexibility in 
configuring the I/O pins. 

Both the parts have built-in fea- 
tures to simplify testability and sys- 
tem-level debugging. These include 
a preload and a power-up reset for 
testability and an observability 
product term for debugging and for 
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tracing buried state registers. 

When the observability product 
term is asserted, it suppresses the 
combinatorial output data, prevent- 
ing the data from appearing on the 
I/O pin. It then presents the con- 
tents of the register/latches on the 
output pin for each of the logic mac- 
rocells. 

The built-in power-up reset ini- 
tializes the register/latches to a 
known state, a feature that simpli- 
fies state machine designs. A secur- 
ity cell in both devices protects the 
logic against unauthorized copying. 


A multichannel DMA controller 
The AmMPAL29M16/29MAI16 are 
suitable for applications ranging 
from glue and general-purpose logic 
replacement to synchronous and 
asynchronous state machines. Fairly 
complex Mealy and Moore state ma- 
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chines can be readily designed, and 
the I/O latches support double pipe- 
lined systems. 

Figure 4 is the block diagram of a 
direct memory access (DMA) con- 
troller built for the 32-bit 68020 mi- 
croprocessor. It is a simple, four- 
channel, single-cycle, 32-bit mem- 
ory-to-memory data-transfer design. 

The 68020 processor accommo- 
dates memory blocks and peripheral 
widths ranging from 8 to 32 bits on 
its system bus. It also imposes cer- 
tain connectivity requirements on 
slave devices: 8-bit devices must go 
to the uppermost 8 bits of the data 
bus, while 16-bit devices must con- 
nect to the data bus’s upper word. 

Consequently, issues related to the 
data width for the DMA operation 
are the data-transfer size (8, 16, or 
32 bits), the data-funneling scheme 
(for packing and unpacking data 
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to support a flexible data-transfer 
scheme), and block size. 

The DMA logic comprises two ad- 
dress pointers (one for source and 
one for destination); a 32-bit data 
latch; burst, wait, and block-length 
counters; an 8-bit command register; 
a 16 x 16-bit RAM; and the DMA 
sequence controller. This design uses 
a total of 11 CMOS AmPAL29M16s 
—9 to implement pointers, latch, and 
counters and the remaining couple in 
the sequence controller. 

The two pointers have a width of 
30 bits divided into an upper 16-bit 
segment register field and a lower 
14-bit offset register field. The 16- 
segment register field allows data to 
be transferred in 64-K segments, 
each segment being a maximum of 
16-K blocks (or 64-K words, with 4 
words in each block). 

The task of segment register main- 
tenance comes under CPU control. 
Thus, if the data to be transferred 
spans a segment boundary, the CPU 
must break up the transfer into two 
separate operations and set up seg- 
ment registers for it. 

The 14-bit offset register’s value 
is loaded into the RAM locations by 
the CPU before the DMA operation 
begins. In this example the DMA 
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Fig. 5. The state diagram of the DMA controller's 
operation shows that it will cycle continuously 
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PLD Programmability Extends Its Sway Over Complex I/O 





transfers only long words, so the 
least significant address bits A, and 
Ay are stuck-at-zeroes. The offset 
register is incremented after every 
DMA read/write cycle unless the 
command register says not to. 

A 82-bit data latch temporarily 
holds the long-word data fetched 
from the source. During destination 
writes, this becomes the source for 
the data. The burst, wait, and block 
count registers are each of them 14- 
bit decrementers. 

The burst register counts down af- 
ter every DMA read/write cycle. At 
zero, the data and address buses are 
relinquished by the DMA controller 
for a number of cycles defined in the 
wait register. 

The 14-bit block counter decre- 
menter, whose value is loaded into 
RAM locations by the CPU, keeps 
track of the number of long words 
transferred in each current DMA cy- 
cle. The counter is loaded with a 
value by the CPU before each new 
DMA operation and is decremented 
by the DMA control logic at each 
transfer. A count of zero indicates 
the end of current DMA operation. 


The control sequencer 

The DMA control sequencer has 
three jobs. It handles loading both 
the pointers and the registers from 
the RAM locations. It controls the 
DMA read/write cycles. And it con- 
trols the interfaces to the CPU (han- 
dling of burst and wait registers) and 
the memory. 

Because of I/O limitations in the 
sequence, the buried registers asso- 
ciated with inputs implement the 
state sequence. The output-enable 
structure enhances the individual 
I/O control of every macrocell, and 
the clock scheme is used by trigger- 
ing on both edges of the system clock 
to improve the DMA read/write cy- 
cle throughput. 

According to the state diagram for 
the DMA sequence controller (see 
Fig. 5), the device stays in the idle 
state until a DMA operation is 
started by the CPU’s writing the 
GO bit in the command register. 
The controller then loads the point- 
ers and the counter registers from 
the RAM locations and requests 
the bus. After the bus is granted, it 
initiates memory-to-memory data 


transfers—read followed by write 
cycles. (One memory-to-memory 
transfer cycle is a single operation.) 


DMA preempticn 

The DMA preemption is per- 
formed in write state WS,. When 
the external arbiter signals that the 
bus is needed by another potential 
bus master (by negating bus grant), 
the DMA controller gives up the bus 
and enters the bus-request state. It 
stays there until the bus control re- 
turns to the DMA controller. 

‘In state WS,, if the block counter 
reaches value 0, then it returns to the 
idle state. If the burst length has 
reached zero, but the counter value 
is not zero, then it goes to the wait 
state until the wait register value 
reaches zero. Then it transfers to the 
“load burst register” state. In state 
WS,, however, if both the counter 
and burst-register values are not 
zero, and if no other bus arbiter 
needs the bus, the controller contin- 
ues memory-to-memory data trans- 
fers until it is done. Oo 
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PROSE Devices Simplify 
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A bipolar device combines a PAL device for conditional branching 
with a PROM device for storing system control functions to make 
the design of small state machines easier. 


Co logic is required in a broad range of ap- 
plications—from low-level control of custom 
bit-slice microprocessors, to bus arbitration and 
timing generation in conventional microprocessors, 
to special-purpose needs like data encryption and 
decryption. Even though these applications vary in 
complexity, they all require a sequential-type device 
(a state machine, for example) that provides for se- 
quences of output signals based upon state tran- 
sitions and that includes some capability for 
branching. More complex sequencers provide capa- 
bility for looping and subroutining. 

Possible candidates for control logic range from 
Programmable Array Logic (PAL) devices, to the 
more complex Field Programmable Logic Se- 
quencer (FPLS) devices, to instruction-based 
microprogram controllers that rely on on-chip 
memory for expanded subroutine capabilities. 
While FPLS and PAL devices have limited product 
terms for many state-machine applications, the 
other microprogram controllers are sometimes 
more complex than required. For applications re- 
quiring control functions that small and medium 
state machines can represent, the Prose device from 
Monolithic Memories Inc (MMI) can facilitate con- 
trol logic. 

Combining a PAL device for conditional branch- 
ing and a PROM for storing the control functions 


Nick Schmitz 

Schmitz is a product planning manager for program- 
mable logic devices at Monolithic Memories Inc (San- 
ta Clara, CA). 





of the system, Prose is intended to function as a 
programmable sequencer (Prose = PROgrammable 
SEquencer). It offers a maximum of 128 states and 
up to four-way branching. A bipolar device oper- 
ating at up to 25 MHz, Prose functions as a Moore 
state machine. Under certain conditions, a Mealy 
state machine can also be modeled with Prose. 


Working faster and simpler 

Custom microprocessors, such as those built with 
bit-slice components, require control logic in the 
form of multiple IC microinstruction memories 
and sequencers to generate addresses for those 
memories. For complicated sets of instructions 
with many states and complex branching and 
subroutines, a microprogram controller makes 
sense. But in situations in which a more limited set 
of instructions and instruction transitions is re- 
quired, the single-chip Prose device is faster as well 


Reprinted from the April 1, 1987 edition of Computer Design 
Copyright 1987 by PennWell Publishing Company. 
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PROSE Devices Simplify State Machine Design 


Prose in a classic traffic-controller application 


onsider a four-way intersection. For each direc- 

tion (N, S, E and W), there are two sets of 
lights: one for the forward direction (F) and one for 
left turns (L). Each set has three lights: red (R), yel- 
low (Y) and green (G). For each of the four direc- 
tions, there are also two detectors (D): a left turn 
detector and a forward detector. 

In the master diagram for traffic-flow control (a), 
each box represents a set of states generated by 
the controller in response to the input conditions. 
For instance, DO NL is executed given the input 
conditions NLD */SLD (NLD and not SLD). 

Four Prose devices, a counter and an encoder 
are used to implement this controller (b). The in- 
puts to this controller are the six outputs of the en- 
coder and the carry out (CO) signal from the 
counter. The outputs are the control signals to the 
lights themselves. The encoder simply passes the 
NLD, SLD, ELD and WLD signals. It generates two 
new signals, NSD and EWD, as follows: 


NSD=NLD+NFD+SLD+SFD 
EWD=ELD+EFD+WLD+WFD 


Prose devices 1 through 3 generate the 24 out- 
puts that control the traffic lights, while Prose 4 
preloads the counter. Note that all four Prose de- 
vices receive the CO signal from the counter. CO 
tells the Prose devices when the preloaded 
number of cycles has been completed so they can 
generate the next state. The counter is used to 
conserve states in the Prose devices; that is, the 
devices simply cycle back to the same state until 
the desired number of cycles has been completed 
or until some condition detected by the sense de- 
tectors warrants jumping out of the loop early. 
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as simpler. For instance, when an ALU is used only 
for a multiply or filtering operation, Prose devices 
reduce chip count but can also generate the required 
instructions faster than the more complex micro- 
program controller. Prose devices can also be used 
for addressing the data memory to fetch the 
operands for the filtering operation. 

Consider also the case of a video controller. For 
scanning purposes, counters that operate in various 
sequences and count lengths are required. But in- 
stead of implementing these as actual counters, the 
sequences involved can be unlocked and imple- 
mented as state-machine transitions. And there’s 
an advantage beyond the mere economy of parts. A 
count can be set or initiated and then left to poll 
various external hardware conditions while the mi- 
croprocessor performs other operations. 
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Memory and bus arbitration applications offer 
further opportunities for using Prose devices. If 
two processors or other devices vie for memory ac- 
cess, a simple Prose state machine can implement 
the logic for deciding which device gets the bus 
and/or access to memory. In such a case, the heavy 
artillery of a microprogram controller and memory 
isn’t required. 

As a sequencer for signal-processing applica- 
tions, Prose state machines offer speed and suffi- 
cient functionality without the overkill of a com- 
plex microsequencer paired with dedicated mem- 
ory. For simple algorithms, such as those involved 
in performing a fast Fourier transform, Prose de- 
vices can control the set of vectors that are mul- 
tiplied and added in the process. Via the instruction 
sets to the devices, Prose can control the operation 
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Consider a block such as DO NL. As we enter 
this block, we have just completed flow in the east- 
west direction. The previous state has generated 
the preload for the counter so that as we enter the 
block DO NL with NLD* /SLD, the counter begins 
the count. Suppose that we have preloaded 110 
and count down. On the count following 000, CO 
goes high, and this tells the Prose devices to 
branch to another state. Figure (c) is a state transi- 
tion diagram for DO NL. The state “NLG NFG” re- 
cycles until CO or /NLD occurs. Figure (d) shows 
the details of DO NLSL; Figure (e) shows DO NFSF. 
These diagrams reflect all basic transition types in 
the controller. There are four blocks of the type DO 
NL, two of the type DO NLSL, and two of the type 
DO NFSF. In addition, there’s one other block, ALL 
REDS, that contains only one state: all outputs red. 
So, from counting the possible states of the sys- 
tem according to branch type, we have 15 one-way 
branches, 10 two-way branches, 0 three-way 
branches, and 4 four-way branches, resulting in 31 
of the 128 PROM !ocations used. 

This application uses four Prose devices—three 
for generating signals to control the lights and one 
for controlling the counter for state minimization. 
Although outputs are different, all four devices use 
the same inputs and execute the same transition 
table and thus can be considered to be in the same 
state, or to be generating a single state with a 
wider output word. This usage demonstrates how 
the Prose device can be expanded in width. When 
programming the devices, state transition and 
condition equations are identical and can be 
copied. Only the output equations are unique to 
each and must be programmed separately. 


of the ALUs or multipliers and can control memory 
access when separate Prose devices are used. A 
Prose device is well suited to relatively simple oper- 
ations that don’t involve complicated decision 
making. For complex digital signal processing op- 
erations, a programmable DSP device may be bet- 
ter, but it’s not likely to be as fast as the dedicated 
hardware approach. 

In peripheral control, the simple state-machine 
approach can be efficient. Consider the case of run- 
length-limited code. Both encoding and decoding 
can be translated into state sequence diagrams, 
which examine the serial data stream as it’s read, 
and can then be readily programmed into a Prose 
device. 

Industrial control and robotics also require sim- 
ple control functions. Tasks such as mechanical po- 


al Monolithic RR Memories Pm | 


7 NLD + CO 


(c) 


NLY SLG 


SLG SFG i 








(PRELOAD 110) 


NLG NFG 


| (PRELOAD 110) 


/CO*NLD 


/C0 + /EWD 


NFG SFG |) 


CO*EWD 


I 


NLY NFG 
NFY SFY 


fi 


<+ 


(d 


= 


(PRELOAD 110) 








/CO*NLD*SLD 


/C0 + /NLD*/SLD 


NLG SLY 


| 


NLG NFG 


/NLD*SLD 
NLD*/SLD 







NLY SLY 


(PRELOAD 110) (PRELOAD 110) 


——_——_——- 


/CO*SLD /CO *NLD 





ae ae 
SLY SFY NLY NFY 






sitioning of a robot arm, simple decision making, 
calculating a trigonometric function and displaying 
several digits usually don’t require a high-power 
microsequencer with stacks and pointers and mi- 
croprogram memory. What’s required instead is a 
device that can store a limited number of states and 
allows simple branching upon conditions. 


Control logic alternatives 
A registered PAL device is a simple state machine 
and can be effectively used for simple control func- 
tions, such as counters, dynamic RAM controllers, 
interrupt controllers and certain types of video con- 
trollers. Combining a PAL device with memory 
such as a PROM, as MMI has done in Prose, gives 
the device much more powerful control capability. 
Another alternative for control logic is the FPLS, 
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Prose as a bus arbiter 


rose can be handy in applications that require 

bus arbitration. Consider, for example, two 
microprocessors and refresh circuitry that require 
access to a block of dynamic RAMs (a). 


(a) 


PROSE ARBITRATION 


CC(0-2) 


/CCACK(O-2) 


The arbiter, which is a Prose device, receives re- 
quests for access via condition codes CC(2-0). The 
device may also receive a synchronous initializa- 
tion request, /IS, which is intended to reset the sys- 
tem to an initial state. From these input signals, 
the arbiter must generate signals to control the ac- 
cess to the DRAMs. Thus it must generate ac- 
knowledge signals for the requesting devices— 
condition code acknowledge signals—/CCACK(2- 
0)—and /RAS (row address strobe) and /CAS 
(column address strobe) for the memories them- 
selves and MCS for row/column multiplexer con- 
trol. With this set of input and output signals, the 


which has programmable AND and OR arrays as 
well as JK or clocked-RS flip-flops with internal 
registered feedback that allow sequencing and con- 
figurable I/O pins. FPLS devices have a fixed 
number of product terms that can be shared be- 
tween input logic, state transitions and output func- 
tions, dictated by the application needs. Because 
the Prose device has dedicated resources—the PAL 
and PROM arrays—these applications trade-offs 
aren’t required. While not characteristic of the ar- 
chitecture, some FPLS sequencers operate at dif- 
ferent speeds depending on the logic functions 
implemented; for example, as the complexity of the 
logic functions increases, speed decreases. With the 
Prose device, however, operation speed is a con- 
stant. For many applications, FPLS devices and 
Prose devices are complementary; a design engineer 
should be able to use both, choosing the ment de- 
vice for the application. 

Finally, some recently introduced instruction- 





Prose device can precisely control the access to 
memory. 

Once access is granted, a complete memory cy- 
cle requires five clock cycles. Three clocks are re- 
quired for access; two are required for DRAM 
precharging. /RAS is asserted from the second cy- 
cle to the end of the fourth, while /CAS is asserted 
from the third cycle to the end of the fourth. (Row 
and column addresses are asserted, as shown, 
prior to the assertion of /RAS and /CAS.) Both /RAS 


(b) 


TaD IS + /CCO*/CC1*/CC2 


a cc0*/CC1*/CC2*/1S 














CCO*/CC1*/CC2*/1S 
CC1*/CC2*/1S 
CC2*/IS 


based machines, similar to a PROM under control 
of a microsequencer, offer another option for con- 
trol logic. These machines are sophisticated devices 
with ample states for state machines, but they’re 
too complex for simple control applications. More- 
over, user adaptability and customization is lim- 
ited. Because they’re powerful and have sufficient 
outputs, these machines can easily implement 
counter-type state machines. But they offer a 
limited choice of combinations of input literals for 
condition testing. While such devices form a power- 
ful microprogram controller/sequencer for gener- 
ating addresses, they don’t provide a single-chip 
solution, and in many cases they’re far more power- 
ful than the application requires. 


Prose device overview 

The major building blocks in the Prose device are 
the PAL array, the PROM array, and the shadow 
and output registers. In brief, the PAL array gener- 
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and /CAS are removed during the fifth and the first 
cycle for DRAM precharging. 

The sequences of memory clock cycles with 
their associated control signals can be viewed as a 
set of state transitions (b). The condition codes, 
CC(2-0), are assigned as follows: CC2 is a refresh 
required; CC1 is a request from Processor 1; and 
CCO is a request from Processor 2. A refresh cycle 
differs from a regular memory cycle only in that 
ICAS isn’t asserted. 

As shown in the transition diagram, A1 cycles 
upon itself if IS is H (or active low signal /IS is L), 
or if there’s no request. If there’s a refresh request 
and IS is L, the transition A1 to B1 occurs, initiating 
the cycle of states B1 through B5. This acknow- 
ledges the refresh request and generates the tim- 
ing required to carry out the refresh operation. 
Note that any time /IS occurs, the next transition is 
to Al. The same is true for the loops C1 through C5 
(processor 1) and D1 through D5 (processor 2). 
Note also that at B5 if there’s no request (and IS is 
L), the next transition is to Al. But if IS is L and C2 
is still true (another refresh request), the next tran- 
sition will be to Bt. Furthermore, if CC+ is L (no 
refresh request), |S is L, but CC1 is H (processor 1 
request), the transition will be to C1 directly. 
Similarly, if 1S, CC2 and CC1 are all L, and CCO is H, 
the transition will be directly to state D1. The tran- 
sition diagram shows a similar relationship of tran- 
sitions in the loops C1 through C5 and D1 through 
D5. There are 16 states, including 1 three-way, 12 
two-way, and 3 four-way states, which the Prose 
device can easily accommodate. 

Prose can also handle more complex configura- 
tions. Suppose there are four devices competing 











for bus and memory access. A problem arises in 
simply expanding the diagram (c) to accommodate 
the extra device. Doing this directly would require 
that Al be a five-way branch. But the Prose is 
limited to four-way branching. 


(c) CC3 + CC2 


CCO + CCl 





PROCESSOR 1 PROCESSOR 2 REFRESH 1/0 PORT 


This difficulty is easily avoided, however, by in- 
serting another state, A2, between A1 and the tran- 
sition sequences for two of the requesting 
devices. Thus A2 serves as the embarkation point 
for these transitions. Here, the condition codes are 
CC3 (refresh), CC2 (processor 1), CC1 (processor 2) 
and the CCO (an I/O port). CC3 has the highest 
priority; CCO has the lowest. Note, however, that 
D1 through D5are the fiveclockcyclesfortherefresh 
circuit. A refresh request always gets the highest 
priority but, since speed of executionisn’tcrucial for 
the refresh request, it’s reached via A2. That is, the 
refresh requestis never ignored, but it’s adequate to 
service it through A2, allowing faster servicing (by one 
clock cycle) of processor 1 or processor 2 requests. 
A1goestoA2 forCC3*/IS + CCO*/CC2*/CC1*/IS; A2 
goes to D1 forCC3 */IS. This configuration requires 
no one-way, 16 two-way, 1 three-way, and 5 four-way 
states—easily handled by Prose. 


































ates the branch address bits for the memory (128 
words X21 bits), which stores the system’s next 
state addresses, device outputs and control func- 
tions. The shadow register is used for programming 
and diagnosis, and the output register provides syn- 
chronous operation and registered feedback. 

The PAL array is a 14H2-type: it has 14 comple- 
mentary inputs and two active high outputs. Each 
of the two outputs is a sum of eight product terms. 
Eight of the inputs, from an external source, are 
used for encoding of conditions for branch selec- 
tion. The other six inputs, from internal feedback 
from the PROM, are used for selection of the con- 
ditions that determine branching when decisions 
are involved in state transitions. 

The PROM consists of 128 21-bit words or loca- 
tions, each of which can be viewed as a State, or as 
containing state-related information. Eight bits at 
each location are intended as outputs. The other 13 
bits are intended as feedback: five are feedback to 
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the PROM array itself for partially determining the 
next location to be addressed, and six are condition- 
select bits to the PAL array. They’re used for prod- 
uct term selection/enabling. The remaining two 
bits are inputs to the exclusive OR gates. They’re 
used to determine address bits A6é and A5 when 
branching isn’t involved. 

When no branching is involved, CS(S-0) select no 
condition in the PAL array, so that no product 
term is wasted, and the PAL outputs are 
Xl=X2=0. Thus XFI and XFO are used to select 
the higher-order address bits—A6 and AS, in this 
case—and product terms are conserved for use 
when real conditional selection (branching) is in- 
volved. When conditional branching is involved, 
no more than four branches can be from one loca- 
tion, and all branch states will be in the same prima- 
ry location but in different quadrants. That is, 
address bit A(4-0) will be the same, and the condi- 
tions will toggle A6é and AS to determine the exact 
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quadrant location of each of the possible next 
states. 

For diagnostic purposes, a shadow register 
breaks the data path. Data can be shifted in or out 
serially. The shadow register and the output register 
are operated by different clocks, so that data may 
be shifted without affecting device operation and 
spoiling data. This lets data (in this case, test vec- 
tors) be shifted into the shadow register and clocked 
onto the output register. A normal operation can 
then be performed, after which the data can be 
loaded onto the shadow register and shifted out to 
be analyzed. In addition, a multiplexer to the shad- 
ow register lets the register contain data from the 
output register or from the previous shadow regis- 
ter contents. 

Internally, programming of the Prose device 
takes place via the shadow register, is completely 
transparent to the user and is done individually for 
each half of the device. For programming the PAL 
part of the device, a certain pattern of data is shift- 
ed into the shadow register with Vcc raised to the 
programming supervoltage and the preset/enable 
pin also raised to the supervoltage afterwards. Indi- 
vidual fuses in the PAL device are blown and se- 
lected by the appropriate pattern. 

Programming of the PROM is accomplished in 
similar fashion. Vcc is raised to the supervoltage 
value; SDI is raised to the supervoltage afterwards. 
The bit pattern in the shadow register determines 
which fuse is to be blown. 


Software support 

Prose state machines are fully supported by new 
programs in the Palasm suite—Proasm for assem- 
bly and Prosim for simulation. To program the 
Prose device, the user inputs a list of state names, 
transitions and conditions that cause transitions to 
subsequent states. When progamming a Moore ma- 
chine, the user also inputs a list of expected outputs 
for each state, and when programming a Mealy ma- 


OUTPUT REGISTERS 


In the Prose device, the PAL ar- 
ray processes inputs to produce 


PROM 
ARRAY 
128 x 21 


two decision variables that ac- 
cess the PROM array, which 
generates outputs from the ma- 
chine and the next data address. 
Support logic is provided for 
serial scan access and diagnosis. 


SHADOW & 


chine, the user inputs a list of outputs expected for 
each path to each state. (For a Moore state ma- 
chine, the outputs depend only on the state. For a 
Mealy machine, outputs depend on the inputs also.) 
User input to Palasm 2 (for state-machine entry) 
consists of four sections: declaration, state, equa- 
tions and simulation. The declaration section speci- 
fies the type of Prose device to be programmed. 
Currently, there’s just one device in the family. 
The state section specifies the state transitions 
and the output details. The user specifies the type of 
state machine—Moore or Mealy—and the various 
defaults that should be used when the next state or 
the outputs can’t be determined from the equa- 
tions. Furthermore, the user specifies state transi- 
tions and outputs via equations. The transition 
equations specify transitions from each current 
state. The output equations specify the outputs for 
each state, in the case of a Moore state machine, or 
the outputs expected upon the transition from one 
state to another, in the case of a Mealy state ma- 
chine. For the Moore machine, there’s only one set 
of outputs for each state, but for the Mealy ma- 
chine, the outputs depend on the path to the state. 
The last section, the simulation section, contains in- 
structions that tell the software how to simulate ac- 
tual operation of the state machine and, therefore, 
how to test the device once it’s programmed. GD 
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: . n the abstract, a computer, or at 
CSI nh | n a least parts of a computer, can be 
defined by a state-machine di- 

rected-transition graph (see Fig. 1). 

A state machine is a small, locally 


e 
State mach 1 ne controlled machine—not intended 
for a complete computation of a com- 


plex program, 


e Experts generally divide state ma- 
Wit a chines into two basic classifications: 
algorithmic and synchronous-se- 

quential machines. Algorithmic ma- 

chines include the subcategorie: 

program ma eC called Turing machines, Post ma- 
chines, and finite machines, which 

primarily describe the specific algo- 

Se wencer rithm used in programming the com- 

puter. A basic difference between a 

synchronous sequential state ma- 

chine and an algorithmic machine is 





Programmable-array-logic and the presence or absence of stop 

: ‘ states. A start state, of course, must 

PROM combo provides a convenient be present in all machines or else 
. é they will not be initiated. 

approach to state-machine design A stop state in an algorithmic ma- 

é chine (like a Turing) occurs when 

for local intelligent control it has finished executing an algo- 


rithm. In a synchronous sequential 
machine, however, to achieve a stop 
‘ state, the machine must branch to it- 
Frank Lee, Product Planning and Applications self—a type of wait state—until an- 
Monolithic Memories, Inc., Santa Clara, CA other operation is ready. The only 
time a synchronous sequential state 
machine may be actually halted is 
for an error or illegal command that 
forces it into looping on itself uncon- 
ditionally. But once in such a state, 
the state machine theoretically can’t 
exit from it. The machine must be 
reset and started over again. 


Implementing a state machine 
Although devices such as PAL, 
PLA, and FPLS arrays can be used 
to implement such a state machine, a 
programmable sequencer (PROSE) 
device, designated PMS14R21, is a 
convenient, flexible, and economical 
way. It consists of a combination 
of a special PAL and a registered 
PROM that are optimized for use 
in local, intelligent-control state- 


Reprinted from ELECTRONIC PRODUCTS Magazine—February 1, 1987., AR-160 
645 Stewart Ave., Garden City, NY 11530 » © 1987 Hearst Business Communications, Inc. 





o\ Monolithic ara Memories Pm | 2-607 


2-608 


Designing a State Machine with a Programmable Sequencer 


START 








RESET 





Fig. 1. Parts of a computer can be represented by a graph of a state machine 
showing its directed transitlons between states. An algorithmic machine (a) 
has stop state(s), whereas a synchronous machine (b) does not. 
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Fig. 2. The PMS14R21 programmable se- 
quencer has a special PAL array joined to 
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a registered PROM by two OR gates. Q(0.7) 


machine design (see Fig. 2). 

The introduction of the PROSE 
device is a significant advance in the 
design of fuse-programmable state 
machines of small to middling size. 
The PAL array serves to select tran- 
sition conditions, and the PROM 
stores the state assignments. Also, 
two exclusive-OR gates help reduce 
the total number of states and prod- 
uct terms a state machine would 
need, One terminal (I/E) can be 


programmed to receive an asynchro- 
nous initialization signal (one that 
resets all registers to the high state) 
or an output enable (via an architec- 
tural fuse); others, to enable fault 
diagnosis. A power-up reset feature 
is also provided. 

Each PROSE device has eight in- 
puts, eight outputs, up to 128 states, 
and up to a four-way branching ca- 
pability. Many local control func- 
tions can be done faster than on a 
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good many other state machines. 
Also, timing restraints on external 
devices are greatly relaxed. Perhaps 
most important of all, the PROSE 
device provides a small, single-pack- 
age solution to state-machine design 
(for example, 24-pin, 3-in. Skinny- 
DIPs and 28-pin plastic leaded chip 
carriers). 


Moore and Mealy models 

Two categories of state-machine 
models are generally implemented: 
the Moore and the Mealy state ma- 
chines. The Moore machine’s out- 
puts depend only on the state; they 
are independent of inputs. A Mealy 
machine’s outputs, however, depend 
on both the state and the machine’s 
inputs. The Mealy, therefore, is a 
superset of a Moore state machine. 

A true Mealy state machine is too 
versatile to be defined in a general 
architecture. A Moore machine is 
much easier to define and simpler 
to implement. The simpler the state 
machine is, the faster it will be. On 
the other hand, for most applica- 
tions, the outputs of a state machine 
depend somehow on the inputs as 
well as on the state. 

The PROSE device can support 
the Moore model. Fortunately, the 
Mealy model can also be supported, 
provided that the PROSE device’s 
output pattern can be pipelined by 
one clock cycle. Also, the number of 
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combinations of patterns in the sub- 
sequent states, and the state thus 
pipelined, must not exceed four. 


Controlling a state machine 

Naturally, a state machine must 
include a control system in addition 
to the operating portion outlined so 
far. The control system determines 
the next state on the basis of the cur- 
rent state plus a combination of in- 
puts to the control system. Of course, 
based on its state, the operating part 
of the system determines the outputs 
of the machine. 

The number of inputs, to a certain 
extent, determines the number of 
conditions (as inputs or combina- 
tions of input literals) from which 
the state machine can select. It can 
also determine whether encoding the 
conditions is necessary, which then 
becomes part of any external delay. 
A small, local, intelligent, arbitration 
unit may have as few as three to four 
states to as many as 30 to 40 states. 
A highly intelligent controller can 
have thousands of states. However, 
a state machine generally has a rela- 
tively small number of states to 
which branching occurs, because the 
latter states will eventually loop to 
each other (except maybe for the few 
starting states). 


Conditional branching needed 


Because a state machine switches 
its states on the basis of both the 
current condition and inputs, fre- 
quently conditional branching is 
necessary, sometimes to more than 
two next states. Four-way branching 
is common. If more than four-way 
branching is required, up to 16-way 
branching can be carried out by two 
cycle:: of four-way branches (see 
Fig. 3). 

But in applications that operate 
on single-cycle instructions, such a 
branch-fanout tree may not be de- 
sirable. Such single-cycle multiple 
branching can be done provided the 
machine allows losing a cycle for the 
fanout and has enough product terms 
to perform the branching on all the 
conditions. (A PAL or PLA device 
has a limited number of available 
product terms.) 

Similarly, the number of outputs 


Fig. 3. A state fanout tree of up toa 
16-way branch can be carried out in 
two cycles of four-way branches. 
This operation requires the state ma- 
chine to lose a cycle for the fanout 
and to have enough product terms 
available to perform the branching 
for all the conditions. 


determines whether encoding of op- 
erations control is necessary and 
how many operations are possible. 
Again, decoding of control signals 
externally will mean extra delay. In 
a single-device state machine, since 
a limited number of pins is available, 
the ratio of the number of outputs to 
the number of inputs is very critical. 
This ratio determines the applica- 
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tion range of the device. 

The PROSE state machine uses a 
PAL array, designated 14H2, instead 
of a simple condition multiplexer, to 
avoid limiting the choice of possible 
transition conditions to just one in- 
put at a time. Rather, input literals 
at any time can be combined so that 
transition-condition encoding is not 
needed. The eight inputs (I, to I,) 
to the PAL device are condition sig- 
nals. Together with the six addi- 
tional inputs (CS, to CS;), which 
carry condition-select feedback sig- 
nals, they determine the next state. 
These feedback signals serve only to 
select the conditions for branching. 


The six condition-select feedback 
signals to the PAL array simplify the 
selection of the transition conditions. 
Also, combinations of conditions to 
minimize product terms can be em- 
ployed. Since the output registers 
are all initialized to high states, and 
the next state after an initialization 
has to be known (state 127 for this 
device), all product terms in the 
PAL array must be disabled with 
this pattern (all high) to perform 
unconditional branching. Therefore, 
this pattern is reserved for disabling 
all product terms for unconditional 
branching. 

The 14 PAL inputs are comple- 
mented internally to help program 
two eight-product-term OR gates at 
each of two active-high outputs (Xq 
and X, ). The conditions controlling 
the next states must fit within these 
limits. The outputs of the PAL feed 
exclusive-OR (EXOR) gates, which 
help conserve the need for state and 
product terms. 

To perform an unconditional, or 
one-way branch, to save a product 
term, Xp and X, are at 00. For ex- 
ample, the new PROM address has 
A; and A, equal to 10 (binary). Ac- 
cordingly, the EXOR feedback sig- 
nals XF,) and XF, are programmed 
as 10, to force the EXOR outputs to 
10 unconditionally. 

For initialization, the CS) to CS; 
feedback pattern for an uncondi- 
tional branch is 111111. Thus, the 
operational feedback of CS, to CS; 
to the PAL must contain at least 
one 0—that is, patterns like 111111, 
11X1X1, or XX1XX*X should not 
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be used to select any product term. 


Organization of the PROM array 


The PROM array is a fuse-pro- 
grammable array of 128 locations by 
21 outputs. Eight primary outputs 
are directly available (Q, to Q;). 
The actual functions are defined by 
the user. In addition, 13 more out- 
puts serve as feedback signals to help 
determine the next state. All the out- 
puts are registered at the rising edge 
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Fig. 4. An absolute address map (a) 
and a primary location map (b) show 
how a location within each of four 
PROM quadrants has the same pri- 
mary location. If two states branch 
to the same state (c), that state 
would have to be duplicated if all the 
resulting next states exceed four. 


of the CLK input signal. 

The primary locations in the 
PROM are addressed by five inputs, 
Ag to Ay, which are fed back directly 
from the PROM registered outputs 
to the PROM array. These 5 bits de- 
termine the next primary location. 

Therefore, 32 primary locations 
define the states. The remaining two 
address inputs to the PROM (bits 
A; and A,), which the PAL array 
and the exclusive-OR gates generate, 
address four PROM quadrants, in 
each of which branching can be per- 
formed to any of its 32 primary lo- 
cations (see Fig. 4a). 
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Potentially, each of the 128 loca- 
tions thus addressed could define a 
different state. In actuality, the num- 
ber implemented is usually fewer. 
For example, some states may need 
to be duplicated in different primary 
locations. Consider a state A whose 
next states will be B, C, and D, and 
a state E whose next states will be 
D, F, and G. Although one next state, 
D, is common to the two conditional 
transfers, putting B, C, D, F, G with- 
in the same primary location (within 
the four quadrants) is impossible, 
since a primary location can accom- 
modate only four next states in all 
(see Fig. ¢b). Therefore, two pri- 
mary locations must be occupied 
with the state D and its clone. 

Another example requiring dupli- 
cation of states occurs even when a 
state A branches to states B and C, 
and state D branches to states C, E, 
and F—state C being common (see 
Fig. 4c). Although the total number 
of next states is four, if insufficient 
product terms are available, the 
states of A and D may still need to 
be duplicated to achieve that result. 

In the worst case, if every next 
state had to be duplicated because 
of incompatible combinations, or if 
all states had four-way branches, the 
PROSE state machine could still ac- 
commodate 32 states. Each state 
would have to be pointed to a differ- 
ent primary location, 


Registers for the PROM 


To conserve terminal pins, the 13 
bits of feedback signals from the 21- 
bit output register are buried within 
the PROSE chip (see Fig. 5). What 
is called a shadow register, together 
with a diagnostic-on-chip (DOC) 
scheme, enables testing various parts 
of the chip, including the buried-sig- 
nal conditions. 

The shadow and output registers 
(see Fig. 6a) are operated by differ- 
ent clocks—DCLK and CLK, re- 
spectively—in several modes with 
the MODE and SDI signals (see 
Table 1). The output registers can 
multiplex data from the shadow reg- 
ister (to receive a test vector) or 
from the PROM array (to receive a 
test result). Similarly, the shadow 
register can multiplex data from the 
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output register (for the test result) ; 
the previous shadow-register bits (to 
shift in a test vector and/or shift 
out a test result); and from the cur- 
rent shadow-register bits (as just a 
hold operation). 

The DOC-register concept also 
solves the programming issues for 
the PROSE device. A pattern in the 
shadow register identifies the fuse 
to be blown (see Fig. 6b). Bits PT) 
to PT, select one product term out 
of eight from each of the PAL’s sum 
of terms. Of these, bits CS, to CS, 
and I, to I, select one input to the 
PAL array. The bit labeled O/P se- 
lects a sum term, and the bit T/C 
selects either the true or complemen- 
tary value of the input selected. 

Similarly, the fuse to be blown in 
the PROM is identified by the pat- 
tern in the shadow register (see Fig. 
6c). Ag to Ag is the absolute address 
for the PROM array, and OA, to 
OA, selects 1 bit out of the 21 bits 
in a PROM word. 


Applications abound 

The PROSE device is designed to 
be used as a general-purpose state 
machine for controlling equipment 
or for data manipulation. The 128 
states and four-way branch capabili- 
ties make it possible to be used as a 
high-speed memory or bus arbitra- 
tor, a video-RAM arbitrator, an in- 
dustrial traffic controller, a signal- 
processor sequencer, a sequencer for 
mass storage or communications, a 
data encoder and decoder, and for 
many other applications. For exam- 
ple, the PROSE state machine can 
implement the run-length-limited 
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Fig. 5. The block diagram for a diagnostic register bit includes both a shadow 
register and an output register. Thus a test vector and the test result can be 
shifted in and out of the shadow register without corrupting the output flow. 


code, designated 2,7 RLL (see Table 
2). It is popular in mass-storage 
equipment and can increase the ca- 
pacity of such a device by approxi- 
mately 50%. 

In any mass-storage media, a min- 
imum separation of at Jeast one zero 


between adjacent logic ones is re- 
quired. In the 2,7 RLL coding/de- 
coding system, at least two zeros 
must exist between any two adjacent 
ones. (Thus encoded patterns like 
11 or 101 are illegal because there 
are fewer than two zeros between ad- 
jacent ones.) 

Since the distance between ones 





Table 1. Functions of the Diagnostic Registers 


L x 1c * | hold OREG.<€PROM 


H X hold OREG:<€SREG: 


L X SREGi<€SREG-1] hold 
SREG.<€SDI 


H SREG:<OREG; hold 
hold hold 


determines the flux transition fre- 
quency on the magnetic media, a 
minimum of two zeros between ad- 
jacent ones means that flux transi- 
tions are at least three bit intervals 
apart. Therefore, the encoded flux 
density becomes 3/X (a binary 100 
in the space X, the minimum on the 
medium between transitions). Other 
schemes like MFM, however, require 
a density of 2/X (a binary 10 in the 
minimum space X). Accordingly, 
since three encoded bits occupy the 
space on the media that carried 2 
bits before encoding, the encoded 


Shadow 
register 
(SREG) 


Output 
register 


(OREG) Operation 


normal 
load from SREG 
shift SREG 


load to SREG 
no operation 
L = zero 


H = one 
X = don't care 


C = clock pulse (low-to-high transition) 
* = stable clock or high-to-low transition 
i = bit number 
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density is 3/2, or 1.5-times the orig- 
inal density. (Additionally, to syn- 
chronize data, a maximum of seven 
zeros can be present between any two 


adjacent ones.) 


Starting from the INIT, or reset 
state, the encoder/decoder first car- 
ries out a three-way branch—to per- 
form encoding if the signal ENC is 
received, or to perform decoding if 
the signal DEC is received, or else 
to itself. Once the state machine 
starts, it continues on its own and 
will be stopped when the machine 
receives a reset signal at the initiali- 


zation input pin. 


Thus, in its initial state, the 
PROSE machine loops on itself until 
a control signal, CT, goes low. Then 
a signal CT, must be set to com- 
mand either an encode or decode 


operation. An instruction (/CT )/ 


CT,), therefore, forces a jump to 
Aoo, Which starts the encoding op- 
eration of the state machine (see 
Fig. 7a). Data inputs serially, before 
encoding, as M, and outputs encoded 


serially via Q,. 


The Q, port provides an output 
for decoding; it is not used during 


encoding. Therefore, in the states in 














Table 2 
An IBM 2,7 RLL Code Table 


Input pattern Output pattern 


000100 
0100 
100100 
00100100 
1000 
001000 
00001000 










which encoding is performed, the la- 
bels OX and 1X in Fig. 76 refer to the 
output bits of Q; and Qgz, respec- 
tively, where X (for the Q, output) 
is a don’t-care condition. 

With PROSE running at twice the 
uncoded data rate and employing a 
four-stage pipeline, if the input pat- 
tern for M is 


11100011000101 


the encoded data output at Q, will be 


1000100010001000000100100000010 
00100010001001000... 


Similarly, the instruction (/CT)/ 
CT,) forces a jump to Cog, which 
starts the decoding operation. Dur- 








ing decoding, Q, is not used, but 
now Q; is the decoded serial data 
output. Therefore, the labels for the 
output pattern of Q are either 0X 
or 1X, where again X stands for a 
don’t care. 

For example, if the input pattern 
before decoding is the serial data in- 
put N: 


1000100010001000000100100000010 
0010001000100..., 


the output pattern at Q, will be 
11111111000110001010.... 


with each bit appearing twice, since 
all legal data is clocked twice. 

In a practical design, if erratic pat- 
terns occur in the decoded data 
stream, the encoder/decoder should 
neither correct the data nor stop, but 
should make an intelligent guess at 
the error’s probable nature and con- 
tinue without holding up the data 
stream. However, in this example, 
illegal patterns have been made to 
enter an error state, because this sit- 
uation is much easier to follow for 
illustration purposes. 
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PROM ARRAY OUTPUT PATTERN PAL ARRAY XOR 
ADDRESS FEEDBACK FEEDBACK 
FEEDBACK 
BIT 0 1 10 14 16 18 
© fled? TELL llel=lede 
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CODE FEEDBACK 
BIT 12 14 
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PROM ARRAY UNUSED PROM DATA re ae 
ADDRESS BIT CODE SIGNIFICANT 
FEEDBACK PROM ADDRESS BITS 
Fig. 6. Bit assignments for the shadow and output registers during normal operation (a) are 
quite different from the shadow and the output registers during PAL array programming (b) 
énd from the shadow and the output registers during PROM array programming (c). 
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INIT (RESET STARTUP STATE} 


One three-way branch (INIT), 53 aicons #T4*/CTQ | ieT1°CT9 on 
two-way branches, and 15 uncondi- 
tional branches occur in the 2,7 
RLL. The three-way branch will oc- 
cupy three quadrants of one primary 
location. The 53 two-way branches 
can be merged into 40 two-way 
branches and placed into 20 primary 
locations merely by combining with 
two identical next states. The 15 un- 
conditional branches can be merged 
into 8, which will occupy two pri- 
mary locations. Thus, the total num- 
ber of occupied primary locations is 
1 + 20 + 2 = 23, which is well under 
the maximum of 32. This state-mini- 
mization approach is primitive and 
merges out only the states whose sets 
of next states are the same or subsets 
of other remaining states. 

The PMS14R21 PROSE state ma- 
chine will be available in April for 
$25.95 each in quantities of 100. 
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Ns bl Fig. 7. The state-machine diagrams of a 
(x) 2,7 RLL encoder (a) and its decoder (b) 
ERR (FOR LEGAL. © @F@ @ typical example of the possible ap- 
(b) PATTERN) plication of the PMS14R21 device. 
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; THE DESIGN FILE 

; The following specification was written in the PALASM® 2 for- 
; mat, and can be compiled using the PALASM 2 and PROASM™ 
; software packages developed by Monolithic Memories. 


; There are two corrections to the said article: 
;1) Table 2, the output pattern for input 010 is 100100. 
32) Table 2 (same table), the output pattern for input 0010 is 


00100100. 
; This specification was written by Frank Lee with help from 
; Jo-Ning Ta 
; Header 
Title 14R21.G17 
Pattern 27r11l.eg 
Revision 03 
Author Frank Lee 
Company MMI 
Date 3/6/87 
; Chip 


CHIP PROSE-27RLL PMS14R21 
; Pin Definition 


CLK DCLK CT0CT1 NM NCNCNCNCSDIGND 
/TSDO Q7 Q6 NC NC NC NC NC NC MODE VCC 


; String Definition 
; Used in section on output patterns 
STRING CQS ’/Q7*Q6’ 
STRING SQ6’/Q7* Q6’ 
STRING SQ7’ Q7*/Q6’ 
; State machine type: Moore machine, output pattern is dependent 
; on current state only. 
; Feature or architecture pin is used as reset (or initialization) pin. 
STATE MOORE_MACHINE MASTER__RESET 
; This design spec starts with description for state transitions, fol- 
; lowed by output patterns, and conditions for the branches. 


; State transitions 
; Power-up state 


; The PROSE device is initialized to all 1’s. On the first clock, it will 
; perform unconditional branch to state INIT 
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POWER UP: = VCC —> INIT 


; The PROSE will stay at INIT until there is a command for it to 
; start encoding (condition ENC) or decoding (DEC). A00 is the 
; starting state for encoding and C00 is the starting state for de- 
; coding. 

_ INIT := ENC —> A00 + DEC —> C00 + —> INIT 


; Encoder design 


; A00,A10,...,B00,... are states for the encoder, as defined in fig. 7. 
; EIN is defined in the condition section. It is the same as input M. 
; VCC is an always-true condition. 
; A00 will branch to A11 on EIN, else to A10 

A00 := EIN —> All +—> Al0 
; A10 : always branches to A20 

A10 := VCC —> A20 

All := VCC —> A21 

A20 := EIN —> A31 +—> A30 

A21 := EIN —> A383 +—> A382 

A30 := VCC —> A40 

A381 := VCC —> A4l1 

A382 := VCC —> A00 

A338 ‘= VCC —> B00 

A40 := EIN —> A51 +—> A50 

A41 := EIN —> A538 +—> A52 

A50 := VCC —> A00 

A51 := VCC —> A60 

A52 := VCC —> A00 

A53 := VCC —> B00 

A60 := EIN —> A71 +—> A70 

A770 := VCC —> A20 

AT71 := VCC —> A20 

B00 := EIN —> B11 +—> B10 

B10 := VCC —> B20 

Bll := VCC —> B21 

B20 := EIN —> A381 +—> A30 

B21 := BIN —S A33 +—S A32 





; Decoder design 


; C00,C10,...,D00,...,E00,... are states for the decoder, as defined in 
; fig. 7. 
; DIN is defined in the condition section. It is the same as input N. 
; C00 will branch to C11 on DIN, else to C10 

C00 := DIN —> Cl1l+—> C10 

C10 := DIN —> C21 +—> C20 
> ERR is the error state for an illegal encoded pattern received. 


C11 := DIN —> ERR + —> C22 
C20 := DIN —> C31 +—> C30 

C21 := DIN —> ERR +—> C32 
C22 := DIN —> ERR +—> C33 
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C30 := DIN —> C41 +—> C40 
C31 := DIN —> ERR +—> C42 
C32 :=— DIN —> ERR + —> D00 
C33 := DIN —> C43 + —> E00 
C40 := DIN —> C50 +—> ERR 
C41 := DIN —> ERR +—> 051 
C42 := DIN —> ERR + —> C52 
C43 := DIN —> ERR +—> C53 
C50 := DIN —> ERR + —> C60 
C51 := DIN —> ERR +—> C00 
C52 := DIN —> C61 +—> E00 
C53 := DIN —> ERR +—> D00 
C60 := DIN —> ERR +—> C70 
C61 := DIN —> ERR + —> C71 
C70 := DIN —> ERR +—> E00 
C71 := DIN —> ERR +—> D00 
DOO := DIN —> D11+—>D10 
D10 := DIN —> D21 +—> D20 
Dil := DIN —> ERR + —> D22 
D20 := DIN —> C31 +—> C30 
D21 := DIN —> ERR +—> C32 
D22 := DIN —> ERR +—> C33 
E00 := DIN —> Ell +—>E10 
E10 : = DIN —> E21 +—> E20 
E1l := DIN —> ERR +—> E22 
E20 := DIN —> E31 +—> E30 
E21 := DIN —> ERR + —> E82 
K22 := DIN —> ERR +—> E33 
E30 : = DIN —> E41 +—> E40 
E31 := DIN —> ERR -+-—> B42 
E32 := DIN —> ERR +—> E43 
E33 := DIN —> E45 +—> E44 
E40 := DIN —> C50 +—> ERR 
E41 := DIN —>ERR+—> O51 
H42 := DIN —> ERR +—> C52 
E43 := DIN —>D11+—>D10 
E44 := DIN —> Ell +—> E10 
E45 := DIN —> ERR +—> C53 





; Error stay until initialized 
ERR := VCC —> ERR sSTAY UNTIL RESET 


; <state>.OUTF refers to the output pattern for the <state>. 
; CQS, SQ6, and SQ7 are output patterns. CQS means output bits 
;6 and 7 are all 0’s. SQ6 means output bit 6 is 1 and output bit 
; 7 is 0. SQ7 means output bit 7 is 1 and output 6 is 0. In all 
; cases, output bits 0-5 are all don’t cares as far as this design 
; is concerned. 
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INIT.OUTF = CQS 
A00.0UTF :—=CQS 
A10.0UTF := CQS 
A11.0UTF :— CQS 
A20.0UTF : = $Q6 
A21.0UTF : = SQ6 
A30.0UTF := CQS 
A31,.0UTF :— CQS 
A32.0UTF : = CQS 
A33.0UTF := CQS 
A40.0UTF :— CQS 
A41.0UTF :— CQS 
A50.0UTF := CQS 
A51.0UTF := CQS 
A52.0UTF : = SQh 
A53.0UTF : = CQS 
A60.0UTF : = CQS 
A70.0UTF : = SQ6 
AT1OUTF := CQS 
B00.OUTF :— CQS 
B10.0UTF := SQ6 
B11.0UTF := SQ6 
B20.0UTF : = CQs 
B21.0UTF := CQS 
C00.OUTF := CQS 
C10.0UTF := CQS 
C11.0UTF := CQS 
C20.0UTF := CQS 
C21.0UTF := CQs 
C22.0UTF : = CQS 
C30.0UTF : = CQS 
C31.0UTF : = CQS 
C32.0UTF : = CQS 
C33.0UTF := CQS 
C40.0UTF := CQS 
C41.0UTF := CQS 
C42.0UTF := CQS 
C43 OUTF := CQS 
C50.0UTF : = CQS 
C51.0UTF := CQSs 
C52.0UTF := CQS 
C53.0UTF := CQS 
C60.0UTF := CQS 
C61.0UTF :—= CQS 
C70.0UTF := CQS 
C71.0UTF := CQS 
D00.OUTF: = CQS 
D10.0UTF: = SQ7 
D11.0UTF: = SQ7 
D20.0UTF: = SQ7 
D21.0UTF: = SQ7 
D22.0UTF: = SQ7 
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E00.0UTF: = CQS 
E10.0UTF : = SQ7 
E11.0UTF: — 8Q7 
E20.0UTF: = $Q7 
E21.0UTF: = SQ7 
E22.0UTF: — SQ7 
E30.0UTF: = SQ7 
E31.0UTF: = $Q7 
E32.0UTF: = SQ7 
E33.0UTF : = $Q7 


E40.0UTF : = SQ7 
E41.0UTF : = SQ7 
E42.0UTF : = SQ7 
£43.0UTF : = SQ7 
E44,.0UTF : = SQ7 
E45.0UTF : = $Q7 


ERR.OUTF := CQS 


CONDITIONS 


The conditions for state transitions 
ENC = /CT1’°/CTO — ; encode command 
DEC =/CT1* CTO  ;decode command 


we 


EKIN=M ; encoder input (original NRZ data) 
DIN=N ; decoder input (encoded data) 
; SIMULATION 


; Simulation vectors not shown here 
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Designer’s Guide to 
VME Bus Control—Part 1 








By using fuse-programmable controllers (FPCs) and 
PLDs, you can implement VME Bus control in your 
computer system with a minimum of hardware. Just 
a few chips per plug-in module, and a bus arbiter, 
can perform all bus-control functions. This article, 
Part 1 of a 2-part series, describes the bus-arbitration 
process and control functions and shows you how to 
implement the VME Bus protocol in an FPC. Part 2 
will describe the implementation of slave controllers, 
discuss interrupt handling, and provide tools for pro- 
gramming the FPC. 


Arthur Khu, Advanced Micro Devices 


Until recently, you needed many ICs—sequencers, mi- 
croprogrammed contre! stores, and other MSI/LSI 
chips—to implement V) —3us control in a computer 
system. Now, however, you can use a minimum of 
hardware to handle the bus’s intermodule communica- 
tion functions. Just a few fuse-programmable control- 
lers (FPCs) and programmable-logic devices (PLDs) 
relieve the CPU of all bus-control functions. 

A typical VME Bus-based computer system com- 
prises one or more master modules (eg, CPU boards), 
one or more slave modules (eg, cache/memory boards), 
a bus arbiter, and interrupt-handling circuitry. A mas- 
ter initiates a data transfer to a slave by requesting 
control of the data bus from the bus arbiter. Once bus 
control has been granted to a master, the master and 
slave exchange control signals according to predefined 


Reprinted from EDN, October 2, 1987 
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protocols that guarantee an orderly transfer of data 
between the communicating modules. The interrupt- 
handling circuitry services all interrupt requests. 

By using the streamlined design in Fig 1, you can 
implement most intermodule communication ina VME 
Bus-based system by using just two types of VLSI 
devices: the Am29PL141 fuse-programmable device 
and the AmPAL22V10 programmable-logic device. For 
the remainder of the bus-control functions, you’ll need 
some bus-arbitration circuitry, which must occupy a 
particular slot on the VME Bus backplane, but which 
can reside on the same board with the bus master of 
highest priority. 

When you design VME Bus control into your system, 


PROCESSOR 
MASTER 1 


PROCESSOR 
MASTER 2 


BUS 
ARBITER 


_ VME BUS 


Fig 1—You can implement VME Bus control in a computer system 
by using just a few fuse-programmable controllers (FPCs) and 
programmable-logic devices. 
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You can implement VME Bus control in 
your computer system by embedding the 
bus-control functions in state machines that 
comply with the VME Bus protocol. 


your first step is to consider the VME Bus protocols. 
These protocols specify the steps your circuitry must 
take to perform any bus-related operation, such as the 
transfer of data between two modules. You can describe 
these protocols by means of flow diagrams that show 
how the various interface signals reflect the interac- 
tions between the communicating modules. Fig 2, for 
example, shows how the priority bus arbiter resolves 
simultaneous bus requests from two modules that use 
the same request line. 

Next, you analyze the flow diagrams to pick out the 
functions that can be incorporated into FPCs or PLDs, 
and you define state machines for those functions. You 
can describe the state machines abstractly, by Boolean 
logic equations, or diagrammatically, by means of 
standard flow-chart symbols (eg, rectangles to repre- 
sent processes and diamonds to represent control-flow 
decisions). Finally, you must write programs for the 
state machines in a high-level or assembly language. 
You repeat this process to design each of the four main 
types of VME Bus controllers: bus arbiters, masters, 
slaves, and interrupt handlers. 

Before any master module can perform a data trans- 
fer, it must request control of the data bus from the bus 
arbiter; the arbiter must check to see whether the bus 
is free and must resolve any contention between two 
masters of equal priority. For example, consider a 
priority-option bus arbiter implemented on a single 
PLD. The arbiter will monitor the four bus-request 
lines (BRo3) and assign the highest priority to BR3. 

As the flow diagram (Fig 2) shows, the arbiter 
grants the bus to the requesting module that’s using 
the highest active request line, which is BR; in this 
example. To enable your bus arbiter to resolve simulta- 
neous bus requests from two or more modules that use 
the same request line, you must daisy-chain the associ- 
ated bus-grant signal to all the devices using that 
request line. Therefore, the arbiter must be in the first 
slot of the VME Bus system. The module that’s closest 
to the bus arbiter will have the highest priority: If it 
requests the bus, it will receive the bus grant and lock 
out any modules farther down the chain (Listing 1). 
The AND/OR array of the PLD processes all bus 
requests in parallel, so that priority arbitration is 
complete in only one clock cycle. 
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Priority arbitration options 


Sometimes, the arbiter must force the current bus 
master to relinquish control of the bus; this situation 
occurs, for example, when a bus master of higher 


LISTING 1 


iF (/BBSY*(BRO+BR1+BR2+BR3)) THEN “ifbus notbusyanda ” 
BEGIN ”" request line is active ” 
“if BR3 is active, grant” 
" bus to device on BRB” 
“activate bus grant i 
” daisy chain 2 " 
“if BR1 is active and BR3” 
” and BR2 are not, then ” 
“ grantbustodevice ”" ° 
" using requestling 1 * 
“BRO-3 and BGOIN to 
" BGSIN are active low 


IF (BR3) THEN 
BGSIN := 1; 

IF (/BR3°BR2) THEN 
BG2IN := 1; 

IF /BR3/8R2°BRt) THEN 
BGIIN := 1; 


IF (/BR3°/BR2°/BR1°BRO) THEN 
BGOIN := 1; 
END; 


IF (BBSY*BGIIN) THEN “if BGIIN is asserted in response to a request ” 


BGIIN := 1; "over line BRt, then continue asserting BG1IN” 
"until requesting device drives BBSY active ” 


LISTING 2 


IF (BBSY"(6RO+BA1+BR2+BA3) THEN 
3) )) THEN 


ee ((MASTER(1:0} = 
BCLR:= 1; "assert BCLR if MASTER <> 3° 
IF (/BR3°8R2* ((MASTERT!: O} = 3) ) ) THEN 
BCLA := 1; 
IF (MASTER{1:0} = 0) THEN 
BCLR := 1; 
END; 


priority initiates a bus request. The arbiter examines 
the priorities of both the current bus master and the 
new requester. If these conditions meet the predefined 
bus-clear conditions, the arbiter asserts the bus-clear 
signal (BCLR). 

In the design in Fig 1, the bus arbiter keeps track of 
the current bus master’s priority by recording the 
bus-request line that was used to gain control of the 
bus. For example, if the current bus master used BR; to 
gain control of the bus, it sets two output registers 
called Bus_Master to the binary value 2. Either of the 
two following conditions will activate BCLR: 

@ The value in Bus_Master is 2, 1, or 0 and the 

active bus request line is 8 or 2. 

@ The value in Bus_Master is 0, and any bus- 

request line is active. 

When the value in Bus_Master is 3, the arbiter will 
not honor any bus requests until the Bus Busy line 
(BBSY) is high. Listing 2 gives the logical expression of 
these conditions. Once having asserted BCLR, the 
arbiter holds this line in the active state until the 
current bus master releases BBSY. 

Be sure to define the BCLR in such a way that 
uninterruptible devices can use bus-request line 3, 
which has the highest priority. Devices that temporari- 
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LOCATED IN SLOT 2 LOCATED IN SLOT 1 
MASTER B REQUESTER 8 MASTER A REQUESTER A ARBITER 
DRIVE (DEVICE es = DRIVE (DEVICE 
WANTS BUS) HIGH ican WANTS BUS) HIGH 
) | ARBITRATION ! 
be DETECT (DEVICE IN PROGRESS | DETECT (DEVICE 
WANTS 8US) HIGH Ge cerned WANTS BUS) HIGH 


DRIVE BRT LOW DRive BRI LOW 


DETECT BA; LOW 
DRIVE BGTIN LOW 


DETECT SGiIN LOW 
ORIVE BBSY LOW 


RELEASE BAI DETECT BBSY Low 
DRIVE (DEVICE DRIVE BGTIN HIGH 


GRANTED BUS) HIGH j 


DETECT (DEVICE DETECT BGTIN HIGH 
GRANTED BUS) HIGH 





[MASTER A HAS CONTROL 
ei cal od ales ed corde PERFORM DATA TRANSFERS 


ORIVE (DEVICE 
WANTS BUS) LOW 


DETECT (DEVICE 
WANTS BUS) LOW 
RELEASE BBSY 


DRIVE (DEVICE DETECT BBSY HIGH 
GRANTED BUS) LOW 


ee sores ome 
[arertration em 
IN PROGRESS DRIVE BGTIN Low 
comme mens DETECT (DEVICE 
GRANTED BUS) LOW 


DETECT BGIIN LOW 
BGIIN BGioUT ORIVE BGTOUT LOW 


s DAISY CHAIN ‘ 


DETECT BGTIN LOW 
DRIVE BBSY LOW 


RELEASE BRI [waster B HAS dcreer BEV OW 
DRIVE (DEVICE GRANTED CONTROL OF DATA- DRIVE BGTIN HIGH 


BUS) HiGH [__ TRANSFER BUS 
BGTIN BGTOUT 
——————— DETECT BGTIN HIGH 


TIN 
DAISY CHAIN DRIVE BG1OUT HIGH 








OETECT (DEVICE GRANTED 
BUS) HIGH 


DETECT BGIIN HIGH 


PERFORM DATA TRANSFERS 
DETECT (DEVICE WANTS 


BUS) LOW 
SOE ELLGW. —————— DRIVE (DEVICE GRANTED 
RELEASE BBSY [ARBITER WAITS FOR 


NEW BUS REQUESTS j 


DETECT (DEVICE GRANTED 
BUS) LOW 


Fig 2—Flow diagrams help you define the VME Bus protocols. This flow diagram illustrates how the bus arbiter chooses between requesters 
that have the same priority level. 
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LISTING 3 


"specify the device to use” 
iNE 


DEVICE "MASTER" (Am29PL141) 
DEF 


“define the test conditions” 
"master request Input 

"bus grant in line 

"data transfer acknowledge line * 
"check address strobe bus line * 
"other test conditions 


bus_request = t0 
bus__grant = (i 
dtack = [2 
addr_strobe == t3 


"define the control outputs” 
off_state = 0000#h 
as = 8000#h 
busmaster = 4000#h 


“off state * 
"46th bit = 1: address strobe ” 
"15th bit = 1: inform master ” 
arb_feq = 2000#h = “14th bit = 1: drive BRniine * 
bbsy . = 1000#h "13th bit = 1: drive BBSY line “ 
"signals going to the bus ere gated thru inverting high-current 
bus driver; these include AS,ARB_.REQUEST, and BSSY . 


signal = 0001#h; "other control signals . 
“FPC assembler formatis = <label:output,statement; >, with label 
optional” 
BEGIN 
“wait for a bus request to be asserted high” 
@) start : off_state , while (not bus__request) walt 
else goto pi(get__bus) ; 
(©) bus_granted “assert BBSY and BUSMASTER signals” 
: bbsy+busmaster 
, if (addr__strobe = 1) then 
goto pl{bus__granted) ; 
“AS(L) must be HIGH before continuing; this means previous bus 
master is not driving the bus anymore 
“other statements” 


"BUS ACQUISITION microcode subroutine: the control output 
ARB__REQUEST Is asserted (BRAn line) until the /BGINn signal 
is received active LOW; if the bus request line from the master 
goas LOW before the bus is granted (e.g., master cancels bus 
request), then turn off the ARB_REQUEST output and return 
to the walt state START 

get__bus: arb_request , If (bus__grant « 
goto pi(bus__granted) ; 

. if (Ous__request) then 
goto pi(get_bus) ; 

, if (not bus__request) then 
goto pistart) ; 


0) then 
” arb__request 
off__state 


END. "end of source cade” 


CPU MODULE 
(MASTER) 


STATUS 


BUS 
GRANT 


FROM BUS 
ARBITER £ 
~~. 


74LS14 


Fig 3—You can implement the requester logic for a master module 
in an FPC. The FPC handles all bus-acquisition and data-transfer 
protocols. 


ly can be suspended to accommodate interrupts and 
higher-priority operations should be assigned to bus- 
request line 0. The BCLR conditions will vary with 
your application, but you can modify them just by 
redefining the high-level logic expressions. 
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Master modules 

The master modules of VME Bus systems initiate all 
data transfers over the bus. A system can have one 
master (the CPU unit) or several (multiple processors, 
DMA controllers), A master module must control the 
bus before it can perform any data transfers. Most of 
the bus-control logic resides in the requester section of 
the module; this section handles bus acquisition and 
communication with other master or slave devices in 
the system. 

You can microprogram all the requester functions 
into a single FPC, which serves as the interface be- 
tween the master device and the VME Bus (Fig 3). The 
FPC’s microprogram handles the bus-acquisition proto- 
col. Upon receiving the bus-grant signal from the 
arbiter, the microprogram informs the master device 
that it has control of the bus and may initiate a data 
transfer. After completing the transfer, the master 
device releases the bus-request signal, thus causing the 
FPC to free the bus by releasing the BBSY signal. The 
FPC may also relinquish control of the bus at the 
request of the bus arbiter. 

In designing your application, you need to extract the 
bus-acquisition phase of the requester from the flow 
diagram in Fig 2 and translate this information into a 
state diagram (Fig 4), from which you must write the 
corresponding FPC assembler source code. Listing 3 
gives an example of this code. 


FPC-controlled data transfers 


Once a master module becomes the bus master (that 
is, once it gains control of the bus), it can begin 
transferring data to a slave module. A data-transfer 
flow diagram (Fig 5) shows the necessary handshaking 
signals for transferring 32-bit data between master and 
slave. The state diagram for the master module’s FPC 
(Fig 4) combines the bus-acquisition (requester) and 
data-transfer-control functions of the master module. 
An FPC assembler (which the manufacturer of the FPC 
provides) simplifies the task of microprogramming this 
state machine into an FPC. 


Handling unanswered data-transfer requests 

To prevent bus lockups, which malfunctioning slave 
devices may cause, you should implement a bus-time- 
out (BTOy) option in the master module by writing a 
microcode routine that uses the 6-bit counter in the 
master’s FPC: (Listing 4 gives an example of such a 
routine.) The bus-time-out option permits a bus master 
to abort a data-transfer cycle if the slave does not 
respond within n microseconds. 

At the start of every data-transfer operation, the 
FPC microprogram loads a value into the 6-bit counter, 
tests for the data-transfer acknowledge signal 
(DTACK), decrements the counter and tests it for zero, 
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ISSUE BUS 
REQUEST 
BR 


NO 





ACTIVATE BBSY 
© INFORM MASTER 
LT MASTER BUS 
ions BLOCK BUS GRANT OUT 
DRIVE BR HIGH 




















LATCH ADDRESS 
ONTO BUS 

ORIVE CWORD LOW FOR 
32-BiT DATA TRANSFERS 


CHECK IF SEQUENTIAL 
TRANSFER CYCLE 














ORIVE BEST HIGH 
(BR MusT BE 
RELEASED 30 nSEC 
BEFORE BBSY 


RELEASE CWORD, 
TACK 


OEACTIVATE MASTER- 
GRANTED-BUS 
SIGNAL 









WAIT FOR MASTER 
TO REQUEST BUS 


GO TO START 









BUS 
ERROR 
ROUTINE 


Fig 4—You can derive detailed state-machine diagrams from your flow diagrams. In this diagram for a master requester, the 
bus-acquisition phase is surrounded by a dashed line. Note the close correspondence between the state diagram and the assembly-language 
program in Listing 3. 
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The bus-time-out option permits a bus 
master to abort a data-transfer cycle tf a 
Apres THE SEE slave does not respond within a specified 


PAESENT baci ies : 
PRESENT ADDRESS MODIFI 

DRIVE LWORD LOW time. 
DRIVE [ACK HIGH 

DRIVE AS LOW 


LISTING 4 

















SPECIFY DATA DIRECTION PROCESS ADDRESS 





DRIVE WRITE Low RECEIVE ADDRESS "Begin transfer : Bus time oul loop contains instructions 1 and 
RECEIVE ADDRESS MODIFIER 2. If count reg is loaded with 63 and FPC is running at 20 MHz, 
ReCene Pane raw then total time before bus times out is (63+1)"2°50 ns . 
bed escalated ADDRESS 1S VAL 1D ee re to piGerloe one) : i 
St U — , 
BEAR MGH mOcaTES m" FOR THIS SLAVE DSO + DS1 , while (creg < > 0) noe 
THAT PREVIOUS SLAVE IS TED Se EeaTE Device Pree loop to pi(wait__oop) ; 
NO LONGER DRIVING DATA BUS) NO FURTHER ACTION DSO + DS1, if (creg = 0) then 
pine gee goto pi(bus__time__ out) 
transfer_complete: “goto to this section if handshaking signal 
was received” 
STORE DATA bus__time__out: "execute this section when counter is 0” 


RECEIVE WRITE LOW 
RECEIVE DS; LOW 
RECEIVE DS LOW 
LATCH DATA FROM DATA 


BTO__signal,....... “activate bus time out signal” 








LINES 00.31 

SELECTED DEVICE and finally loops back to the instruction that tests 
DTACK. As soon as DTACK is detected, the program 
branches to the section of code that handles a normal 

data-transfer operation. 
If the counter value reaches zero, however, the 
RESPOND TO MASTER microprogram must branch to a section of code that can 
DAIVE OTACK LOW handle situations in which data-transfer requests are 
not completed; such situations are, of course, entirely 
eeeEaeiaa dependent on the user and the application. To calculate 
bdr Ah oes the time that will elapse before the program generates 
AOOnE = Moore n Uncs ; a bus-time-out signal, multiply the number of instruc- 
RELEASE [WORD as asked sate tions in the time-out loop by the system cycle time by 
DRE DS; GH [ ror saruicrry THe assumPrion | the initial value (plus 1) that the program has loaded 
ee into the count register. EDN 

Author’s biography 





Arthur Khu, a product planning engi- 


A dating socnowigbos tinmNiCA neer for Advanced Micro Devices 


Sr CVELE THEN RECEIVE AS, 0501 HIGH (Sunnyvale, CA), is responsible for re- 
RELEASE DSo. RELEASE DYACK search and definition of advanced pro- 
ELSE G0 TO ADDRESS ERLE grammable-logic-device architectures. 


He holds a BS in Math/Computer Sci- 

ence and an MS in Computer Science 

from Santa Clara University. In his 
Fig 5—This flow diagram shows the signals you’ll need to perform spare time, Art enjoys racquetball and 


a 32-bit data transfer once a master has acquired control of the astronomy. 
bus. 
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By using fuse-programmable controllers (F PCs) and 
PLDs, you can implement VME Bus control in your 
computer system with a minimum of hardware. Part 
1 (October 2, pg 187) of this 2-part series described 
the bus-arbitration process and control functions and 
showed how to implement the VME Bus data-transfer 
protocol in an FPC. This second and final part de- 
scribes the implementation of similar controllers for 
slave modules and interrupt handlers, and it pro- 
vides tools for programming the FPC. 


Arthur Khu, Advanced Micro Devices 


When you offload a system’s bus-control functions from 
the CPU toa hardware bus controller, you considerably 
reduce the time these functions require, thereby im- 
proving the data-transfer rate over the bus. You can 
substantially reduce the cost and chip count of such a 
bus controller by implementing the bus-control logie in 
VLSI devices stich as fuse-programmable controllers 
(FPCs) and programmable logic devices (PLDs). 

The VME Bus protocol requires that all data trans- 
fers over the bus be initiated by a master module. Your 
system can include several master modules, such as 
CPUs or DMA controllers; a bus-arbiter module arbi- 
trates simultaneous data-transfer requests from two or 
more masters. No data transfer can take place until the 
requesting master has been given control of the bus by 
the bus arbiter. 
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LATCH_DATA 
STORE_DATA 


MEM ony BCAHD Soe 


on 


EE 
REQU 


en a 


a. WME BUS. 7) : 
ADDRESS MODIFIERS AN $ 


Fig 1—This device controller, which is implemented with an FPC 
and a PLD, handles data-transfer and interrupt protocols for a slave 
device ina VME Bus system. 


Because a slave can’t initiate a data transfer, your 
system must include an interrupt mechanism that al- 
lows a slave to request service from a master. You car 
implement such a mechanism in your system by design- 
ing a slave subsystem like the one in Fig 1. This slave 
subsystem comprises a slave module and a slave inter- 
rupt controller. The slave interrupt controller, which 
consists of an Am29PL141 FPC and an AmPAL22V10 
PLD, serves as the interface between the slave subsys- 
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The interrupt controller for a slave device 
implements the interrupt protocol in 
hardware. 


tem and the VME Bus structure. Once the master has 
initialized the slave, it can issue commands to the slave. 
The slave performs these tasks in the background, 
leaving the master free to continue its own operations. 
When the slave is ready to return the status or result of 


a task to the master, it issues an interrupt request. . bois 


The VME Bus single-cycle data-transfer protocol 
defines the interactions between master and slave con- 
trollers (Fig 2). When the master drives the address 
strobe (AS) low, the slave controller latches the address 
presented on the bus, and a separate address-decoding 
unit on the slave board decodes the address. If the 
address is not valid (ie, if it’s not in the range associated 
with this slave), the slave takes no further action. 
However, if the address selects this slave subsystem, 


‘(apc 
ADORESS THE SLAVE 

. PRESENT ADDRESS 
ness MODIFIER 
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Fig 2—The slave controller recognizes its own address and receives 
a data word from the bus master in this flow diagram for single-cycle 
transfer. 
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Fig 3—This state-machine diagram for Fig 1’s slave controller is derived from the flow diagram in Fig 2. The state machine has four modes, 
which handle interrupts as well as single, block, and read-modify-write data transfers. 
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The VME Bus protocol requires that all 
data transfers over the bus be initiated by 
a master module. 


the slave looks for signals from the master that specify 
whether a read or a write operation should take place. 
After presenting or storing data, the slave controller 
drives the data-transfer-acknowledge signal (DTACK) 
low to inform the master of the successful transfer. 

From the VME Bus-protocol flow diagrams (Fig 2), 
you can derive a state diagram (Fig 3) for the slave- 
controller state machines; Fig 4 shows the resulting 
timing pattern for the slave controller. You can develop 
microcode for the slave controller’s single-cycle transfer 
mode from the state diagram and the timing pattern. 
Use the address-modifier lines (AMo;) to specify the 
other three slave operating modes (sequential, or block, 
transfers; read-modify-write transfers; and interrupt 
eycle). To recognize these special modes, the slave 
controllers on each slave board constantly monitor the 
six address-modifier lines. 

When the code presented on the address-modifier 
lines specifies a block-transfer operation, the master 
retains control of the bus throughout the operation by 


holding AS and BBSY low. For a block transfer, bus 
arbitration takes place only once, before the start of the 
operation; for single-cycle transfers, bus arbitration 
takes place before the transfer of each word. A block 
transfer, therefore, takes less time than does the corre- 
sponding number of single-word transfers. 

At the start of a block-transfer operation, all the 
slaves load the address presented on the bus into their 
address counters and decoders, but only the slave 
whose memory range encompasses the decoded address 
responds to the data-transfer request. As each word 
transfer is completed, all the slaves increment (or 
decrement) their address counters and decode the new 
address. This procedure is necessary because the mem- 
ory block being transferred may reside on more than 
one slave memory board. 

In the slave subsystem in Fig 1, the PLD decodes 
control signals from the bus and slave board and sends 
two signals, OPER» and OPER,, to the slave’s FPC. The 
FPC decodes these two signals, along with inputs from 
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VALIDATES ADDRESS 


READY WHEN AS ACTIVE . 
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single-cycle data transfer. 
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Fig 4—You’ll need to generate a timing diagram for each of the slave controller's operating modes. This diagram shows the timing for a 
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the slave, bus, and address-decoding units on the slave 
board, to determine which of the four possible slave 
operating modes to execute. The four modes, which are 
designated by binary codes, specify the following oper- 
ations: 

@ (00) Perform a single-cycle transfer 

e@ (01) Perform a sequential-cycle transfer 

@ (10) Perform a read-modify-write cycle 

@ (11) Perform an interrupt cycle. 

If OPER» and OPER, are both high, the FPC operates 
as an interrupter by branching to an interrupt subrou- 
tine (Fig 3). 

When the slave requests an interrupt, the FPC 
generates an interrupt-request signal and waits for the 
interrupt-acknowledge signal (I[ACK) and daisy-chain 
signal (IACKIN). When the data strobes become ac- 
tive, the PLD reads a 83-bit value from the address bus 
(A,.s); this value indicates which interrupt-request line 
was acknowledged. The PLD decodes these three bits 
to determine whether their value matches its own 





nee " SHP ISSUES 
 SETBUS "aus REQUEST 





request level; if it finds no match, no further action 
occurs. If it does find a match, however, the PLD routes 
a valid signal to the FPC, indicating that the interrupt 
handler has acknowledged the slave’s interrupt re- 
quest. The FPC then signals the slave board to put its 
status or identification byte on the data bus for the 
interrupt handler to use as an interrupt vector. The 
slave FPC waits in a loop until the interrupt handler 
drives the IACK signal high to signify that interrupt 
service is complete. 

Besides containing master, slave, and bus-arbiter 
modules, a VME Bus system usually has an interrupt- 
handler module that handles external I/O or special 
system events (time-out or overflow errors, for exam- 
ple). You can reduce the logic complexity of the inter- 
rupt-handler module in your system by offloading some 
of the initial interrupt-recognition tasks to an inter- 
rupt-handling preprocessor (IHP). You can use a PLD 
as the IHP, programming it to preprocess interrupt 
requests, obtain control of the bus, and handle hand- 
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Fig 5—The interrupt-handling preprocessor monitors the seven VME Bus interrupt-request lines (TR,;), identifies the interrupting device, 
and tells the interrupt handler when to begin servicing the interrupt. This timing diagram shows the signal states that exist before and during 
the transfer of the identification and status bytes from the slave-interrupt subsystem to the interrupt handler. 
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When the slave module 1s in block-transfer 
mode, bus arbitration takes place only once 
before the start of the operation. 





shaking signals (such as interrupt-acknowledge sig- 
nals). Only when the IHP latches the interrupt vector 
will control pass to the interrupt-handler module. 

To define interrupt-request processing, bus acquisi- 
tion, and the interrupt vector’s transfer phase, you'll 
have to use logic equations written in high-level Boole- 
an notation. In the design in Fig 1, the PLD monitors 
seven interrupt-request lines and four data-transfer 
control inputs, and it sends 10 control signals to the 
interrupt handler and the VME Bus drivers. The PLD 
monitors all interrupt-request lines according to the 
following logic equation: 


IF (IR]1 + IR2 + IR3 + IR4 + IR5 + (1) 
IR6 + IR7) THEN 
BR38:=1; 


“THIS INTERRUPT HANDLER USES” 
“THE BR3 REQUEST LINE” 


If any interrupt-request line is active, the PJ.D asserts 
BR38 to initiate the bus-acquisition phase. 


The next step is to wait for the bus-grant-in signal. 








Only when BGIN3@ is active will BBSY be active. The 
logical expression of this condition is given in the 
following equations: 


IF (BR3*/BG3IN) THEN (2) 


BR3:=1; 


IF (BR3*BG3IN + BBSY*/SERVICE_DONE) THEN (3) 


BBSY: = 1; 


Eq 2 continually asserts BR8 as long as BG3IN is not 
active; Eq 3 asserts BBSY only when request line BR3 
and BG3IN are active, or if service is not complete after 
the IHP asserts BBSY. 











When the IHP is. the bus master, it puts the 3-bit 





Development tools help you program FPCs 


The Am29PL141 is a single-chip fuse-programma- 
ble controller (FPC) that can implement state ma- 
chines and distributed control functions. You can 
express and functionally verify these state ma- 
chines and functions by using three FPC-develop- 
ment tools provided by the manufacturer: an.as- 
sembler, a test-vector generator, and a simulator. 
These development tools, which are written in C, 
run on an IBM PC or compatible computer under 
MS-DOS. Source code for the programs is avail- 
able, so you can port the software to other 
systems. 

The FPC has a control store that’s resident in 
its PROM; the store is 64 words long and 32 bits 
wide. Instructions such as jumps, loops, and sub- 
routine calls in the PROM control store are condi- 
tionally executed by the 20-MHz internal seq- 
uencer. Each 32-bit instruction is partitioned into 
the following format: 


BITS: 1 5 1.3 °6 16 
FIELD: OE OPCODE POL TEST DATA OUTPUT 


The output field contains the 16 control outputs 
that the FPC generates when it executes an in- 
struction. The upper eight output bits are con- 
trolled by the output enable (OE) bit; they may 





be high, low, or disabled. The lower eight output 
bits are always enabled. The op-code field speci- 
fies which of the 29 possible instructions the FPC 
will execute if the condition selected by the test 


DATABASE 
PL141 
ASSEMBLER 


PROM BIT 
PATTERN 


TEST-VECTOR 
GENERATOR 
JEDEC FUSE 
MAP 
PROM 
PROGRAMMER 


To program the Am29PLI41 FPC to implement VME Bus- 
control functions, you can use the assembler, test-vector genera- 
tor, and simulator provided by the FPC’s manufacturer. These 
development tools, which are written in C, run on an IBM PC or 
compatible computer under MS-DOS. 


SIMULATOR 
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interrupt-line-acknowledge value on the bus and as- 
serts the data strobes. Upon receipt of the DTACK 
signal from the interrupter, the IHP strobes the status 
byte from the bus into a register on the interrupt- 
handler card. The IHP then informs the interrupt 
handler that a status/identification byte is ready and 
that the handler should begin servicing the interrupt. 
The IHP resolves interrupt priorities within a single 
clock cycle because all interrupt/input signal lines are 
processed in parallel by the logic array in the PLD. 
Listing 1 shows how you'd express the procedure in a 
logic-description language. 

As you can see from Listing 1, if both the IR7 and the 
BBSY signals are active in section 1, then the 3-bit 
value generated by the IHP will be binary 111 (decimal 
7), regardless of the state of the other interrupt lines. 
In section 2, if [R1 and BBSY are active and the other 
six control signals are inactive, then the 3-bit output 
value will be binary 001 (decimal 1). In section 2, IR1 is 
the highest priority line that is active. 

The remaining interrupt-preprocessing steps com- 
plete the transfer of the interrupt-vector byte; this 
transfer is described logically in the following 








field is true. The polarity bit (POL) determines 
whether a high input or a low input represents 
true for the test input selected. You can use the 
data field as an argument for the op-code field. 
For example, if you want the FPC to jump (op 
code 25) to location 34 (22uex) when the second bit 
in the test field is high (true), you put the follow- 
ing 32-bit word into the control store (though 
you'll also have to define the OE bit and the 
output bits): 


OE 11001 0 010 100010 OUTPUT 


The assembler simplifies system design by al- 
lowing you to use a high-level language. Instead 
of specifying the bit patterns shown above, you 
can write the following section of code: 


OUTPUT , IF (T2 = 1) THEN 
GOTO PL (34); 


The assembler translates this statement into the 
appropriate bit patterns; when the FPC executes 
the instruction, it generates the bit pattern de- 
fined by the symbol OUTPUT. 

‘Every microinstruction written in assembly 
language follows this format: 





LISTING 1 
IF (eBsy) THEN — . a 4 


(1) F A?) THEN 
INTR{2:0} : = 7; 


IF (IA7*IR6) THEN 
INT 


' "NR was active 


interrupt 
“IR6 active, IA7 inactive 2 
“acknowledge interrupt line 6” 


Rl2:)} : = 
IF (/IA7* AR6*IRS) yiHEN "IRS highest peed 
INTR[2:0}: = 6; “Interrupt line active . " 


(2) IF lar pRGs SRA AA” RAMI) THEN 
on TR[2:0} : = eknowledge Interrupt tine. 1” 


IF (BBSY) THEN (4) 
BEGIN 
TACK := 1; 
(X) IF (DTACK) THEN 
LATCH_STATUS := 1;. 


“begin interrupt acknowledge daisy” 
“chain; if device sends data 
“transfer acknowledge (DTACK) 
“signal, then latch the status 


END: 
IF (LACK) THEN 


AS := 1; “assert the address strobe signal to inform the 


interrupter that the interrupt acknowledge 
level is ready” 


< LABEL: > OUTPUT , STATEMENT ; 


The label field, which is optional, simplifies the 
writing of subroutine calls and conditional branch 
instructions, which you can express in IF-THEN- 
ELSE, WHILE-DO, or COMPARE forms. Once 
you’ve written all the instructions with the editor 
and translated them to executable form with the 
assembler, you can cause the assembler to gener- 
ate a JEDEC fuse map. 

Before physically programming the PROM con- 
trol store that you’ve designed, you can test your 
design with the help of the test-vector generator 
and simulator. The test-vector generator produces 
test vectors, from a user-generated truth table, 
and converts them into a JEDEC-standard test- 
vector file that the simulator can use. 

The simulator. performs two important func- 
tions: It verifies the control flow of your design, 
and it checks the fuse map that the assembler 
generates. By using the simulator’s interactive 
mode, you can inspect, modify, or preset any of 
the internal registers in the FPC. The simulator 
also has single-stepping and break-point features 
so that you can trace the operation of your design 
in detail. 
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An interrupt-handling preprocessor can off- 
load some of the initial interrupt-recogni- 
tion tasks of your systems interrupt-han- 
adler module. 





Once the assertion of the DTACK signal indicates the 
successful transfer of the 8-bit status or identification 
byte, the IHP instructs the interrupt-handler module 
to begin the interrupt-service routine. This instruction 
from the IHP is logically defined as follows: 





IF (BBSY*LATCH_STATUS + BBSY* (5) 
START_SERVICE*/SERVICE_DONE) THEN 
START SERVICE : = 1; 


This definition states that the IHP constantly asserts 
the START_SERVICE signal until the interrupt han- 
dler generates a SERVICE_DONE signal, at which 
point the IHP drives START_SERVICE low. The logic 
described above generates the timing diagram shown in 
Fig 5. 








Development tools simplify controller design 


Development tools provided by FPC and PLD manu- 
facturers simplify the task of programming these de- 
vices as VME Bus controllers. Once you’ve analyzed the 
bus protocols and converted these into state-machine 
diagrams, you can write assembly-language programs 
and high-level logic equations to describe the state 
machines. The assembler and logic software will then 
process these programs and equations to fit into the 
FPC or PLD. For asummary of the programming tools 
available for the Am29PL141 FPC, see box, “Develop- 
ment tools help you program FPCs.” EDN 
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The first fuse-programmable controller 
eliminates bulky and expensive designs, freeing distributed 
intelligence to carve out a greater niche for itself. 


4 spawned the suburbs, standard microprocessor 
ad buses and add-on boards have distributed pro- 
cessing intelligence, revolutionizing the design of 
digital systems. Breaking systems into independent 
modules shortens the design cycle, eases upgrades, 
and accelerates fault diagnosis. But despite these ad- 
vantages, an essential element has been missing: a 
one-chip controller geared specifically to the needs of 
distributed intelligence. 

Without that critical ingredient, engineers have 
been forced to turn to less than optimal solutions. 
One approach relies on boards packed with as many 
as 35 SSI and MSI devices. Another tack is to go with 
a powerful —yet costly— VLSI chip. Alternatively, a 
programmable logic device can be pressed into ser- 
vice, but such circuitry lacks the computing power to 
control peripherals. 

The missing element, a fuse-programmable con- 
troller chip, is now here. By mixing intelligence and 
control, the Am29PL141 stakes out new territory for 
distributed systems. The 20-MHz IC combines for the 
first time all of the elements of an intelligent micro- 
code controller. Its powerful sequencing logic steps 
through the controller’s 64-by-32-bit pipelined 
PROM. That fuse-programmable memory stores a 
user-defined microprogram drawn from a set of 29 


A: much the same way that cars and highways 
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microinstructions, including a repertoire of jumps, 
multiple branches (or case statements), and sub- 
routine calls. All can be executed conditionally, de- 
pending upon the outcome of one of eight tests. In 
addition, a serial shadow register on the 28-pin chip 
helps designers diagnose system troubles right down 
to a particular IC. (In the past, expediency often dic- 
tated that complex trouble-shooting be avoided for 
as long as possible.) 


Four basic blocks 


The controller comprises four main functional 
blocks. Three of them—the microaddress control 
logic, condition code selector, and microinstruction 
decoder—form the cornerstone of the controller, the 
address sequencer. The fourth is a microprogram 
memory (64 by 32 bits) with a pipelined register and 
serial shadow register (Fig. 1). 

For the most part, the elements of the address 
sequencer are fairly typical. Nevertheless, the way in 
which they are organized and connected, as well as 
the instruction set, make the chip unique. For ex- 
ample, the microaddress control portion of the 
sequencer contains one register for counting loops 
and another for stacking subroutine return address- 
es. Yet either register can be employed to double the 
capacity of the other. Consequently, the chip can nest 
two levels of loop counting or two levels of subroutine 
branching (the instruction set reflects those abili- 
ties). And the high degree of interaction between ele- 
ments, particularly within the microaddress control 
logic, makes necessary a highly sophisticated micro- 
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instruction set. 

The microaddress control logic is the brain of the 
address sequencer, since it generates the addresses 
that access the microinstructions. At any time, the 
address that is called depends on the preceding in- 
struction and the outcome of any conditional tests. 

Within the control logic, a program-counter multi- 
plexer supplies the PROM’s 6-bit address. The multi- 
plexer takes the address from a microprogram 
counter, incremented program counter, branch con- 
trol logic, or subroutine register. Because the pro- 
gram counter contains the address of the currently 
executing instruction, that instruction is executed 
again when the program counter is selected as the 
address source. As a result, the counter plays a fun- 
damental role in tallying loops and executing “wait 
until true” instructions. 

The incrementer holds the next address in the 
sequence, and is the expected source when no jumps 
are executed and no branch or subroutine conditions 
exist. When conditional statements like “if... 
then...else” and multiple branches pass the re- 
quired tests, or when unconditional jumps are exe- 
cuted, the branch control logic supplies the address. 
Finally, when the program calls a subroutine, the 
subroutine register supplies the necessary address. 

A multiplexer selects one of three address sources. 
If only one stack level is needed, the value stored in 
the subroutine register is chosen. When the count 
register feeds the subroutine register, however, it 
furnishes an additional stack level. The third source, 
the incrementer, supplies the subroutine’s return 
addresses. 


Doing double duty 


If not needed for a second subroutine level, the 
count register can, among other functions, execute it- 
erative loops and time external events. To accom- 
plish the former, the controller loads the register 
with the number of iterations to be run. Each iter- 
ation decrements the register until it reaches zero. 
The zero-detection logic associated with the counter 


‘informs the chip’s microinstruction decoding logic 


when the register “bottoms out.” 

Using the same logic, an instruction can be re- 
peated a set number of times. Repeated executions of 
the same instruction is a simple way to insert wait 
states and, therefore, build an interface to different 
microprocessors and peripherals. 

The count register is loaded from any of four 
sources: a decrementer, for normal loops; an instruc- 


tion field; the subroutine register; and the branch- 
control logic. The last derives a 6-bit value from a 
data field in a microinstruction. 

The branch-control logic, a powerful block within 
the sequencer, calculates the 6-bit value either by ap- 
plying the microinstruction data field directly or by 
using it to mask the chip’s six test inputs, Ty through 
T;. In the second case, the masked input actually be- 
comes the branch address. Moreover, either the data 
field or masked test value serves as both a branch ad- 
dress and a count value. 

The same control logic also compares the masked 
test inputs to a constant in a microinstruction field. 
The outcome of this check affects a flip-flop. The lat- 
ter’s condition itself becomes a factor in deciding 
conditional branch and subroutine instructions. If a 
match occurs, the flip-flop is set. Alternatively, the 
flip-flop remains unchanged if there is no match. 
Because the flip-flop does not change when there is 
no match, it is particularly useful for comparing 
ASCII characters and other 6-bit fields, as well as for 
successively checking the chip’s test inputs. 


Controlling conditions 


A set flip-flop is one of the eight aforementioned 
tests that fulfills a conditional branch or subroutine. 
Through its condition-code selection logic, the con- 
troller is able to check each of its six test input lines, 
as well as the Condition Code input. Further, an 
exclusive-OR gate within the selection logic switches 
the meaning, or interpretation, of a test result. In 
other words, with no external hardware, a test condi- 
tion can be asserted either when a match occurs or 
when one does not. 

The final component of the chip’s sequencer sec- 
tion is the microinstruction decoder. That pro- 
grammable logic array generates the IC’s internal 
control signals based on the microinstruction being 
executed and the test results reported by the con- 
dition-code logic. 

The IC’s fourth functional block comprises the 
fuse-programmable microprogram memory, pipe- 
lined register, and serial shadow register. The pipe- 
line register is 32 bits wide, and stores the micro- 
instruction being run. The next address is calculated 
by the sequencer and its contents is fetched from the 
microprogram memory. The upper 16 bits of the 
Pipeline’s output remain within the chip to sequence 
addresses and control internal functions. 

Only the 16 low-order bits link to the outside, as 
user-defined control lines. Of these, the upper byte is 
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1. The 20-MHz Am29PL141 is the first complete microprogrammable controller chip, making it an important 
building block tor distributed processing systems. Its powerful sequencing logic steps the controller through 
its pipelined PROM. The fuse-programmable memory is 64 by 32 bits. 





an Monolithic Kart Memories al 2-635 














2-636 


Fuse-Programmable Chip Takes Command of Distributed Systems 


put in the high-impedance state by setting a micro- 
instruction’s Output Enable bit to 0. Moreover, chips 
can be cascaded readily if more than 16 control bits 
are needed (Fig. 2). 

The serial shadow register, also 32-bits wide, sim- 
plifies device- and system-level diagnostics. It can be 
loaded in parallel with the contents of the pipelined 
register or loaded serially from the Serial Data Input 
pin. On the other hand, the serial register can also 
load the pipeline or shift data out serially. It also may 
simply hold the data sent to it. 

To check out the chip, an instruction is shifted 
serially into the shadow register and then loaded in 
parallel into the pipeline. Doing so forces the instruc- 
tion to be executed, and its results transferred back 


Condition Code O 


Microprogram 
address 
sequencer 


Upper 
16 bits of 
microinstruction 


\ 
Microprogram 


memory 
(64 X 32 bits) 


shadow 
register 


#16 bits of 
output 


ca 


(To system) 


Am29PL141 
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from the pipeline into the shadow register. From 
there it is shifted out for diagnosis. If all the shadow 
registers in a system are tied together, a series diag- 
nostic loop is created that isolates a problem down to 
a single chip. 


The shadow fuse 


A separate fuse must be blown to set up the serial 
shadow register. When that is done, four pins are 
redefined to handle diagnostics. Specifically, the 
Condition Code and Zero lines and Output Data Bits 
6 and 7 become, respectively, the Serial Data In 
(SDI), Serial Data Out (SDO), Diagnostie Clock 
(DCLK), and Mode control lines. 

The strength of any controller—and the advantage 
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2. When an application calls for more than 16 control bits, two or more chips can be 
cascaded horizontally. The lower 16 bits of each of the chip’s 32-bit microinstructions 
(of which there are 29) serve to control a system's components. Eight of these 16 con- 
trol bits can be put into a high-impedance state under microinstruction control; the oth- 


er 8 bits are always enabled. 
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of a microprogrammed system that employs it—lies 
in an engineer’s ability to specify the sequence in 
which microinstructions are executed. To ensure 
that ability, the controller executes all the basic 
high-level constructs required for structured micro- 
programming. Its 29 op codes include sequential in- 
structions, conditional instructions, dual branching 
forks, and multibranching case statements. Iterative 
executions, like For, While, When, and Until, round 
out the set. In addition, Jump, Jump to Subroutine, 
Loop, and Compare instructions allow designers to 
store very complex algorithms in the chip’s 64-word 
memory. 

Instruction formats fall into two categories. The 
first is for general microinstructions; the second is 
for the chip’s Compare instructions. The latter com- 
pare a 6-bit test input to a masked constant. The 
Compare instructions are well-suited for character 
searches, as well as key searches in a look-up table. 

A single-precision, floating-point peripheral board 


(Fig. 3) presents a good example of the part that the 
controller plays in a distributed system. As a micro- 
programmed design, the peripheral serves as an add- 
on math accelerator card that plays with different 
hosts and buses. The controller orchestrates the ac- 
tions of the floating-point processor, and various reg- 
isters, register files, and memory chips. 


Simple arithmetic 


The processor is simple to use, partly because it in- 
curs no pipeline delays. It conforms to IEEE and oth- 
er industry standards, and takes only a single clock 
cycle to add, subtract, or multiply. It needs five cycles 
to divide, using the Newton-Raphson method that in- 
verts one of the factors and multiplies. In operation, 
to divide X by Y the chip fetches the approximate in- 
verse of Y from a PROM-based table and multiplies 
it by X. One or two iterations of this method increase 
the initial accuracy. 

The floating-point board works with a microword 
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3. In a typical application, the controller oversees the workings of a floating-point pro- 
cessor board. Host instructions sent to the FIFO and instruction registers initiate sub- 
routines in the controller that generate the signals that run the board. Two controllers 
are employed to supply the necessary number of control signals. 
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of at least 25 bits, 9 more than available with one con- 
troller. Thus the design employs two controller chips. 
The floating-point processor requires five command 
bits. Three are instructions and two select the input 
source. It also needs three control bits to enable its 
trio of data registers. Two other chips (each a dual- 
access four-port régister of 64 words by 18 bits) tem- 
porarily store commands. They accept 12 register ad- 
dress bits (6 for source and 6 for destination) from 
the host or the controller’s microprogram memory. 
Data passes to and from the host through input 
and output registers on the board, which call for 
their own enable signals. Another bit is needed to ad- 
vance the FIFO instruction register. One is necessary 
to enable and another to select a status word. (The 
floating-point chip supplies status information, 
which is available to the controller through its test 
inputs as well as to the host through the register file.) 
Seven control bits are left for miscellaneous tasks. 
Operation begins when at least one 16-bit instruc- 
tion is loaded from the host into the peripheral’s 
FIFO register. The instruction consists of a 4-bit op 
code, a 6-bit source-register address, and a 6-bit ad- 
dress for the second source register, which also stores 
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4. Loading an instruction into the FIFO starts the pe- 
ripheral and activates the controller, which loads an 
external instruction register and decodes the op 
code field. The op code initiates a subroutine in the 
controller, issuing the proper control! signals, ad- 
vancing the FIFO register, and loading the next 
instruction. 


the results. Until an instruction is received, the con- 
troller is in the wait state (Fig. 4). When an instruc- 
tion arrives, however, the FIFO’s Empty signal acti- 
vates the controller, which then reads the command 
from the FIFO into a separate instruction register. 
The controller also loads the instruction’s op code in- 
to its test inputs. It then masks the two unused test 
bits and jumps to a subroutine that performs the 
operation specified by the op code. After completing 
it, the controller advances the FIFO register to load 
the next instruction. 

The peripheral executes up to 16 op codes. The first 
eight are single-cycle operations and identical to 
those of the floating-point processor. They consist of 
addition, subtraction, multiplication, and format 
conversion instructions. The remaining op codes are 
used to load, store, and divide data, and a multiple cy- 
cle instruction multiplies and accumulates values. 
The four remaining op codes can be defined by the 
user to implement application-related operations. 

Software and hardware tools are a necessary part 
of such projects as the foregoing peripheral. The soft- 
ware assembles high-level microprograms and a 
JEDEC output file that specifies the fuse pattern to 
be burned into the PROM array. Currently, a pro- 
gram called Fuse Formatter, which runs on the IBM 
PC personal computer, lets designers enter hexadeci- 
mal code that corresponds to PROM data. From that 
code, the program creates a file that is downloaded 
directly to one of several PROM programmers. The 
latter blow the corresponding fuses in the micro- 
program memory and are the only required hard- 
ware tools.O 





Om Agrawal is the product planning manager for 
programmable logic devices at AMD. He has designed 
16- and 32-bit minicomputers, and is the coauthor of a 
book on high-speed memory systems. Agrawal holds a 
PhD in electrical engineering and computer science 
from Iowa State University. He also received an MBA 
from the University of Santa Clara. 

As a senior product marketing engineer for the 
company's microprocessor division, Deepak Mithani, 
designs and markets bipolar microprocessors. He 
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University and an MSEE from the University of 
Wisconsin. 
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PAL Device Buries Registers, 
Brings State Machines to Life 





Advanced Micro Devices Inc., 901 Thompson PI., P.O. Box 3453, Sunnyvale, CA 94088; (408) 732-2400. 


Designers who build state machines from pro- 
grammable logic devices (PLDs) must develop 
good juggling skills. The job usually takes a careful 
balancing of the I/O pins, the product terms, and 
the registers that store states and output bits. But 
because these resources all are limited in a PLD, 
performance and compactness often have to be sac- 
rificed in favor of the desired function. 

Ideally, state-machine designers need a PLD 
with buried, or internal, state registers, compact 
packaging and high 
speed. Buried registers 
free valuable I/O pins; a 
small package saves 
precious board space; 
and high operating 
speed speaks for itself. 

Approaching the 
ideal is the AmPAL- 
23S8, the first 20-pin 
programmable array 
logic (PAL) device de- 
signed specifically for building state machines and 
sequencers. Its architecture is more flexible than 
any 24- or 28-pin PAL device or programmable 
logic array (PLA). For example, it has 14 edge- 
triggered, D-type flip-flops (Fig. 1). That alone 
compares favorably with 24-pin bipolar PAL de- 
vices, which have up to 10 registers, and with 24- 
and 28-pin PLAs, which have 12 to 14 registers. 
Only 40-pin PAL devices with 16 registers surpass 
the new chip. 

As significant is the fact that of the chip’s regis- 
ters, six hold buried state bits. By definition, the 
buried registers do not merely free I/O pins: In- 
deed, they actually enhance the chip’s value as a 
state machine. Driven by the chip’s AND-OR 
array the buried state registers, along with the eight 
registers dedicated to I/O pins, make possible a 
wide range of states and outputs. 

Moreover, the buried state bits can be easily 
accessed, controlled, and observed. Although the 


Reprinted by permission of Electronic Design, July 1986. All rights reserved. 
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first two attributes are common among PAL de- 
vices, the third has always presented problems. The 
23S8 is the first chip whose product-term array 
makes the buried state registers observable. 

Of the other eight registers, four serve in output 
logic macrocells and four store output states. Each 
macrocell contains three main blocks (Fig. 2). One 
block is the register, a rising-edge-triggered flip- 
flop sharing asynchronous reset and synchronous 
preset inputs with all the other flip-flops. The other 
two blocks are multiplexers, one of them selecting 
from one of four output paths, the other from one of 
two feedback paths. 

The logic chip is small, fast, and easy to program. 
Packaged in a 0.3-in. DIP, it is half the width of 
28- and 40-pin PAL and PLA type devices. More- 
over, it comes in 33- and 28.5-MHz versions, with 
propagation delays of 20 and 25 ns, respectively. 
The faster version is an ideal companion for new 
8-MHz, 16-bit microprocessors, as well as for 
16-MHz, 32-bit units. Even at 28.5 MHz, the new 
chip is faster than existing bipolar PLA-based se- 
quencers, which operate from 20 to 25 MHz. It also 
beats equivalent-density PAL devices operating at 
or below 25 MHz. 

Programming takes place by blowing reliable 
and fast platinum silicide fuses, and using available 
software packages. Like all PAL devices, the 23S8’s 
fusible AND-OR array is a direct measure of its 
capability. With 6200 fuses, it is three to six times 
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the size of standard 20-pin bipolar PAL devices. In fact, it 
is larger than all other bipolar PLDs except for the 40-pin 
chips, which are three to four times as large and have 
some 8000 fuses. The new chip’s fuse array handles all 14 
registers, with 6 to 12 logic products for each output, and 
up to 23 inputs—-9 of them dedicated, 8 for feedback, and 
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4. The AMPAL23S8 Is the first 20-pin PAL device de- 
signed specifically for building state machines 
and sequencers. It contains 44 registers, 6 of 
them buried and requiring no I/O pins, 4 dedi- 
cated to output macrocells, and 4 storing output 
bits. The states of the burled registers are easily 
observed, which simplifies debugging. 


6 for the buried states. 

The number and distribution of the chip’s product 
terms are two other measures of its capability. It has alto- 
gether 135 product terms for logic and control functions. 
Moreover, the AND array that drives the 14 flip-flops is 
variably distributed (see the table ). 

The distribution of the 124 logic product terms helps 
adapt the chip to system requirements, while optimizing 
its internal resources. (Of the 11 other terms, 8 control 
output, the remaining 3 control preset, reset, and observ- 
ability, respectively.) For additional flexibility in de- 
signing powerful state machines, the chip can trade off the 
product terms of its buried registers. 


OBSERVABILITY CLOSE UP 


The chip’s observable product term is generated by the 
AND-OR array, along with an ability to preload the 
buried registers. Observability helps check a buried regis- 
ter’s state, so that a designer can monitor any register dur- 
ing debugging. 

Under control either of a single pin or of a combination 
of six input signals, the observable product term drives a 
set of inverting buffers. In normal operation, the buffers 
also serve the four dedicated output registers and two of 
the four macrocells. When activated, therefore, the ob- 
servable product term disables signal flow from those reg- 
isters and macrocells and simultaneously connects the six 
buried registers to their respective I/O pins. 

Flexibility in configuring a state machine derives from 
the independent control of output and feedback multi- 
plexers. Each output macrocell has three fuses, two to 
determine the output path and one the feedback path. The 
fuses can configure the macrocells in eight ways: a combi- 
natorial or a registered output with either an I/O pin ora 
registered feedback. Each of those four variations offers 
either active high or active low outputs. 

In the output path, fuse S, determines whether the out- 
put is active high or active low. The output-nature fuse, 
S2, selects sequential or combinatorial operation. When 
both of those fuses are intact, a flip-flop’s Q pin passes 
through an inverting buffer, making the output active low 
and registered. 

On the other hand, programming or blowing S, renders 
the output term active high; blowing S, causes the output 
to bypass the register, making it combinatorial. A combi- 
natorial output with feedback can, moreover, provide 
more than one level of ORing logic. Finally, fuse S, con- 
trols the feedback path, sending either the flip-flop’s Q 
output or the I/O pin back into the array. 

Controlling each output’s enable line through a prod- 
uct term from the AND array creates further options by 
transforming an I/O into a dedicated input, a dedicated 
output, or a dynamically controlled I/O pin. A designer 
can thus adjust the number of input and output pins, and 
have the dynamic I/O capability required by most buses. 
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Each output enable product term is further associated 
with an individual polarity fuse. This polarity fuse allows 
designers to modify a control signal to enable outputs us- 
ing De Morgan’s theorem, which is especially useful for 
bus control applications. 
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2. Each programmable macrocell has three main 
blocks: a rising-edge-triggered D-type flip-flop, 
an output multiplexer, and a feedback multi- 
plexer. The output multiplexer selects from one of 
four output sources; the feedback multiplexer 
feeds either an !/O pin or the flip-flop’s Q output 
into the programmable AND-OR array. 
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3. Two 2388s lie at the heart of an advanced dynamic RAM controller that 
compares well in its data-transfer features—such as dynamic bus sizing and 
handling misaligned data—with recent 32-bit microprocessors. One of the two 
main PAL devices controls the timing and arbitration of the memory cycles; 


the other sizes and aligns data transfers. 
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The chip’s high speed and buried state registers make it 
an ideal foundation for a flexible and fast dynamic RAM 
controller that meets the complex requirements of the lat- 
est microprocessors. Even at a 16-MHz processor clock 
rate it can become a controller with few or no wait states 
that might otherwise have to be built from scratch. 

There is, for example, no ready-made controller that 
can take advantage of dynamic bus sizing the auto- 
matic adjustment to the amount of data movable in one 
bus cycle. Nor can any off-the-shelf unit handle misal- 
igned data, that is, when an operand falls outside its 
proper memory boundaries. Bus sizing and data align- 
ment are features of the recently arrived Motorola 68020 
and the Intel 80386, both 32-bit microprocessors. But a 
controller able to perform the two functions can be built 
around two 23S8s, one for timing and arbitration logic 
and the other for datz sizing and alignment (Fig. 3). 

The timing and arbitration chip arbitrates among 
present, refresh, and CPU cycles; executes read or write, 
read-modify-write, and refresh cycles; and asserts inter- 
face signals. The chip first arbitrates between the proces- 
sor’s Memory Request (MREQ) and Refresh Request 
(RFCLK) signals. It gives priority to the cycle currently 
being executed, follows this with the refresh cycle, and 
concludes with the processor cycle. Both intermittent and 
burst-mode refresh schemes are therefore possible. 

The design’s timing relationships and arbitration re- 
quirements define a state-machine diagram (Fig. 4). The 
actual state machine 
employs five of the six 
available buried regis- 
ters. The sixth register 
is a flag and so acts as a 
small, independent state 
machine. Either of two 
software packages 
Cupl from Personal 
CAD Systems Inc.'s 
Assisted Technology 
Division (San Jose, 
Calif.) or Abel from 
Data I/O Corp. (Red- 
mond, Wash.) easily 
describes the state oper- 
ations. (Cupl is the lan- 
guage used in this par- 
ticular example. ) 

No arbitration takes 
place during the exe- 
cution of a cycle, which 
is indicated by a state 
other than 0. The tim- 
ing and arbitration chip 
stores any refresh re- 
quest and responds to it 
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later, continuing to follow the current cycle through its 
various States. Thus the current state automatically takes 
priority, and no other cycle is started until the current one 
is complete. 

At the end of the cycle, at state 0, the timing and arbi- 
tration chip arbitrates between MREQ (a CPU cycle) 
and RFCLK. In the small independent state machine, 
the buried register flag, Bs, latches RFCLK, which may 
last no longer than one clock cycle. When RFCLK is 
high, B, toggles to state 1 and stays there until the refresh 
cycle is executed. Then it toggles back to state 0 to be 
ready for the next RFCLK. 

During refresh cycles, the timing and arbitration chip 
asserts the Refresh signal (RFSH), which supplies the re- 
freshed row address to the memory address bus. The chip 
also generates the Row and Column Address Enable 
Lines, ROWADD and COLADD, which multiplex ad- 
dresses onto the memory bus. At state 16, the chip re- 
moves ROWADD and asserts RFSH; state 17 asserts the 
Row Address Strobe signal (RAS) and refreshes all 
memory banks before removing RAS and RFSH. Re- 
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4. A state diagram for the timing and arbitration chip reflects its oper- 
ation for selecting and orchestrating the memory cycles. The chip 
walts in state 0 to handle either a processor cycle or a refresh cycle; 
the latter takes priority over any but a cycle currently in progress. The 
chip performs refresh, read, write, and read-modify-write cycles. 


RAS, RFSH, ROWADD 


moving RFSH increments an external 10-bit refresh-row- 
address counter (AmPAL22V 10) to the next row. 

If no refresh request is pending during state 0 and 
MREQ is asserted, a processor cycle begins and the ma- 
chine jumps to state 1. Otherwise the machine stays in 
state 0 and polls for either a refresh or processor cycle. 

The PAL chip responds to the processor’s signal RMC 
to execute a read, a write, or a read-modify-write cycle, 
with the internal registers controlling timing for the dif- 
ferent cycles. If, for instance, the memory consists of 
4 Mwords, organized as four equal banks of 32-bit words, 
the chip asserts a RAS signal for each bank—RAS,g 
through RAS;. 

The strobe signal being asserted depends on address- 
select signals ADSEL» and ADSEL,, which in most sys- 
tems are the high bits of the processor's address bus. After 
asserting the proper strobe, the machine jumps to state 2 
to manipulate the address-handling signals. For instance, 
to multiplex addresses, ROWADD and COLADD are 
again asserted. 

In state 2, the timing and arbitration chip removes 
ROWADD and asserts COL- 
ADD, along with SIG, the refer- 
ence that tells the data-sizing and 
alignment chip that a processor cy- 
cle is in progress. SIG syn- 
chronizes the two 23S8s and in- 
structs the data-sizing and 
alignment PAL to send a column- 
address strobe (CAS) to memory. 

The state machine remains in 
state 2 for an extra-cycle, and then 
moves to state 4 to allow time to ac- 
cess the data. There it decides, 
based on processor signal RMC, 
whether to execute a read, a write, 
or a read-modify-write cycle. Be- 
fore completing a read or write ac- 
cess, the timing and arbitration 
chip removes RAS and waits for 
one more cycle to allow for the 
RAS precharge time. Simulta- 
neously, it switches back to ROW- 
ADD from COLADD in prepara- 
tion for the next processor cycle, 
and disables SIG. 

When the RMC signal requests 
a read-modify-write cycle, the 
timing and arbitration chip auto- 
matically lengthens the cycle. It 
holds RAS until DSACK and WE 
signals have been asserted, and 
only then completes the cycle. 

The data-sizing and alignment 
chip works only during a CPU cy- 
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cle. It creates WE for the write and read-modify-write cy- 
cles, and it accepts SIG, which synchronizes the chip’s 
different states (Fig. 5) with those of the timing and 
arbitration chip. 

Four of the data-sizing and alignment chip’s buried 
state registers generate its state-machine function. The 
machine starts polling in state 0 and stays there until it re- 
ceives SIG, the start of a processor cycle. 

The 68020 has a 32-bit maximum data width and a 
data bus divided intu four byte-wide segments, each inde- 
pendently controlled by a CAS signal from the data- 
sizing and alignment chip. The processor can therefore 
read and write byte-wide data over any segment. 

The four CAS signals, CAS, to CAS;, strobe the col- 
umn address from the high- to low-order memory bytes. 
The signals also allow the microprocessor to read and 
write across the low-low, low-middle, high-middle, and 
high-high bytes of a 32-bit data port. The data-sizing and 
alignment chip must assert all four CAS signals for a 
32-bit port; only CAS, and CAS; are needed to supply a 
16-bit port’s low- and high-byte strobes; while for an 8-bit 
port, CAS; is enough. 
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§. The data-sizing and alignment chip works only 
during a processor cycle and waits in state 0 until 
one is requested. If the processor calls for a read 
ora write cycle, the chip asserts DSACK, CAS, 
and, for a write cycle, WE. For a read-modify-write 
cycle, it asserts two DSACK signals, one each for 
the read and write segments. 


The data-sizing and alignment chip asserts the CAS 
signals in state 1, and selects them according to the size of 
the port in use, the width of the data being moved, and 
how the data aligns with the port. Coding on input pins 
PRT, and PRT, shows the port size; signals SIZ) and 
SIZ, from the microprocessor show data width; and the 
microprocessor’s least-significant address bits, Ay and 
A,, give the alignment. 


THE PROCESSORS PREFERENCE 


The size of the port dictates the amount of data that can 
be transferred in any one cycle. Based on the PRT signals 
the state machine asserts DSACK signals, which ac- 
knowledge the CPU and tell it how much data it can 
move, since it will always try to move as much as possible. 
The state machine synchronizes the acknowledgment 
timing, asserts the required CAS signals, and holds them 
until the read, write, or read-modify-write cycle is com- 
plete. 

A read cycle’s DSACK signal is asserted in state | and 
remains until the end of the cycle at state 4. A write cycle 
is identical, except for the additional WE signal. But 4 
read-modify-write cycle requires two DSACK signals, 
one for reading, the other for writing. The first DSACK is 
the same as that for a read or a write cycle; the second 
DSACK occurs in state 8 and remains until state 11, with 
a delayed WE asserted in state 8 by an internal 4-bit 
counter.O 
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hen it comes to generating 
W complex, high-resolution dig- 

ital waveforms, Advanced 
Micro Devices’ Am2971 picks up 
where older parts leave off. Precise 
time delays once required the use of 
either a hybrid structure consisting 
of an analog delay line combined 
with digital logic or costly counters 
driven by high-frequency clocks. On 
the one hand, analog delay lines have 
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short enough tap-to-tap time delays, 
or resolutions, to be usable for han- 
dling the tighter timing relation- 
ships of dynamic and static RAMs. 
However, those lines cannot also ac- 
commodate the longer total delays 
needed by these devices. 

On the other hand, the desired 
waveforms can be obtained digitally 
from the outputs of one or more 
binary counters. But to achieve reso- 
lutions of 10 ns, counters must be 
driven with 100-MHz clock frequen- 
cies, which are difficult to distribute 
around a board. 

The programmable event gencra- 
tor (PEG) a monolithic IC, offers 
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a more elegant solution for the prob- 
lem of generating complex high- 
speed timing waveforms (see Fig. 
1). Applications for the PEG (see 
box, “A new PEG in the designer’s 
tool box”) range from simply cor- 
recting the clock skew that results 
from distributing a clock signal on 
a backplane to generating complex 
state-machine timing. The PEG’s 12 
output lines can be programmed by 






EVENT | EVENT 
0 ! 





Fig. 1. In response to a logic tran- 
sition on its trigger (TRIG) input, the 
PEG will generate 12 simultaneous 
user-programmed timing waveforms 
on its Ty to Ty, output lines. 


the user to assume either a logic- 
high or a logic-low level within each 
of 32 time slots or events. The fre- 
quency of these events—that is, the 
frequency of the output waveforms 
—is also programmable. But it is the 
precision and resolution of these 
waveforms that is the PEG’s claim 
to fame (see Fig. 2). 


Internal operation 

The PEG is an edge-triggered 
logic device that generates a pre- 
programmed digital waveform in 
response to a triggering signal. It 
consists of four basic blocks: a next- 
address and event store, a start-ad- 
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dress store, an oscillator and clock- 
control block, and control logic (see 
Fig. 3). A total of 623 user-program- 
mable platinum-silicide fuses, simi- 
lar to those used in AMD’s bipolar 
PROMs and programmable array 
logic devices, are located throughout 
these blocks. 

When the PEG’s fuses are being 
programmed, the 12 timing-tap out- 
puts (T, to T,,) are used as fuse- 
address inputs, A thorough descrip- 
tion of the procedure for program- 
ming the PEG can be found on its 
data sheet, so for the purpose of 
understanding what goes on inside 
the chip, let’s assume that it has al- 
ready been programmed. 

An output-timing sequence is ini- 
tiated by a logic transition at the 
TRIG input. Two user-program- 
mable fuses are located in the trig- 
ger-polarity block. One of them is a 
polarity-select fuse, which when un- 


Fig. 2. Tap-to-tap resolution of an 
analog delay line (a) is not indepen- 
dent of its total delay time. However, 
the PEG’s 12 programmable output 
waveforms (b) are independent of 
one another. Resolutions down to 
10 ns can be easily obtained. 
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programmed will cause the timing 
sequence to start during a negative 
transition of the TRIG input. If it 
is programmed, the sequence starts 
when a positive transition occurs. 

The second fuse in this block is 
used to define the end of a timing 
sequence. If this fuse is left unpro- 
grammed, the timing sequence is 
stopped on the trailing edge of TRIG 
pulse. Otherwise if the second fuse 
is programmed, the end of the tim- 
ing sequence is defined by the stop 
bits as programmed by the user into 
the next-address/event generator 
functional block. 


Customized waveforms 

A transition at the TRIG input 
latches three address bits, Ag to Ao, 
which are decoded by the start-ad- 
dress store, which serves as an 8 x 
5-bit mapping PROM. Each of its 
eight addresses represents a differ- 
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ent starting point for the 12 output 
waveforms. For example, three dif- 
ferent addresses in this store could 
be used to initialize output wave- 
forms that implement the read-, 
write-, and page-mode timing se- 
quences of a dynamic RAM. 

The five bits of each address are 
used to select one of thirty-two 18- 
bit locations from the next-address 
and event store (also a PROM). 
Each 18-bit string contains three 
groups of information. The first 5 
bits define the next address of the 
desired timing sequence. The next 
12 bits define the logic levels on each 
of the timing-output lines (Ty to 
T,,). The 18th bit is a sequence-stop 
bit. As the timing sequence pro- 
gresses from the first to the thirty- 
second event-store address, the out- 
put lines will produce 12 independent 
waveforms. 

The assumption so far is that the 
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stop-function fuse has not been 
programmed. If it has, the timing 
sequence will no longer stop on the 
trailing edge of the TRIG pulse. The 
end of a timing sequence will instead 
be defined by the stop bit. When a 
timing sequence encounters a stop 
bit, the entire sequence comes to a 
halt, and the 12 timing-output lines 
remain at their current logic level. 

Each of the timing-waveform out- 
puts has a minimum useful cycle 
time of 40 ns (or 20 ns per change). 
When the PEG is operating at its 
maximum internal-clock frequency 
(f, = 100 MHz), the outputs must 
be programmed to remain un- 
changed for at least two clock pe- 
riods for each change of logic level. 
That is, although an output can only 
change a minimum of every 20 ns, 
the timing resolution between events 
among taps may be as low as 10 ns. 
When the PEG’s internal clock is 
programmed to operate at 50 MHzor 
slower, the timing waveform can be 
programmed to change once each 
clock period. And its clock-frequency 
range is low enough—from 7 to 20 
MHz—for the user to drive the 
PEG’s CLK input from the system 
clock via an internal clock input pin. 

Alternatively, the designer may 
opt for crystal control by connecting 
a 7 to 20-MHz crystal to the X, and 
X, inputs of the PEG. And thanks 
to the five fuses within the oscillator 
and clock-control block, the user can 
also set the frequency of the output 
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Fig. 3. The PEG consists of four basic blocks: a next-address and event store, 
a Start-address store, an oscillator and clock-contro!l block, and control logic. 


waveforms. Four of these fuses are 
used to select f,. 

When an external crystal is used, 
the PEG’s internal phase-locked loop 
(PLL) will treat the crystal’s fre- 
quency as an input-reference fre- 
quency, f,, and will multiply it by a 
programmable value (1, 5, 10, 4, %4, 
or 1%) to obtain the desired value of 
f, (see Fig. 4). Also an output-clock 
frequency, f,, is available on the 
CLKOUT pin. This output clock-—— 
whose frequency is either % or Yo 
of the internal-clock frequency— 
may be used to synchronize the out- 


A new PEG in the 
designer’s too] box 
Designers no longer have to grap- 
ple with low-resolution analog de- 
lay lines or costly timing logic 
when high-speed waveforms are 
needed. The Am2971 program- 
mable event generator, trade- 
marked PEG, from Advanced Mi- 
cro Devices has 12 output lines 
that can produce and accurately 
place logic transitions as short as 
20 ns, with a minimum waveform- 
to-waveform resolution of 10 ns. 


puts of the PEG with the remainder 
of the system. And speaking of sys- 
tems, the PEG comes in very handy 
for generating the specialty cycles 
offered by today’s DRAMs. 


DRAN-timing application 

Typically, the system designer 
must use more than one analog de- 
lay line and some external logic to 
produce the multiple CAS pulses 
needed for the page-mode cycles of 
some DRAMs. Timing designs using 
high-speed clocks and counters are 
equally cumbersome in this case. 
Once again, the PEG chip offers a 
streamlined solution because it_can 
readily generate the multiple CAS 
pulses. 

Another job for the PEG comes 
about as a result of the use of multi- 
plexed address lines on high-density 
DRAM boards. Because the many 
address lines presenta relatively 
large capacitive load to the multi- 
plexer, significant time delays are 
introduced in the address-timing 
waveforms. 

Typically, a signal provided by the 
system switches one or more ad- 
dress-multiplexer ICs. This signal 
must be accurately timed to change 
state after time interval 1 in Fig. 5. 
The performance of the individual 
DRAM—and ultimately of the entire 
memory system in which it resides— 
depends on reducing interval 2 as 
much as possible. Interval 1 is 15 to 
20 ns in most standard DRAM spe- 
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X13 A7 TO 20-Mbz AT-CUT 
PARALLEL-RESONANT CRYSTAL 


(a) 


CLKOUT ‘fo! 


Fig. 4. The PEG can be clocked by 
an external crystal (a) or by the sys- 
tem clock. On-chip programmable 
clock-control logic (b) enables the 
user to set the internal-clock fre- 
quency, fe, 


cifications. Thus, a resolution of 10 
to 15 ns is desirable for the DRAM- 
timing waveforms—an easy job for 
the PEG. 

What’s more, the PEG can be used 
to generate the necessary timing 
waveforms for multi-array mem- 
ories too, In video-DRAM systems, 
for example, the rising edge of the 
transfer-enable signal, TRG, must 
be accurately placed relative to the 
rising edge of the serial-port clock 
signal, SC. When the PEG is used to 
generate both waveforms, the de- 
signer has full control over the reso- 


CLK'Xy ‘f}) 


—— 


lution between the rising edges of 
the TRG and SC signals. 


More events and/or channels 

In some applications, 12 output 
channels are sufficient but more than 
32 event states are necessary. These 
additional states can be obtained by 
connecting two PEGs as shown in 
Fig. 6. After PEG 1 has cycled 
through its 32 states, its Ty line 
triggers PEG 2. The second PEG 
will then begin its timing sequence 
at the start address specified by PEG 
1. If necessary, the clock frequency 
of PEG 2 may be changed by using 
the clock output of PEG 1 as the 
clock input to PEG 2. Otherwise, the 
internal clocks and TRIG polarities 


[coin 000 XAXXXXAXAXKMKKK KX 





COO ee D 


Fig. 5. In this ORAM delayed-write cycle, interval 1 must be kept to within 10 
or 15 ns, and interval 2 must be as short as possible. When mounted on the 
same DRAM memory board, the PEG could be used to generate the RAS, 


CAS, and WE signals. 
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Fig. 6. By connecting two PEGs, de- 
signers can obtain timing waveforms 
that consist of more than 32 events. 


of the two PEG chips are indepen- 
dently programmable. 

If the waveforms must be nested, 
one of the output taps of PEG 2 can 
be used to remove the TRIG signal 
from PEG 1. PEGs can also be par- 
alleled for applications in which 32 
event states are sufficient but more 
than 12 output channels are needed. 
For example, when 12 to 24 channels 
are needed, the designer can simply 
drive two PEG chips with the same 
input signals and use as many of the 
output lines as necessary. oO 
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Wait-State Remover Improves 
System Performance 





he trends toward faster micropro- 

cessors and denser dynamic ran- 

dom-access memories have 
sparked a conflict. New-generation mi- 
croprocessors require machine cycle 
times of 150 to 250 ns (Table 1). But 
traditionally as DRAM density increases, 
cycle times slow. Needed is a way to 
combine high performance with zero- 
wait-state operation. 

Although hardware-intensive cache 
schemes can work, they are very costly. 
It is possible, however, to improve sys- 
tem performance with little additional 
hardware by substituting enhanced- 
page- and static-column-mode cycles for 
regular memory cycles, thus cutting 
down the number of wait states. The 
resulting system would be especially 
useful for small systems and subsystems 
with space restrictions. 

Our design incorporates a DRAM such 
as the Am90C256, a CMOS device that 
offers uninterrupted serial or random ac- 
cess to 512 bits of data, or the 
Am90C257, which combines dynamic 
storage and static decode to nearly dou- 
ble sequential access rates. These de- 
vices provide extended-page and static- 
column access modes at a small increase 
in cost. These modes permit data access 
times of 40 to 60 ns, enabling the mem- 
ory controller to remove wait states for 
several processor cycles. 

The only additional hardware is an 
AmPAL22V 10, a 24-pin programmable 
logic device that offers an AND/OR logic 
structure for custom programming, 22 
inputs, and 10 outputs with 12 product 
terms each to provide timing and arbitra- 
tion for extended-page-mode and refresh 
cycles. Also needed are two devices to 
latch and compare row addresses and a 
counter to track the extended-page- 
mode cycles’ time out. 

Usually, microprocessors perform a 


Reprinted with permission of VLSI, 
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FIGURE 1. Wait state remover arbitration scheme. 


number of continuous accesses in a lo- 
calized address range to fill up the in- 
struction pipeline or to store or retrieve 
data. Our scheme is based upon this 
locality of memory access, somewhat 
similar to cache schemes. It involves 
performing an extended-page-mode (or 
a static-column/ripple-mode) access cy- 
cle, instead of a regular memory cycle. 
The time required for the data to be read 
or written for this cycle is the same as for 
a regular memory cycle. The cycle’s 
row address is stored and then compared 
to that of the subsequent cycle. This row 
address would be the same for a local- 


ized access cycle within the same row of 
up to 512 (for a 256K) or 1054 (fora 1M 
DRAM) column addresses. 

If the row address is the same, the 
system performs a fast-access, en- 
hanced-page-mode cycle that requires 
only the column address and no proces- 
sor wait state. If the addresses are differ- 
ent, the current extended-page-mode cy- 
cle is completed, and a fresh cycle is 
performed at the new row address. Com- 
pleting the previous cycle and initiating 
anew one adds 20 to 40 ns to the normal 
cycle, depending on the speed of the 
DRAM controller. Usually, one wait- 
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state processor cycle provides additional 
time of 50 to 125 ns, enough to squeeze 
in this overhead and not cause further 
wait states. 

Because row addresses can be com- 
pared during the same time interval as 
the address decoding and generation of a 
memory request or select, this action 
does not increase cycle time. The result 
is a regular cycle speed with one (or two) 
wait states for random memory accesses 
at different row addresses and fast, no- 
wait-state cycles for localized accesses 
at the same row address. Depending on 
the program, execution speed can in- 
crease 5% to 20%. 

The AmPAL22V 10 also acts as a tim- 
er and arbiter for the extended-page- 
mode and refresh cycles (Figure | and 
Table 2). Its refresh-cycle counter runs 
at 16 zs. But the DRAM controller does 
not perform a refresh every time it re- 
ceives a request: because the page-mode 
cycle can be extended up to 75 ps to 
allow the most CAS-only fast cycles, 
refreshes at 16-js intervals would dis- 
rupt operation. Instead, the arbitration 
scheme is designed so that, during an 
extended-page-mode cycle, the memory 
controller counts up to four refresh re- 
quests before performing the refresh 
cycles. 


Description of Circuit 
And Boolean Logic 


For the sake of clarity, the circuit 
diagram shows a general-purpose arbi- 
tration interface, not a complete DRAM 
controller. It can be either integral to a 
PAL-based DRAM controller or linked to 
a VLSI DRAM controller, depending on 
the user’s needs. 

For 256K DRAMs, an Am29843 with 
9-bit latches stores a 9-bit row address. 
The arbiter PAL controls the latch-enable 
signal, LE. A 9-bit Am29809 compares 
the stored row address with that of the 
current cycle directly from the processor 
on address lines A9-A17. If the address- 
es match, the Am29809 sends a MATCH 
signal to the arbiter PAL. The arbiter PAL 
also receives the memory-request 
(MREQ) signal from the address-decod- 
ing circuitry (not shown here). 

Of the AmPAL22V10’s two counters, 
the refresh counter is free running and 
generates refresh requests (RFCLK) at 


Processor 


8088/8086 (8 MHz) 
8088/8086 (10 MHz) 
80286 (8 MHz) 


80386 (8 MHz) 
68020 (16 MHz) 
68020 (20MHz) 








Cycle time (ns) 
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TABLE 1. Microprocessor cycle times. 


INPUTS 
MREQ 
MATCH 
TOUT 
RFCLK 
CACK 


OUTPUTS 
CYCFLAG 
SLOG 
FSTC 
RFSC 
RCTO 
RCT1 


ICYCFLAG: = 


* (MREQ + MATCH) 


/SLOC: 
* RFSC * RCT1 * RCTO 


+ /MREQ * /CYCFLAG ‘ /MATCH * SLOC 
“ FSTC * RFSC * RCT1 * RCTO 


+ /SLOC * CACK 


/MERQ * /CYCFLAG * MATCH * SLOC 


(RCT1 * /RCTO + /RCT1 * RCTO) 
* RFCLK 

+ CT* RFSC 
*(/RCT1 * /RCTO + RCT1 * RCTO) 
+ /RFCLK * (/CT + /RFSC */RCT1 


RFCLK * RCTO 
+ CT * RFSC * RCTO 


+ /RFCLK * (/CT + /RFSC) * /RCTO 


= /RFSC 


/MREQ * CYCFLAG * RCT1 * RCTO 
+ ICYCFLAG * /TOUT * (RCT1 + RCTO) 


/MREQ * CYCFLAG * SLOC * FSTC 


; Assert for RAV 

; Unassert based 

; on 3 conditions 

; Start extended 

, cycle 

; Row address not 
; matched new 

; extended cycle 

; Hold tilt complete 


; Match CAS only 
; fast cycle 


; Count up 
; Count down 


; Hold count 


; Count up 
; Count down 
; Hold count 





TABLE 2. Listing of equations for arbiter. 


fixed 16-ys intervals. The counter for 
the extended-page-mode cycle is con- 
trolled by the arbiter PAL, which starts 
the counter by sending a signal (CYC- 
FLAG). On completion of the 75-:s tim- 
ing count, the counter sends a time-out 
signal, TOUT, to the arbiter, forcing it to 
complete the current cycle and remove 
the CYCFLAG signal. If the arbiter re- 
quires a new extended-page-mode cycle 
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at a different row address before this 
timeout, it resets the counter by remov- 
ing the CYCFLAG signal. So the counter 
starts when CYCFLAG is high and stops 
or resets when CYCFLAG is low. 

The arbiter’s operation is quite sim- 
ple. It uses registers SLOC, FSTC, and 
RFSC as flags to indicate a cycle request 
to the memory controller. SLOC requests 
the initiation of an extended-page-mode 
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cycle. This new cycle continues, even 
after completion of a data transfer, as 
long as CYCELAG is high. FSTC requests 
the initiation of a fast CAS-only cycle at 
the same row address at which an ex- 
tended-page-mode cycle is in progress, 
meaning that CYCFLAG must be high for 
the FSTC signal to be asserted. 

The third RFSC requests the initiation 
of a refresh cycle. All cycle requests for 
new slow extended-page-mode cycles, 
fast CAS-only extended-page-mode cy- 
cles, and refresh cycles are arbitrated on 
the basis of inputs MREQ and MATCH, 
and status feedbacks of CYCKLAG and 
(RCTO, RCT), a counter to keep track of 
number of refresh requests. A new cycle 
request is arbitrated only when no cur- 
rent cycle request is active, indicated by 
SLOC, FSTC, and RFSC being low. 

When CYCELAG is low, the DRAM 
controller should complete the existing 
extended-page-mode cycle (usually by 
deuctivating both RAS and CAS signals to 
the memory). As shown in the Boolean 
equations. CYCFLAG goes low if at least 
one of the following three conditions is 
encountered: 





1. Extended-page-mode cycle timeout 
(TOUT) high. 
. Refresh count up to four (RCTO = | 
and RCTI=1). 
3. New cycle at a different row ad- 
dress MREQ and MATCH. 


tu 


This allows arbitration to begin afresh 
for a new cycle. Because CYCFLAG is 
generated for each new extended-page- 
mode cycle, it is also used as a latch- 
enable signal for the 9-bit row-address 
latch. 

The counter made by register RCTO 
and RCT! remembers refresh requests 
made during an extended-page-mode 
cycle. It counts up by one upon receiv- 
ing each RFCLK, to the maximum of 
four, it counts down by one when a 
refresh cycle is executed. An extra regis- 
ter, CT, monitors refresh-cycle execu- 
tion by detecting the rising edge of the 
refresh-cycle request RFSC. CT and RFSC 
low indicate the refresh cycle’s comple- 
tion, and this is used to decrement 
counter RCTO, RCT1. The refresh cycles 
are repeated, and no memory cycles are 
allowed, until this counter is reset to 
zero. O 
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By using software to define programmable 
logic devices as run-length-limited encodel 
decode systems, you can design disk-drive 
systems that have 50% more data-storage 
capacity than drives that implement the 
MFM < code. 





Arthur Khu and Rudy Sterner, 
Advanced Micro Devices Inc 


When you're designing a disk-drive system, you can 
implement run-length-limited (RLL) 2,7 encoding/de- 
coding circuitry in your design by using only three 
programmable-logic devices (PLDs) and two shift reg- 
isters. You design the encoder and decoder as state 
machines and use the timing diagrams to determine 
what the timing and control signals must be. 

To create a disk controller with encoding/decoding 
features, you can use three AmPAL22V10 PLDs and a 
disk controller such as the Am9580/Am9582 chip set 
(Fig 1). The Am9580 hard-disk controller and the 
Am9582 disk-data separator perform all the general 
disk-control functions. The PLDs have appropriate 
architectures for implementing the encoding and decod- 
ing state machines. Further, you can reprogram the 
PLDs to implement higher density ratios for data 
encoding, and you can increase your system’s speed 
simply by using faster PLDs. 

An RLL code is a code in which the number of zeros 
between ones—the run length—is definite. The “2,7” 
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PLDs Implement Encoder/ Decoder 
for Disk Drives 





designation means that the code for the binary data 
string has at least two and at most seven zeros separat- 
ing the ones. RLL codes increase the density of data 
stored on a disk by reducing the number of recorded 
pulses (ones) necessary to represent a given amount of 
data. This reduction allows the disk-drive circuitry to 
pack the ones closer together, increasing the amount of 
data on the disk. 

In comparison with the (de facto) industry-standard 
approach, MFM, the RLL 2,7 code increases by 50% 
the amount of data you can store on a disk drive (see 
box, “RLL 2,7 code vs MFM code”). In addition, RLL 


DRIVE 
CONTROL 


HARD-DISK 
CONTROLLER 
Am9580 


DISK-DATA- 
SEPARATOR 


CONTROL | pisk.DATA 


SEPARATOR 
Am9582 


RLL-PLD 
SYSTEM 





Fig I—A complete disk controller requires only two VLSI ICs, a 
PLD-based encoding/decoding system, and a drive interface. The 
hard-disk controller and the disk-data separator provide generic 
disk-drive control, and the PLDs provide RLL 2,7 encoding and 
decoding, which increases disk storage capacity. 
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TABLE 1 — RLL 2,7 CODING RULES 


DATA | 2,7 CODE 


10 1000 
WW 0100 


000 100100 
010 001000 
O11 000100 
0010 | 00001000 
0011 | 00100100 





2,7 decoding circuitry recovers quickly from code-de- 
tection errors. 

As Table 1 shows, RLL 2,7 encoding circuitry trans- 
lates seven data strings into 2,7 code. You can break 
any binary non-return-to-zero (NRZ) data string into 
combinations of the seven data strings. To obtain the 








(a) NOTE: STATES 0, 3, 4, AND 5 ARE USED FOR ADDRESS-MARK GENERATION 
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decoded data string, the circuitry matches the 2,7 code 
patterns with the seven 2,7 code strings in Table 1. 
Because 2,7 code strings have variable lengths (they 
can be 2, 3, or 4 bits long), your design will need control 
logic that controls the output from the encoder/decoder 
as translation takes place. Encoding and decoding state 
machines (Figs 2 and 3) implement this control logic 
from the code in Table 1 (see box, “Convert RLL 2,7 
code to a state machine”). The encoding state machine 
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Fig 2—The encoder state machine (a) describes the translation of input data into RLL 2,7 code. The first two cycles in the encoding of the 
data string 00101100 (b) demonstrate how the encoder determines the correct code by examining both the bit to be encoded and the next two 


(look-ahead) bits. 
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During decoding, circuitry matches the 
RLL 2,7 code patterns with the seven RLL 
2,7 code strings to obtain the corresponding 
decoded data string. 





produces two 2,7 code bits for each data bit received. 
The decoding state machine, on the other hand, pro- 
duces one data bit for every two bits of 2,7 code. The 
clocking scheme in these encoding/decoding state ma- 
chines is simpler than the familiar table-look-up meth- 


od, which requires suspended operation during encod- 
ing and decoding. 

You can implement both of these encoding/decoding 
2,7 state machines with one PLD device (IC,) and two 
shift registers (INREG and OUTREG) (Fig 4). To 
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CONTENTS 
OF 4-BIT 
INPUT 
SHIFT 
REGISTER 
INREG 


SHIFT IN TWO NEW CODE 
BITS BEFORE GOING 
TO CYCLE S; 
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Fig 3—This decoder state machine (a) represents the translation of RLL 2,7 code into an output data stream. To determine the correct output 
data, the decoder examines four code bits—two bits to be decoded and two look-ahead bits, as shown in the example in 6. 
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Because the RLL 2,7 code contains two bits 
for every one bit of data, the timing cir- 
cuitry divides the clock signal for the coded 
data, producing a data clock. 








synchronize the encoder/decoder with the hard-disk 
controller, you use two other PLDs (IC, and IC;) to 
provide clock-generation and address-mark-control 
logic. IC, is a AmPAL22V10, which has sufficient 
capacity to implement the two state machines, IC,, also 
an AmPAL22V 10, utilizes the PLD’s large capacity and 
programmable output cells to implement the address- 
mark-control circuitry. An AmPALI6HD8 (IC3), which 
is sufficient for implementing the clock circuitry and the 
random logic, completes the design. 

To understand the state-machine implementation of 
the RLL 2,7 encoder, consider the state machine in Fig 


RLL 2,7 code vs MFM code 


Although the RLL 2,7 and MFM 
coding methods can both in- 
crease a disk drive’s capacity, 
RLL 2,7 code is more compact. 
A disk drive that implements 
the RLL 2,7 code can, therefore, 
store 50% more data than can a 
drive that implements the MFM 


code. DATA 


C CLOCK PULSE 
+1 0011 








2. The encoding state machine begins in state zero; the 
first four bits of the data to be encoded are in the shift 
register INREG. For the data stream in the figure, the 
encoder, beginning in the first cycle (Ty) reads the first 
bit in the stream as 0 and sees that the next two bits 
(the look-ahead bits) in INREG are 0 and 1, respective- 
ly. According to the state diagram, the encoder pro- 
duces a 00 output because, as Table 1 shows, input data 
starting with 001 translates to a character string that 
starts with 00. The encoder then enters state 2, shifts 
the encoded 0 out of INREG, and shifts in the next 
(fifth) bit of the data to be encoded. 


DATA . 1100110011 


MINIMUM FLUX- 
TRANSITION SPACING 


0011 


CODE 0100 00100100 00100100 


On the magnetic medium in 4 
the disk drive (hard-disk or flop- 
py-disk drives), binary data ap- 
pears as a change in flux (repre- 
senting a one) or as no change in 
flux (representing a zero). Be- 
cause the disk density is limited 
by the minimum distance be- i a 
tween flux transitions, the maxi- 
mum data density depends on 
how the data is encoded. 

MFM is an RLL code with the 
designation 1,3;1,2;1. RLL 2,7 
code (its full designation is 
2,7;1,2;3) allows a minimum of 
two zeros between each one, and 
MFM code allows a minimum of 
one zero. RLL 2,7 code can 
store as much as 50% more data 
in a given number of flux 
changes than can MFM code; 
therefore, RLL 2,7 code can 
store as much as 50% more data 
on a given section of magnetic 


occupies. 


medium. 


Consider the example in Fig 
A. The data stream 1100110011 
is represented by eight transi- 
tions in MFM code, and by five 
transitions in RLL 2,7 code. 
When the disk drive uses the 
minimum distance between tran- 
sitions to record the RLL 2,7 


| = a = AT 150% DENSITY OF MFM 


Pradddt ttbt ttt tt ti 


OS 


Fig A—Data storage that’s 50% more dense is the principal advantage of RLL 2,7 code 
over the de facto standard MFM code. Because it needs fewer transitions to describe 
data, the RLL 2,7 code takes up only 67% of the disk storage space that MFM code 


code, the RLL 2,7 code takes 
67% of the space that the MFM 
code takes, so it has space avail- 
able to store 50% more data. In 
most applications, the actual 
storage increase is between 35% 
and 40%, because some disk 
space is reserved for sector and 
data-field markers. 





ra | Monolithic rat Memories a | 


PLDs Implement Encoder/Decoder for Disk Drives 








In the second cycle (T;) of the encoding process, the 
encoder sees that the data bit is 0 and the two look- 
ahead bits are 1 and 0, respectively. According to the 
state diagram, when the encoder is in state 2 and sees 
010, its output is 00. As before, the encoder then moves 
to the next state (in this case, state 0) and shifts a new 
bit into INREG. The rest of the encoding process 
proceeds similarly. 

The decoding process is similar to the encoding 
process. The decoder, which is described as a state 
machine (Fig 3), accepts two input bits in RLL 2,7 
format and sees the next two coded bits as look-ahead 
bits. In contrast to the encoder, the decoder produces 
one output bit for every two input bits. 

To implement this encoder and decoder circuitry with 


Am9582 
DISK-DATA 
SEPARATOR 


OUTREG 


AmPAL22V10 
RLL 2,7 


AME ENCODER. 


INREG 
SHIFT IN 


ORIVE CONTROL 


COMPLT 


PLDs, you can use PLD-design tools such as CUPL 
from Personal CAD Systems Ine (San Jose, CA) and 
Abel from Data I/O Corp (Redmond, WA). These 
software tools include syntaxes that you can use to 
describe state machines as well as general logic equa- 
tions. 

To control the rate at which IC, (in Fig 4) receives 
data and code, IC, and IC; implement the timing signals 
shown in Fig 5. Three signals (Read, Write, and 
Code—Clock) from the hard-disk controller and disk- 
data separator form the basis of the timing signals. 

First, the timing circuitry produces a clock signal, 
Rd_Clk, from the Code_Clock signal that originates at 
the disk-data separator’s FDDAM output. Because the 
RLL 2,7 code contains two bits for every one bit of 


DRIVE 
INTERFACE 


AmPAL22V10 
ADDRESS-MARK 
GENERATOR 
DETECTOR 


RO | RO CLK] 


DATA CODE 


Am9580 
HARD-DISK 
CONTROLLER ICs 
AmPAL16HD8 
GLUE LOGIC 


BINARY DATA 





RO CLK 


CONTROL SIGNALS 


2.7 CODE TO FROM DRIVE INTERFACE 


Fig 4—The encoding/decoding circuitry includes three PLDs. IC; implements the encoding and decoding state machines, IC, monitors IC; 
and provides special timing and input signals. IC, implements glue logic and timing signals. 
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Before the disk-drive system can decode data 
on the disk, it must synchronize itself with 
the disk’s data clock to find out when the 
data bits begin. 


data, the timing circuitry divides the clock signal for 
the coded data (Code—Clock), producing a data clock 
signal, Rd_Clk. The timing circuitry then uses Code— 
Clock and Rd_Clk to control the timing of shift- 
register control signals Shift_Out and Shift_In. 

To obtain the equations you need to program the 
PLDs, examine the timing diagrams. The timing dia- 
grams show that the state-machine design requires 
only a few timing signals to implement the controller. 





Because the timing circuitry loads and shifts data 
produced by IC, at the points indicated on the timing 
diagram in Fig 5, the only clock signal that IC, requires 
to code or decode data in INREG is the Rd_Clk signal. 

To control the transfer of data and code, the Shift_In 
signal latches data or code into INREG, and the 
Shift_Out signal controls the output of OUTREG. For 
example, when encoding data, the encoder produces 
two bits of code on each rising edge of Rd_Clk. Because 





Convert RLL 2,7 code to a state machine 


You can use a simple algorithmic 
procedure to convert a code 
table to an encoding state ma- 
chine. For each row of the code 
table, you create a simple 2- 
state expression for the conver- 
sion of the data into the code. DATA TO 
Then you combine the expres- CODE 
sions into one state machine for 
the entire table. 

For the code in Table 1 of the 
accompanying article, you con- 
sider each data bit and the asso- 
ciated pair of bits of RLL 2,7 
code. For example, the first row 
of the table contains two data 
bits: one, which is associated 
with the code 10, and zero, 
which is associated with the 
code 00. The state machine for 
this row begins in state zero 
(which is arbitrarily assigned) 


1 


t 
ALL 2,7 CODE 


LOOK- 
ie oe 


(a) STATE TRANSITION FOR 
2.7 DATA ROW 10 








0) \), 


(b) DATA ROW 10 IS 
COMPLETELY CODED 
BY THIS STATE 
MACHINE 


(c) ENCODE DATA ROW 11 
USING THE SAME PROCEDURE 


(d) oul THE STATE 
MACHINES FOR DATA 
ROWS 10 AND 11 


Fig A—Creating a state machine from a code table is a 2-part process. First, you 
create a simple state machine for each row in the table, and then you combine the state 
machines into one state machine for the whole table. 


and changes state when it sees 
that the first data bit is a one 
and the next bit (the look-ahead 
bit) is a zero. This change of 
state, which appears graphically 
in Fig Aa, results in the out- 
put 10. 

Note that the state machine 
must evaluate the look-ahead 
bit. If this bit is a one instead of 
a zero, the input data will be 11, 
corresponding to the second row 
of the table, so your state ma- 
chine must generate a 01. For 


this code table, no more than 
two look-ahead bits are neces- 
sary for encoding any data bit. 

If the state machine is in state 
one, and the input data is zero, 
the state machine produces a 00 
output and returns to state zero 
(Fig Ab.) Note that the data bit 
encoded in this state is the look- 
ahead bit from the previous 
state and that no look-ahead bit 
is necessary for encoding be- 
cause the zero is the last bit in 
data row 10. 


You use this procedure to cre- 
ate state machines for the other 
table rows as well. Fig Ac, for 
example, contains the state ma- 
chine for data row 11. Because 
all the state machines have the 
same beginning state (state 
zero), you-can combine them to 
form the complete encoding 
state machine. Furthermore, the 
state machines may share other 
states, as shown in the combina- 
tion of the two data rows 
(Fig Ad). 
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Cn ae 


the Load signal is high at the same time that Rd_CIk is 
high, the bits are loaded into the shift register. Because 
the frequency of Shift-Out is twice that of Rd_Clk, 
Shift_Out shifts beth encoded bits out before the next 
encoded bits are loaded. Shift_In presents the next 
-data bit to the encoder at the same time that the second 
encoded bit is shifted out, starting the next encode 
cycle. 


READ DECODE 


el Ee oy 


CODE CLOCK 
Pe ca ia TWO CODE BITS 
RO CLK 
LOAD | ( 
SHIFT OUT | IN | \ [X 


LOAD ONE DECODED "DATA BIT 


READ 


SHIFT IN TWO NEW CODE BITS 


i pe LL a A 


INREG AND OUTREG LOAD WHEN LOAD ANDO SHIFT ARE HIGH; 
THEY SHIFT WHEN LOAD IS LOW AND SHIFT IS HIGH 





SHIFT IN 


LOGIC EQUATIONS: 
RO. CLK := 


LOAD = READ+WRITE*RD_CLK 


ON Am9582 DISK-DATA SEPARATOR 


HARD-DISK CONTROLLER 





RD.CLK (A REGISTERED SIGNAL FROM IC2) 
SHIFT. IN = READ*:CODE .CLOCK + WRITE*RD..CLK 
SHIFT .OUT = READ* RD.CLK+WRITE*CODE CLOCK 


CODE . CLOCK : CODE-RATE CLOCK SIGNAL FROM FDDAM PIN 


READ,WRITE : READ-WRITE MODE SIGNALS FROM Am9580 


Before the disk-drive system can decode data on the 
disk, it must synchronize itself with the data clock from 
the disk and find out when the data bits begin. To assist 
the circuitry in synchronization and initialization, you 
can place synchronization signals, as well as a marker 
indicating the beginning of data, at the beginning of the 
RLL code. When the RLL circuitry reads these pat- 
terns, it’s ready to decode RLL data. 


SSS SSS SSS 


WRITE ENCODE 


i i 3 


CODE CLOCK 
ea ENCODE ONE DATA BIT 
RD CLK 
ior ool LT Ll 
wa 2-BIT CODE 
SHIFT OUT Pee ie 
SHIFT OUT SECOND BIT 
WRITE [ 
SHIFT IN ONE NEW By ee 
SHIFT IN | | | | | 





AD_CLK : DATA-RATE CLOCK DERIVED FROM CODE. CLOCK 


SHIFT_IN : COMBINATORIAL SIGNAL FROM IC2 
SHIFT_OUT, LOAD : COMBINATORIAL SIGNALS FROM IC3 





Fig 5—The timing signals from IC; and IC; control the loading and shifting of code and data in the INREG and OUTREG registers. Note 
that the clock and shifting signals for the cuded bits have twice the frequency of those for the data bits, a situation that corresponds to the 21 


density ratio between code and data. 
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If the decoder detects any pattern before it 
detects the address marker, the circuit resets 
itself and begins the initialization sequence 
anew. 


0 DETECTED CODE RECOVERS 


COMPLETELY 


STATE OF 
DECODING 
MACHINE 


ERROR 
RECOVERY 


Fig 6—The ability to recover from an error is a useful feature of 
RLL 2,7 code. An error at cycle E, results in incorrectly translated 
data until the machine recovers fully. 


The synchronization field comprises a series of pat- 
terns that allow the phase-locked loop in the disk-data 
separator to synchronize to the frequency of the incom- 
ing data, These patterns represent the maximum fre- 
quency input. For RLL 2,7 code, that input is 100100, 
because this code has the fewest permissible zeros 
between ones. Using the highest possible frequencies 
minimizes the synchronization time, so the patterns are 
completed quickly. 

A marker that follows the synchronization field indi- 
cates the beginning of data. This marker, the address 
‘mark, must be distinct from all other code words. For 
example, you can use a pattern that violates the code 
rules, such as a string of zeros (this string is called “de 
erase” because it removes all flux transitions from the 
recording medium). Alternatively, you can use a pat- 
tern that obeys the code rules but is not a defined code 
pattern, In the examples in this article, the pattern 
00000100 identifies the beginning of data. Although this 
pattern is not a defined code word, it doesn’t violate the 
RLL 2,7 code rules as long as it’s preceded by no more 
than two zeros in a row. 

When the 100100 pattern is synchronizing the circuit- 
ry, the circuitry produces the Rd_Clk signal, which is 
in phase with the pattern. In order for the decoder to 
operate properly, the Rd_Clk signal must rise with the 
first one in the pattern and fall with the second one. 
However, because of the repetitive nature of the pat- 
tern, the circuitry could produce a falling edge of 
Rd_Clk on the first one in the pattern and a rising edge 
on the second one, in which case the circuitry would not 








be in synchronization with the beginning of the data. 

The encoding circuitry resolves this synchronization 
problem by inserting the pattern “0100” eight times 
between the synchronization field and the address 
mark. The phase-locked-loop system locks onto the 
0100 pattern, and the Rd_Clk control circuitry in IC, 
sets itself to decode the data correctly. 

To put the synchronization and marker data into the 
RLL 2,7 code on the disk, you must design the RLL 2,7 
encoder to produce the signals. When the signal WG 
(from the hard-disk controller IC;) goes high, the data is 
encoded to all zeros. The encoder translates the zeros 
as the synchronization pattern 100100, which is then 
stored on disk. 

Next, the address-mark-control (AMC) signal from 
the hard-disk controller sets an internal latch in IC., 
producing the output Am_Latch, which forces IC; to 
put ones in the INREG register. The encoding circuitry 
codes the ones as the string 0100. After the eighth 0100 
pattern, IC; sets the Complt signal high. On sensing 
Complt, IC, writes the address mark. 

The encoding circuitry must have the first four data 
bits in INREG by the time the address mark is written. 
IC, writes two patterns for the address mark. At the 
start of the second address-mark pattern, IC, sets AMF 
(Address Mark Found) High and resets Am_Latch low. 
The assertion of AMF signals the hard-disk controller 
to begin shifting data into INREG for encoding. Be- 
cause four Rd_Clk cycles occur while the address-mark 
pattern is being written, the first four bits are shifted 
into INREG by the time the second address mark is 
written. 

The decoder circuit has two safeguards that prevent 
it from identifying the address-mark pattern incorrect- 
ly. First, the decoder must detect at least five 0100 ~ 
patterns before it acknowledges the address-mark pat- 
terns. Second, if it detects any pattern before it detects 
the address mark, the circuit resets itself and begins 
the initialization sequence. 

IC; implements these two safeguards by monitoring 
the RLL code that IC; decodes. When the hard-disk 
controller asserts the RG and AMC signals, IC; sets 
Complt and Am_Latch low. IC; sets Complt high when 
it detects the fifth consecutive 0100 pattern, enabling 
IC, to detect the address mark. If the input to IC, is 
anything other than a 0100 pattern or an address mark, 
IC, sets Complt low, and the initialization begins anew. 

If IC, successfully detects the address-mark pattern, 
it sets AMF high. The assertion of AMF causes IC, to 
set Am_Latch high, thus enabling the output of OUT- 
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2EG to send decoded data to the hard-disk controller. 
Am_Latch remains high as long as RG is high. 

RLL 2,7 encoding also provides for error recovery. 
Whenever a disk-drive system reads code from a disk, 
she read/write heads or the transmission cables can 
xause transmission errors. One of the properties of 
RLL 2,7 code is that any single-bit error (for example, a 
coded one detected as a zero) will correct itself after a 
run of at most 16 correctly detected bits. 

In short, whenever a 2-bit pattern doesn’t match any 
of the expected patterns for a particular state of the 
decoding circuitry, the decoding state machine returns 
to state zero and generates a zero as a translation for 
the erroneous code bits. Then the decoder shifts the 
next two code bits into the INREG register and contin- 
ues decoding from state zero. 

In such cases, the decoding state machine can cor- 
rectly decode coded data, but it may not recover 
immediately upon returning to state zero. As Fig 6 
shows, although the code bits that the decoder detects 
may be valid, the decoded data is incorrect because the 
error has forced that state machine into the wrong 
state. In this example, the decoder doesn’t recover until 
6 code bits later (in cycle E6), when it again falis into 
the correct state and accurately decodes the data. EDN 
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Mixing Data Paths Expands 
Options in System Design 





Chip designers are creating powerful CPUs and peripherals 
with 16- and 32-bit parts. Mixing these with 8-bit parts 
overcomes limitations imposed by established designs, 
incomplete families, and software incompatibility. 





by Mark S. Young and 
James R. Williamson 


Integrating 16- and 32-bit peripherals and CPUs 
into 8-bit designs, at the simplest level, means 
separating the control and data paths from new 
peripherals and the systems. Mixing different data 
path widths and control protocols, however, makes 
possible major improvements in function, perfor- 
mance, and cost. 

The price/performance curve of VLSI chips, for 
example, allows designers to obtain more and bet- 
ter functions for the same amount of money every 
year. Alternately, the functionality of a device can 
remain constant while the price falls. 

Moreover, these new devices with wider data paths 
can extend the life of older designs. For example, 
many of the most popular personal computers today 
use the 8088 microprocessor and, therefore, are con- 
strained to an 8-bit data path. Designers of add-on 
accessories for these personal computers prefer the 
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newer 16-bit peripherals. These peripherals will let 
users preserve their software investments, improve 
performance, and stave off obsolescence. 

Mixing different data path widths can also enhance 
new designs. For example, it is less expensive to use 
an 8-bit bus in a new design because the memory 
requirements are generally cheaper. Only half as 
many dynamic RAMs are necessary for the same 
number of kilobytes of memory. In addition, an 
8-bit bus needs much less control and support logic. 
Designers can mix smaller data path peripherals with 
wider data path CPUs. This allows them to introduce 
systems based on the newer, more powerful 32-bit 
CPUs even before 32-bit peripherals are available. 

Designers can use this mixing method to obtain 
wider data paths from existing designs until a new 
system design is warranted. They can also use parts 
in unexpected applications. For example, cost- 
conscious terminal manufacturers might want to use 
the Am8052/8152A chip set (the 8052 is an advanced 
CRT controller and the 8152A is a video system con- 
troller) in new terminals based on the relatively 
inexpensive 8051 microprocessor. Mixing the 8-bit, 
single-chip microprocessor with the 16-bit CRT con- 
troller allows designers to maximize the cost/perfor- 
mance ratio of the terminal. 

Mixed data path widths can improve bus utiliza- 
tion as well. A 16-bit peripheral in a 32-bit system 
only occupies half the data bus for data transfers. 
If the designer mixes the data paths correctly, how- 


ever, the 16-bit peripheral could transfer data as 
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COMMENTS 


WAIT UNTIL PERIPHERAL TAKES BUS 
MAKE SURE MEMORY ACKNOWLEDGE |S 
NOT ASSERTED 


READ IN UPPER BYTE ; A0=1; 
WAIT FOR MEMORY ACKNOWLEDGE 
ISSUE RO STROBE 


WAIT FOR MEMORY ACKNOWLEDGE 
TO GO AWAY. 


READ IN THE LOWER BYTE, AO - 0, 
WAIT FOR MEMORY ACKNOWLEDGE. 
ISSUE RO STROBE. 


STROBE IN DATA TO PERIPHERAL 
OEASSERT WAIT. 
WAIT FOR SUCESSFUL READ (0S). 


The state Mow control diagram for a bus master read 
operation illustrates the control sequence employed 
by the 8/ 16-bit bus control logic. 





32-bit chunks and improve bus efficiency by 100 per- 
cent for that peripheral. 

Two central concerns stem from mixing devices 
that communicate over different-sized buses. The 
first problem results when two devices communicate 
ona ‘‘common’’ data bus. Consider, for example, 
a 32-bit system utilizing 8- and 16-bit peripherals. 
Overcoming the mismatched data paths requires 
some form of controlled multiplexing/demultiplexing 
of the different data paths. In addition, extra con- 
trol signals for partitioning the 32-bit word into 8-, 
16-, and 32-bit chunks may be required. 

Many 16-bit CPU-based systems that use 8-bit 
peripherals normally use just the lower 8 bits of the 
data bus to transfer data to and from the peripheral. 
This method does not work in systems using 16-bit 
peripherals and 8-bit CPUs, however, and it tends 
to break down in systems with 8-bit peripherals hav- 
ing bus master capability. 

A bus multiplexing method involves multiple 
transfers when taking data from or adding data to 
a mismatched data bus. For example, before a 16-bit 
peripheral can transfer data over an 8-bit bus, the 
16-bit data must be divided into two 8-bit chunks. 
It is then transferred sequentially. First, the lower 
8 bits are transferred out on the bus. Then, in the 
next transfer cycle, the upper 8 bits of the 16-bit 
word are sent out. The major difference in the oppo- 
site case—a bus read operation from an 8-bit bus 
to a 16-bit device—is that the first byte read from 
the system must be latched. Once the second byte 
has been fetched, the 16-bit peripheral reads in the 
assembled 16-bit (2-byte) word. Additional provi- 
sions may be needed when the 16-bit peripheral only 
wants to access a single byte. 


The other major problem in mixed data path 
transfers is the actual data read/write operation. The 
nature of the multiple transfer forces designers to 
guarantee that the stretched transfer will occur and 
that it will not be interrupted. Two aspects of stretch- 
ing the transfer cycle from or to tne peripheral illus- 
trate the complexity of this problem. 

The first case, when the peripheral is the bus 
master, is the simplest. A 16-bit peripheral holds its 
data available for what normally would be two com- 
plete bus transfer cycles. This function can be per- 
formed when the transfer acknowledge signal to the 
peripheral is delayed. If the data was latched instead 
of holding the peripheral in a multiple word transfer, 
however, the device could try to send the next 16-bit 
data word and its ‘‘new’’ address. The procedure 
of latching the data and releasing the peripheral 
should not be used, therefore, because it may inter- 
fere with the addressing of the remaining (pending) 
8-bit transfer. 

Whenever a device acts as a bus slave to a CPU 
that cannot access the device’s natural word width 
in a single operation, a different constraint appears. 
The sequence must be set up so the peripheral cannot 
obtain the bus while the CPU is in the middle of a 
slave read/write operation. In a typical system, the 
CPU is the last device in the interrupt queue. It is 
possible for the peripheral to become bus master 
between the first and second read operations and in- 
validate the results of the first read operation in a 
realtime system. This is because an 8-bit CPU would 
have to perform two consecutive read operations to 
examine a 16-bit peripheral control register. 

This function can be handled two different ways. 
If the CPU has a bus lock instruction, as in the iAPX 
family of CPUs, the programmer must use one of 
these instructions before the CPU accesses the 
peripheral. Alternately, the CPU needs to disable 
the arbitration logic while it is performing the unin- 
terruptible access with the 16-bit peripheral. 


Crucial cycle 

The uninterruptible word transfer cycle is crucial 
for maintaining the integrity of the data transferred. 
When either the CPU or a peripheral on the bus 
makes an access using the 8/16-bit control logic, it 
must complete the larger device’s word access before 
relinquishing the bus. If this requirement is not met, 
a transfer’s integrity can be violated easily by some 
other device. This interrupts the transfer, and cor- 
rupts or aborts the multiplexing sequence. 

To illustrate this point, consider a system consist- 
ing of an 8-bit CPU and several 8- and 16-bit periph- 
erals. Assume one of the peripherals is executing a 
block transfer of 16-bit data onto the 8-bit bus. If 
the CPU interrupted the transfer in order to poll the 
peripheral during a half-word transfer, two undesir- 
able events would occur. Either the multiplexing 
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sequence would be damaged irreparably when the 
CPU polled the peripheral, or the CPU would read 
garbage from the peripheral. 

Designing the control interface to allow mixing 
of 8- and 16-bit peripherals requires attention to the 
data and control flow. During a write dperation, the 
data is written out sequentially: the lower byte comes 
before the upper byte (or vice versa). The read oper- 
ation differs only because the data bus is 8 bits and 
because it forgets the last byte transferred; it knows 
the current byte only. Hence, the interface requires 
that one of the bytes be latched until the full 16-bit 
word has been assembled. 

The slave mode of operation works almost the 
same as the peripheral bus master mode. The single 
exception is the slave write operation. When the 
interface is defined, the designer must make a con- 
scious choice about which byte (upper or lower) to 
latch during peripheral read operations (or con- 
versely, slave peripheral write operations). Once this 
decision has been made, the CPU must always access 
the latched data byte first (during a slave write) and 
then access the non-latched byte to complete the 
transfer. This restriction is minor, requiring no extra 
software overhead. It could affect the ease of the 
programmer’s coding if not handled properly, how- 
ever. For example, if the programmer used a com- 
piler to generate the software for the system, extra 
care may be necessary to ensure the compiler gener- 
ates the correct addressing sequence. 

An alternative solution would be to latch both the 
upper and lower data bytes. In this case, however, 
the cost of the interface would increase, as would 
the complexity, with no appreciable gain. The con- 
trol flow in these designs derives from two differ- 
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ent sources: the state control flow itself and the 16-bit 
peripheral interfacing with the 8-bit bus. A state dia- 
gram can be used to specify how uninterrupted word 
transfers will occur and how the upper and lower 
byte address is generated. 

In addition, the specific bus timing of the periph- 
eral and the data bus must be examined to quantify 
the state control flow. These timing specifics also 
provide information on data latching, read/write 
control strobes, and addressing to and from the 
peripheral. The state control flow is divided into four 
operations: bus master read, bus master write, slave 
read, and slave write. 

For a bus master read/write operation from a 
16-bit peripheral device operating on an 8-bit bus, 
four control signals must be generated by the 
8/16-bit control unit: address bit 0 (AO), peripheral 
hold (WAIT), bus read (RD), and bus write (WR). 
The AO line is generated by the 8/16-bit control logic 
to indicate which byte is to be transferred in bus 
master modes only. Otherwise, the AO generated by 
the system is used to indicate which byte is being 
accessed. The WAIT line holds up the peripheral 
during transfers. The RD and WR lines are required 
to indicate successive transfer cycles on the bus, 


Hidden transfers 

The peripheral’s signals will only strobe active 
once because it does not know that two transfers are 
being executed. The slave transfer flows are almost 
identical, except the CPU is generating the bus sig- 
nals and the transfer directions are reversed (ie, a 
bus write goes into the peripheral). 

For this 16- to 8-bit data flow example, the cata 
on the upper byte only needs to be latched when data 


In addition to a state flow 
diagram, a timing diagram 
can be used to describe such 
data read/write operations as 
a master bus read. 
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TRANSLATOR 


The 16-to 32-bit conversation logic diagram indicates 
the complexity of the bus and tunnel logic control. it 
must convert between different signal conventions 
and polarities as well as generate extra functions and 
bus arbitration control signals. 





is being read (as bus master) or written (as a bus 
slave). An interface to handle this operation needs 
to latch data coming from the 8-bit data bus into 
the peripheral, it also needs to act as transceiver 
when the peripheral is sending data out to the system. 
A device with a clocked, tri-state output that has an 
8-bit wide latch in one direction and a tri-state trans- 
ceiver in the other direction would be ideal for 
accomplishing such aa interface. 

The Am2982 8-bit bidirectional [/O port provides 
a good enough match to the logic and allows the 
upper data bus latch and upper data transceiver chips 
to be combined on one IC. It provides two 8-bit 
clocked I/O ports, cach with tri-state output con- 
trols and individual clocks and latch enables. An 
Am2949 bidirectional bus transceiver completes the 
Jogic required for the data path function. 

The state flow control requires logic that can move 
sequentially from state to state, hold in a particular 
state, and be reset or initialized back to a predefined 
state. Depending on the number of states required 
(generally less than 16 distinct states for a design of 
this complexity), a 3- or 4-bit counter should be able 
to solve the problem niccly. 

Considerable bus control logic is required to gen- 
erate the data path flow logic and the bus control 
signals. This is especially true if the peripherals and 
CPUs use different signal conventions (eg, when AS, 
DS, and R/W use address latch enable, RD, and 
WR). Conversion from onc signal convention to 
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another, changes in signal polarity, and provision 
for extra functions (such as generating AQ) require 
alot of logic synthesis ability. Hf the peripheral has 
bus master capability, such additional information 
as bus arbitration controls must be fed into the next 
state determination logic in order to decide what con- 
trol sequence to follow. 


Customized interface minimizes cost 

An 8/16-bit control interface between the Am8052 
CRT controller and an 8-bit CPU provides a good 
example of how customizing a general interface can 
reduce costs. (The CRT controller is designed with 
a 16-bit data interface.) The onboard DMA unit 
fetches data from system memory and the CPU polls 
the CRT coniroller’s internal status and control 
repisters. Because the CRT controller does not 
modify system memory, however, a bus master write 
operation is unnecessary. Thus, there is no reason 
(o generate a system write control signal (WR). 

In addition, the control and display information 
must be aligned on word boundaries. This require- 
ment relieves the 8/16-bit control logic from funneling 
the bytes and performing odd/even byte transfers. 
It also saves control inputs from the CRT controller 
because all transfers are words; that is, no need 
exists for upper and lower data strobes or byte high 
enable inputs. 

The bus master read operations are standard 16-bit 
data transfers divided into two &-bit transfers. The 
CPU’s slave accesses are cither pointer writes (to 
select the desired control/status register) or 16-bit 
data read/write operations. (Pointer write operations 


PERIPHERAL 
SOE 


The state machine and the bus and latch controls 
have to be coupled in order to transfer data between 
the 8- and 16-bit buses. This generalized machine is 
designed with the assumption that the peripheral has 
bus master capability. If this is not the case, the 
design can be greatly simplified. 
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are actually 8-bit operations because only the lower 
8 bits of the data form the register address.) The bus 
master read operation can be represented by a state 
flow diagram or a timing diagram. Conceptually, 
state flow diagrams are easier to understand, but tim- 
ing diagrams usually convey more information. 
Other state flow diagrams can be derived directly 
from the timing diagrams of the CRT controller to 
8-bit interface. 


Simplifications allow synthesis on one device 

Two special conditions must be met in the state 
machine implemented in the 8/16 interface. First, 
before a new transter cycle is attempted (when the 
state machine is waiting in the initial state, SO), 
memory acknowledge (MRDY) must be inactive. 
This prevents interference from the last transfer. 

The second special condition occurs when the 
CRT controller asserts the R/W line to indicate a 
write operation. Although the CRT controller does 
not write data into system memory, when it updates 
the upper 8 bits of the 24-bit address latch the R/W 
line indicates a write Operation (in conjunction with 
AS). The CRT controller is not actually performing 
a System data write, only an address latch update. 
The state machine, therefore, must not start a bus 
sequence if the R/W line is held active low by the 
CRT controller during a bus master operation. 

These simplifications in design allow the CRT con- 
troller to 8-bit CPU control interface to be synthe- 
sized in a single AMPAI.22V10 programmable logic 
array device. In addition, the bus control signals are 
converted from AS, DS, and R/W to RD and WR. 
The minimum CRT controller and bus control sig- 
nals that must be generated are RD, AO, DS, and 
R/W. Although the CRT controller uses DS and 
R/W as inputs during a bus master operation, the 


The logic for contro! and 
data transfer between an 
Am8&052 and 8-bit CPU has 
the control interface 
implemented in an 
AmPAL22V 10. 


BUSACK Sup At 


PAL device must convert the CPU RD and WR sig- 
nals to DS and t/W for slave I/O operations. 

The signals AO and RD are generated by the con- 
trol logic when the CRT controller is performing a 
read access to system. The WAIT (or not READY) 
signal to the CRT controller must also be generated 
by the control logic. The data flow controls require 
six additional controls to load and strobe the latch, 
and to enable transceivers to pass data to and trom 
the 8-bit bus. Theoretically, 4 more bits (outputs) 
are required to represent all the control states needed 
to manipulate the 8/16-bit control logic. This means 
une design appears to need 14 output logic units in 

a PAL device to perform the required task. 

Reducing the 14 output cells to the 10 cells avail- 
able in the PAL. device requires a closer look at the 
timing and output switching functions. The AQ and 
RD control lines are in effeet part of the system: bus 
control and, therefore, cannot be multiplexed easily. 
The DS and R/W lines to the CRT controller are 
also fixed because they must be valid throughout the 
entire transfer cycle as well. 

This leaves 6 of the 10 output logic cells of the 
PAL, device to represent the remaining 10 identified 
control lines. This method of minimization involves 
careful state synthesis, analysis of the signal switeh- 
ing functions during the transfers, and utilization 
of several control pins on the CRT controtter. By 
using the BREQ, BACKI, BACKO, CS, and C D 
inputs to the PAL. device, we can reduce the num- 
ber of unique states required to 8 instead of 13. This 
reduces the number of logic cells required for the 
state machine from 4 to 3 bits. 

At this stage, the design requires seven control sig- 
nals to manipulate the data transfer registers and 
WAIT line. The two latch enables (CEs and CDp) 
on the Am29$2. bidirectional 1/O port: can. be 
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permanently enabled. By controlling the clock signal 
to the latches, the controls required for three pins 
can be reduced to one. The interface control state 
machine will only use the correct side of the dual 
latches on the bidirectional I/O port: 

The Am8052 CRT controller helps considerably 
with its own control bus interface. Two signals 
provided by the CRT controller, TBEN and RBEN, 
switch the data transceivers in the correct direction 
regardless of the type of data transfer (as a bus 
master or bus slave). When the controller is a bus 
master performing a read operation, or when it is 
a bus slave undergoing a write operation, therefore, 
the RBEN signal is strobed to obtain the correct 
polarity. By using this line, two of the. remaining 
six control lines can be eliminated (REN on the 
Am2949 and OEas on the Am2952). Although the 
TBEN line performs a similar function, it does not 
function correctly in a 16- to 8-bit multiplexed 
bus environment. 

Two of the remaining control lines (OE,s on the 
Am2952 and 10 on the bidirectional bus transceiver) 
must be generated by individual cells in the PAL 
device. The two clock enables on the Am2952 are 
permanently enabled. The two Am29582 clocks are 
tied together to minimize the amount of logic re- 
quired in the PAL device used to generate clock 
strobes to the latches. 

This leaves the design with three logic cells and 
four output functions (the WAIT line to the CRT 


controller and the 3 state bits). Careful analysis of 





the state flows and ming diagrams indicates that 
the WAIT line is only asserted in 4 of the 8 states. 
A clever assignment of state numbers to the state 
flow sequence allows the WAIT line to be absorbed 
into the 3 state encoding bits. The logic equations 
for the AmPAL22V10 device can be derived directly 
from the timing diagrams. 

An unusual problem might occur when a periph- 
eral device operates as a bus slave on a smaller data 
bus, such as a 16-bit peripheral to 8-bit CPU. During 
the first slave write operation, the chip select CS is 
enabled by the bus master making the access. No 
actual data—just the data latch—is strobed into the 
peripheral, however. After the first byte of data has 
been written, the second access causes the full 16-bit 
data to be strobed into the peripheral. 

If the designer is using a common CS function to 
both the peripheral and the 8/16-bit control logic, 
the controller logic must be designed not to glitch 
or strobe any of the control lines to the peripheral 
(it must prevent DS, R/W from being enabled, for 
example). For some peripheral devices, glitches on 
the control lines might cause the register to be written 
accidentally onto a register that will be overwritten 
in the next write cyele anyway. With other periph- 
erals this might be a catastrophic event. Many 
devices acting as bus slaves have write recovery time 
requirements (ie, a certain minimum interval between 
conseculive write operations). Glitches on the con- 
trol lines might force the next (and final) write oper- 
ation to be delayed—or cause a violation of the 


The data bus and control 
interface between an 8-bit 
8088 CPU and a 16-bit 
Am9516 DMA controller uses 
an AmPAL22V10 for 


control, and a 74LS161 for 
state sequencing along with 
a bidirectional 1/0 port 
and transceiver, 
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device specifications. Glitches might evade any spe- 
cial addressing/register accessing scheme used in the 
peripheral. This might occur, for example, if the 
slave device requires the user to write the address 
of the register that was accessed immediately before 
the register was written. In this case, glitches or use- 
less control strobes could wreck the sequence. 

The problem can also be solved by using two lines. 
In this solution, one of the lines would go to the 
peripheral device and the other would connect to the 
8/16-bit controller. The chip select to the peripheral 
is activated each time a slave read Gecurs (for both 
upper and lower byte accesses), or When a slave write 
operation occurs and the unlatched 8-bit data is 
being written. The chip select function to the 8/16-bit 
controller is chosen each time the peripheral is 
selected normally (for slave read/writes on both 
upper and lower 8-bit data transfers). This problem 
is bypassed completely when two separate chip select 
functions are used: one for loading up the Am2952 
latch during a slave write/read and one to strobe the 
Am8052 controller into action when it is needed by 
the 8-bit CPU. 


Bus conversion maximizes flexibility 

A data bus and control interface to an 8088 8-bit 
microprocessor and Am9516 16-bit DMA controller 
can be created using four devices: an AMPAL22V10 
for the control block, a 74LS161 counter for the state 
sequencer, an Am2982 bidirectional I/O port, and 
an Am2949 bidirectional transceiver. 

This design incorporates certain simplifications, 
The DMA controler requires word accesses only 
during command chaining and for slave register ac- 
cesses. The 8/16-bit data transfer interface for bus 
master operations (ic, DMA data transfer functions) 
is handled automatically as a programmable option. 
During slave write operations, the first byte output 
to the DMA controller must have an odd address 
and the following second byte an even address. Con- 
versely, during a slave read cycle, the first byte read 
from the DMA controller must be at an even address 
and the second at the next higher odd address. 

Furthermore, for bus master operations, the sys- 
tem must use the latched address line AO (1-A0) from 
the AMPAL.22V10 as its sole AO. Because the logic 
is already available, the system does not have to pro- 
vide this function, LAO now becomes the system ad- 
dress bit O with full 24-mA drive capability. 

Deciding on a means for controlling the funneling 
of the data stream—that is, transforming 16-bit data 
into 8-bit data and vice versa—was the first step in 
deriving this example. As mentioned earlier, simply 
dividing each 16-bit access into two 8-bit data trans- 
fer cycles presents one way of doing this. On out- 
going accesses (16-bit path from the DMA controller) 
during the first eyele, the upper half of the 16-bit 
path is latched while the lower half passes through 
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BEGIN 
{F (RESET) THEN ARESET{ ); 
This section defines the wiggles when the Am9516 1s bus master 


IF (HLDA) THEN ENABLE( ); 
{fF —(/S{2! * HLDA’ THEN BEGIN 
IF (S[1] * /S{0}) THEN 
LAO = /CK ~ BW + /BW * AO” 
ALE + / BW * LAO ~ JALE 
ELSE 
LAO = BW + /BW AO ~ 
ALE + /BW * LAO * /ALE 


(HLDA) THEN 
(CASE) (S{2:0}) 
BEGIN 
1) BEGIN 
RD /RW ~ DS 
A /BW ° /RW * /CK 
WR 7BW ” RW OS 
c /BW ” RW 
WAIT = 1 
ERD: 
2) BEGIN 
RD 7RW ~ DS 
B BW 
A /BW ° /RW 
WR /BW * RW * DS 
C /BW ° RW 
WAIT = /BW 
END; 
3) BEGIN 
RD /RW* OS" B 
B BW ” CK 
A /BW * RD 
WR /BW * RW * DS 
C /BW > RW 
WAIT = BW 
END; 
5) BEGIN 
RD = /RW * DS 
A = /BW* /CK 
WAIT = BW 
END; 
6) BEGIN 
RD 7RW * DS 
A “BW 
END; 
7) BEGIN 
RD = /RW~ OS 
A = RD 
END; 


tt WoW a 


Hut wou 


ENO; 
This section defines the wiggels when the 8088 is bus master 
BEGIN 
LAO= AO * ALE SEL + LAO ° “ALE * SEL 
B = LAO * WR - SEL 
A /ULAQ * WR * SEL 
OS = A + /LAO~ RD” SEL 
C ‘LAO > RO” SEL 
0 =tA0‘ RD’ SEL 
END; 
END. 
This PLPL file implements an interface between the 
8-bit 8088 and the 16-bit Am9S16. 
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Programming the PAL and the counter 


In writing the Programming Language for Program- 
mable Logic (PLPL) file to control the operation of 
the AmMPAL22V10 and the 74LS161 counter, the 
inputs to the PAL device from the counter are as- 
signed SO, S1, and S2, respectively. Then, it is pos- 
sible to apply a “sculptured design” technique to 
the entire timing diagram (see figure in Panel, ‘A 
matter of timing’) by using the Case statement from 
PLPL. By assigning combinatorial equations to only 
one binary partition or column at atime (Case), the 
designer can ignore all other aspects of the design 
for the time being and generate simple equations 
directly from the timing waveforms. 

During clock time T1 of the Am9516’s word read 
cycle the state of the 74LS161 (SO, S1, S2) is cleared 
to 000 by the assertion of address latch enable 
(ALE). LAO is the only output control signal from the 
CRT controller asserted during this period. This sig- 
nal is handled as a special case, however. During 
time T2 of the DMA controller's word read cycle, 
the RD and WAIT outputs from the CRT controller 
must be asserted. This time partition corresponds 
to the state inputs S2,S1, S0=001. Therefore, the 
first Case equations are 


CASE 
BEGIN 
1) BEGIN 

RD= /RW°‘DS ; Transform Control 
; Signals /RW and DS 
‘into Intel /RD 


(S[2:0]) 


WAIT = 1 , Assert Wait 


; unconditionally 
END; 


During time T2 of the DMA controller's byte read 
cycle, A is the only additional output not already 


a tri-state buffer onto the 8-bit bus. During the sec- 
ond cycle, the tri-state buffer is turned off and the 
previously latched half of the data is driven onto the 
bus. On incoming accesses (8-bit path to 16-bit path), 
the process is reversed. 

The control mechanisms that perform this cycling 
depend on the WAIT and R/W signals passing to 
and from the DMA controller, and on the ability 
to enable or disable the latches and transceivers selec- 
tively. The Am2952 bidirectional I/O port was 
chosen because of its dual registers and its flexible 
control. The AmMPAL22V10 device was chosen to 
match the required number of control pins and func- 
tions. Since the complexity of this design requires 
the use of all of the PAL’s I/O pins for control func- 
tions, however, it was necessary to use a 74LS161 
counter to provide the state sequencer function. 


Programming with PLPL 

It has long been the logic designer's ‘‘art’’ to merge 
the often very different concepts and notations of tim- 
ing information with Boolean logic. Yet, the evolu- 


accounted for in the Case statement. This signal 
allows a byte of data to flow through the bidirec- 
tional bus transceiver into the DMA controller 
during byte read operations. Some additional con- 
straints are placed on this signal, however: it must 
only be asserted in time T2 on byte read operations 
(the B/W input) and it must be delayed by a half 
clock period from the rising edge of T2 (CK signal). 
Thus the Case statement becomes 


CASE (S[2:0]) 
BEGIN 
1) BEGIN 
RD= /RW‘DS ; 
A= IBW‘/RW'/CK ; enable the 
, receiver 


WAIT =1 
END; 


__ Finally, by examining the last time T2 elements 
(WR and C) during the DMA controller's byte write 
cycle, the remaining terms in Case 1 are derived. 
With the exception of LAO, the remaining equations 
were developed in the same fashion. Clearly, this 
“sculptured” technique is a very simple and 
methodical means for arriving at the Boolean re- 
quirements for a logic block. 

As the PLPL listing shows, the signal LAO was 
handied slightly differently from the previously dis- 
cussed method. The number of product terms gen- 
erated via the Case statement made this approach 
necessary. The number exceeded the upper limit 
(16 terms) for a programmable logic array. As a prac- 
tical matter, therefore, it was necessary to optimize 
this signal manually. However, it should be noted 
that this step will not be necessary once the fully 
optimized version of PLPL becomes available. 





tion of a syntax to fully express this art has taken 
a long ime. AMD recently developed such a language 
for programming the AmMPAL22V10, however. 

“Programming Language tor Programmable 
Logic,’ or PLPL, allows the designer to specify a 
design using multiple input formats. This specifica- 
tion flexibility supports the variety of design 
approaches necessary to express different design 
problems efficiently. These formats range from sim- 
ple sum-of-products Boolean equations to high level 
constructs. PLPE. also supports the input specifica- 
tions for many types of AND/OR based devices, in- 
cluding all of the current AMD programmable logic 
array and PROM devices. 

PLPL. is block structured, and includes the high 
level language constructs If-Then-Else, Case, and 
For; all familiar to many programmers of the C and 
Pascal languages. Macros, functions, constants, and 
variables may also be used in PLPL. The language 
also facilitates use, clarity, and self-documentation. 

Such current programmable logic technology and 
associated programminy languages as PLPL allow 
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A matter of timing 


The complex AmMPAL22V 10 design used the accom- 
panying timing diagram to correspond to the 
desired waveforms. They are partitioned by the 
respective binary state (or count) from the counter. 
The desired timing requirements during the 
period when the DMA controller is bus master 
appears below. During time 11, address latch 
enable (ALE) is asserted by the DMA controller to 
denote the beginning of the cycle; a short time later, 
an address is driven onto the bus. This address is 
valid at the falling edge of ALE. The control signal 
LAO (latched AO), therefore, must be valid at this 
time, as well. In this phase of the cycle, it must also 
be high to enable the odd byte from memory to be 
loaded into the bidirectional !/O port. In addition, 
the assertion of ALE performs the function of reset- 
ting the 74LS161 counter to 0000 in order to syn- 
chronize the cycle. 
__ During time T2, the DMA controller will assert its 
DS signal. The timing for this signal, in conjunction 
with the R/W signal (asserted in T1) must be trans- 


Am9516 


formed into an 8088-equivalent RD signal. During 
a word read cycle, this RD signal also must be arti- 
ficially negated and then reasserted to accomplish 
a double byte read. At the same time, the DMA con- 
troller must be “parked” in order to multiplex or 
assemble a word. Thus, the WAIT signal is also 
asserted at time T2. During time TW (S2, S1, 
SO = 010), the receiver clock enable contro! signal 
B must be asserted in order to allow the next sys- 
tem clock’s rising edge to strobe the upper byte into 
the bidirectional I/O port. This is accomplished dur- 
ing the next TW period (S2, $1, SO=011). ot 

During the remainder of the word read cycle, RD 
is negated and then reasserted after LAO has been 
forced low to address the even byte. A is then 
asserted to allow both the previously latched upper 
byte and the current lower byte to be driven onto 
the DMA controller's pins. And finally, the WAIT sig- 
nal is negated, allowing the DMA controller to finish 
its read cycle by strobing in the 16 bits of command 
data on its data pins. 


Am9516 Am9516 
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highly organized application-oriented control blocks 
to be formed easily. These tools can conceptually 
raise the designer above the details of the design at 
the logic level and directly translate the necessary 
response characteristics from a timing diagram. 
This approach can be referred to as a ‘‘sculptured 
design’’ technique because it is analogous to the way 
solid stone is formed according to an artist’s image. 
Raw logic can be transformed directly into useful con- 
trol functions from the desired timing information. 
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The AmMPAL22V10 is, in essence, a fuse-program- 
mable gate containing up to 22 inputs and 10 out- 
puts. It can define and program that architecture of 
each output on a pin by pin basis. Thus, the designer 
is free to optimize the design mix between registered 
and combinatorial functions as needed. 

The AmPAL22V10 is programmed by opening fus- 
able links in any or all of its 10 output macrocells, 
as well as in its AND gate array. The AND gate struc- 
ture is very similar to other PAL devices; therefore 
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it allows the same powerful, yet familiar features. 
However, itis the AmMPAL22V10’s 10 output logic 
macrocells that give the designer substantial new 
design freedom. Moreover, at each macrocell out- 
put is a tri-state output buffer controlled by a 
separate output-enable AND gate. 

These macrocells provide the AMPAL22V10’s key 
features. They can be configured to make any or all 
of the I/O pins act either in sequence or in combi- 
nation and have either active-high or active-low char- 
acteristics. Furthermore, the output enables can 
individually control the direction of the pins so they 
act as outputs, inputs, or bidirectional ports. 

A number of trade-offs and limitations are appar- 
ent in a design that so dramatically affects the input 
and output of the system. The most obvious limita- 
tion stems from under utilization of 16-bit periph- 
erals on an 8-bit bus—the speed of all I/O operations 
are cut in half. As a result, bus utilization will 
increase if the 16-bit peripheral represents a signifi- 
cant factor of the bus use. A CRT controller such 
as the Am8052 might use 5 to 10 percent of the bus 
bandwidth for display information when using 16-bit 
I/O. Converting to 8-bit 1/O would double bus use 
to 10 to 20 percent. Another factor that might affect 
the bus usage ts the efficiency of the 8- to 16-bit con- 
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version control logic. If the state machine designed 
to perform the 8/16-bit (or 16/32-bit) conversion is 
improperly designed, extra transfer overhead might 
be introduced. This might mean a sequential transfer 
of two 8-bit values would take twice as long a single 
16-bit transfer. 

The design constraints might limit the use of the 
peripheral to byte-only operations during data trans- 
fers (as in the design using the DMA Am9516 con- 
troller), and slow it down by a factor of two during 
command operations. For such a DMA device as the 
Am95l6, the extra time required for command fetch- 
ing is not usually a significant portion of bus time. 

System designers will have to weigh the cost of 
the extra overhead on a case-by-case basis. The ben- 
fits may well justify these limitations—particularly 
when the bus is self-limiting, but the device charac- 
teristics allow for value-added designs. In addition 
to bus degradation for certain configurations, extra 
logic and design effort are involved. Most interfaces 
outside a system’s immediate family require some 
kind of extra interface logic, however. By manipulat- 
ing the signals and incorporating them into program- 


mable logic devices such as the AMPAL22V 10 device,, 


therefore, most of this logic is free. 
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Hanging a user-customized macrocell at each output of an LS!/-level 
fuse-programmable array logic gives it the flexibility of a gate array. 
The result: Less hardware is needed for high-level functions. 





Designers of semicustom ICs have long appre- 
ciated the in-house programming capabilities of 
programmable array logic devices. But gate arrays, 
with their LSI densities and wider opportunities for 
customization, are an inviting alternative, even if 
they must be sent out for metal-mask processing. 
Which approach should 
the designer use? 

An agonizing choice be- 
tween convenience and 
versatility is no longer 
necessary. The first fuse- 
programmable chip capa- 
ble of implementing LSI 
circuits blends the advan- 
tages of both types. The 
result 18 more cost- 
effective designs with 
higher-level functions 
than previous pro- 
grammable logic devices 
offered and a shorter 
turn-around time than is 
possible with gate arrays. 

The new chip, the 
AmPAL22V10, can be 
thought of as a fuse- 
programmable gate array, 
since at twice the density of previous programmable 
array logic devices, it replaces the equivalent of log- 
ic circuits having 500 to 1000 gates (see “Blending 
Programmable Logic and Gate Arrays,” p. 97). In 


Brad Kitson, Section Manager 
David Laws, Managing Director 
Warren Miller, Section Manager 
Advanced Micro Devices Inc. 

901 Thompson Place, PO Box 3453 
Sunnyvale, Calif. 94088 





Reprint with permission from ELECTRONIC DESIGN 


fact, because of its flexible, programmable architec- 
ture, the chip spans the breadth of standard SSI, 
MSI, and present-generation PAL devices, often ac- 
complishing what could not be practically done with 
several such chips. 

Designers program the 22V10 by opening fusable 
links in any or all of its 10 
output macrocells, as well 
as in its AND-gate array, 
developing the needed log- 
ic functions. Though the 
AND-gate structure 
makes the unit similar to 
other PAL devices, the 
chip’s 10 output logic mac- 
rocells afford a substan- 
tial new degree of design 
freedom (Fig. 1). 

What’s more, at the 
chip’s output pins are 10 
three-state output buff- 
ers, each fed by a mac- 
rocell and controlled 
through a separate out- 
put-enable AND gate. 

The macrocells provide 
the 22V10’s key features: 
They can be configured to 
make any or all of the I/O pins act either sequen- 
tially or combinatorially and have either active 
highs or active lows. Additionally the Output En- 
able can individually control the I/O terminals to 
act as output pins, input pins, or bidirectional ports. 
Thus the designer can readily change the chip’s ar- 
chitecture. 

Within the array, each AND gate is fed by the 
true and complementary levels of the chip’s 12 input 
lines, one of which doubles as a clock line. Also, each 
AND gate is fed by the true and complementary 
levels of the 10 feedback outputs from the output 
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logic macrocells. 

Each of the AND gates therefore has 44 inputs, 
and there are a total of 182 AND gates in the array. 
The result is over 5800 fuses that can be pro- 
grammed to perform the desired function. 

A total of 120 of the input AND gates are distrib- 
uted to drive 10 OR gates, forming the logical part 
of the AND array. In addition, one AND gate is 
assigned to control each of the 10 three-state out- 
puts, and the remaining two gates activate synchro- 
nous register presets and asynchronous register 
resets. 

The array itself consists of 10 AND-gate macro- 
cells (five similar pairs), each logically ORing the 
outputs of anywhere from 8 to 16 AND gates. The 
AND gates are distributed so that the two out- 
ermost macrocells in the array logically OR to- 
gether eight AND gates. The next two outermost 
macrocells logically OR together 10 AND gates; the 


Output 
macrocelt 


AND-gate 
macrocell 


CLK/t, 


Programmable 
AND 


array 
(44 X 132) 


AND-gate 
macrocell 


Vala Ths 


AND-gate 
macrocell 





1. A tuse-programmable LS! device combines the efficiency 
of programmable logic arrays with the flexibility of gate 
arrays. Ten user-definable output macrocells allow the 
designer to set the logic outputs individually for one of four 
configurations. They can even be used as inputs or 
dynamically controlled {/0. 
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next, 12 AND gates; and so on, traveling inward 
along the array. In other words, moving toward the 
array’s center, two AND gates are added to succes- 
sive pairs of macrocells, ending with the two inner- 
most macrocells, each ORing 16 AND gates apiece. 

The variable distribution of AND gates is partic- 
ularly useful when implementing counting, ex- 
clusive-OR functions, and complex state machines. 
For example, a 10-bit binary counter has only one 
AND gate on the LSB but requires an additional 
gate for each succeeding bit, for a total of 10 AND 
gates to implement the MSB. Should a hold, parallel 
load, or specialized function also be desired, the 
MSB requires even more AND gates. 

To program the array into its familiar sum-of- 
products Boolean function, a designer blows the cor- 
responding fuse links. Those links initially con- 
nected the chip’s pins and their complements to all 
132 of the 44-input AND gates. As mentioned, the 
designer can also program any OR-gate output to be 
a dedicated input or a dynamically controlled I/O 
pin. This is possible because the connection from an 
output macrocell to its output pin is through a 
three-state output buffer and is controlled by an 
output-enable AND gate. Turning off the connec- 
tion turns the output pin into an input. Otherwise 
the pin remains connected to the output macrocell, 
fed by the OR gate. 


An output with a difference 


The chip’s output section is where it appears more 
like a gate array than a PAL device. The output 
macrocells allow the designer, by individually set- 
ting the outputs to be either sequentially or combi- 
natorially and either active high or active low, to 
program either the true or complementary version 
of a logical term—whichever makes the most effi- 
cient use of the chip’s AND array and fits the chip 
to the application. Furthermore, independent con- 
trol over the active state helps the designer conform 
to common logic conventions, thus avoiding con- 
fusion and error. 

For example, certain functions, such as chip se- 
lects and resets, tend to be active low. The output 
macrocells avoid contradictions with convention by 
inverting outputs, individually changing them when 
necessary to conform. Finally, by individually 
selecting either sequential or combinatorial out- 
puts, the macrocells impart to the chip the capacity 
for implementing more functions than is possible 
with existing PALs. 

The macrocell (Fig. 2a) contains three main logic 
blocks: a rising-edge-triggered D-type flip-flop 
with asynchronous reset and synchronous preset in- 
puts, a four-to-one output-path selection multiplex- 
er, and a two-to-one feedback-path selection multi- 
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plexer. Two fuses are imbedded in each macrocell to 
control the four possible output configurations: R,, 
selects sequential or combinatorial operation; P,, se- 
lects active low or active high. 

With both fuses intact, the output is sequential 
and active low (Fig. 2b). The Q output of the flip- 
flop passes to an inverting output buffer, and the Q 
output feeds back tothe AND array internally. This 
configuration is particularly suitable for state- 
machine designs. Alternatively, by programming 
(blowing) P,, the designer can make the output ac- 
tive high. 

On the other hand, by programming only R,, the 
designer makes the output combinatorial and active 
low. For this scheme, the OR-gate output bypasses 
the flip-flop and feeds the inverting output buffer 
directly. At the same time, the feedback multiplexer 
passes the output back into the array (Fig. 2c). 

By programming both fuses, the designer makes 
the output combinatorial and active high, and, 


again, the feedback multiplexer passes the output . 


back to the array. This configuration serves best for 
“glue” logic and for generating complex control 
functions. 

Finally, when the device is programmed in the 
combinatorial configuration, the programmable 
Output Enable can be used to transform an output 
pin into a dedicated input or a dynamically con- 
trolled I/O terminal. Once enabled as a dedicated 





input, however, a pin’s output function is sacrificed. 
This capability is extremely valuable when par- 
titioning functions into the device. The limitation is 
no longer the numbers of input devices, but the total 
number of device pins. Lastly, as a dynamically con- 
trolled I/O, the pin serves both functions, meeting 
most bus requirements. 


Intelligent use 


One of the most challenging jobs in designing 
microprocessor systems is installing intelligent pe- 
ripheral controllers—that is, controllers that can 
transfer data themselves and so unburden the pro- 
cessor from inefficient polling operations. One solu- 
tion is to build a custom intelligent controller from 
scratch using the 22V10 (Fig. 3). This controller gov- 
erns data flowing between a microprocessor and up 
to eight peripheral devices, all conforming to the 
Small Computer Systems Interface standard (SCSI, 
the ANSI X3T9.2/82-2). 

The interface consists of an 8-bit bidirectional 
data port (DB;-DBy); three contro] outputs— 
Acknowledge (ACK), Select (SEL), and Reset 
(RST); and five control inputs—Request (REQ), 
Control/Data (C/D), Message (MSG), Busy (BSY), 
and Input/Output (I/O). There are additional 
signals, unrelated to the SCSI] standard. They in- 
clude several microprocessor controls, such as ad- 
dress lines A, and A, for setting the data’s desti- 








Blending programmable logic and gate arrays 


Centered on a fuse-programmable AND array, the 
architecture of a PAL (programmable array logic) 
device offers designers a powerful, instantly mod- 
ifiable logic building block. As a result, PAL devices 
are used in a wide variety of applications, ranging 
from microprocessor-based systems to supermini- 
computers. 

Since gate arrays are not programmed in house, 
they have a much slower turnaround time and higher 
cost. Yet gate arrays’ simple logic structure makes 
them easier to customize than PAL devices, and their 
larger size makes for denser logic structures. 

A gate array’s simpler structure can be a draw- 
back, however. For example, typical gate arrays are 
based on just two- or three-input gates; therefore, to 
implement complex functions, they often require 
many gate levels, which results in slow logic struc- 
tures. In contrast, PAL devices, like memories, make 
more efficient use of their silicon area and can be 
configured into fast logic structures. Consequently, 
even with a slight overhead for programming cir- 
cuitry, PAL densities are beginning to rival those of 
gate arrays. 

However, innovative circuit techniques and an 
oxide-isolated bipolar process team up in the 
AmPAL22V10 and combine the advantages of PAL 


devices and gate arrays. For example, the pro- 
grammable output macrocells are designed to add 
only one Schottky diode in the delay path. As a 
result, the chips selected for high speed have a worst- 
case input-to-output propagation delay and setup 
time of only 25 ns, as well as a worst-case clock-to- 
output delay of a speedy 15 ns, over the commercial 
temperature range. Also, the oxide-isolated process 
fits the chip's ability to implement complex func- 
tions into a 24-pin, 0.3-in.-wide package. 

For convenience, a feature called Preload loads the 
chip’s internal registers with an arbitrary value, 30 
that the designer can perform a complete post- 
programming functional test. In fact, Preload is a 
necessity for testing state machines, where the suc- 
cessive states depend on past and present values, as 
well as on the inputs. Without Preload, complex and 
sometimes impossible sequences would have to be 
entered before the first test could be performed. 

What’s more, platinum silicide fuse technology 
makes programming faster and increases reliability, 
and a well-controlled melting rate yields large, sta- 
ble, nonconductive gaps. Address circuitry, much like 
that of a PROM, blows one fuse at a time, and pro- 
gramming is done with an algorithm that is easily 
adapted to a wide variety of available machines. 
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nation, read and write controls (IORD and IOWR), 
Interrupt Request (INTREQ), Chip Select 
(CSSCSI), System Clock (SYSCLK), and buffer and 
latch enables (BE and LE) for passing bus data be- 
tween the processor and its peripherals. 

In operation, the controller assigns one of eight 
peripheral devices to be a master by activating SEL 
and at the same time sends the master’s address to 
the SCSI data port..From then on, when a device 
notifies the controller that it needs to transfer ei- 
ther a command or data, the controller interrupts 
the host processor. In turn, the processor reads the 
controller’s status register to determine which de- 
vice is requesting service and the type of request it 
is making. After its status register is read, the con- 
troller removes the interrupt request and carries 
out the required transfer. = 

Meanwhile the selected device issues a BSY 
signal, letting the other devices know that the bus is 
in use. The data transfer is managed by the hand- 
shake signals ACK and REQ, and the processor ac- 
tivates RST as needed by pulling both WR and RD 
low at the same time. 











Syneneonous 


reset 





Matching the chip to its application starts with 
an assessment of the available resources against 
those needed to do the job. In general, that means 
establishing that the chip has enough input and 
output pins and product terms to accommodate the 
logic functions required by the application. 

As to the pinouts, the 10 outputs required by the 
application do, in fact, match the 10 available on the 
chip. Four of the pins are used for the bidirectional 
data bus and the rest are dedicated outputs. As for 
inputs, the SCSI controller requires 11, which the 
chip more than satisfies with its total of 12. 

Next the designer develops the product terms 
needed to execute the controller’s job and compares 
the number and size of the required terms with the 
number available from the chip. The resulting con- 
troller equations (see Table 1) number 10, and these 
nestle neatly into the chip’s 10 available AND-gate 
macrocells. Also, the maximum number of terms in 
any one equation—in this case, those making up the 
acknowledgment function—is only 6, compared 
with the chip’s capacity of 8 to 16. Thus the control- 
ler function fits easily into a single programmable 


Output 
selection 
multiplexer 


S; 


2. Each of 10 output macrocells (a) are independently fuse-programmed for one of four 
configurations. Fuse RA, controls whether the outputs are sequential or combinatorial, and 
fuse P,, determines whether they are active high or active low. With no fuses blown, the 
output is sequential and active low (b). Blowing only R, gives a combinatorial active-low 
output, and blowing only P,, gives a sequential active-high output. Finally, blowing both 
fuses changes the output to combinatorial and active high (c). 
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logic array, as can more complicated micro- 
processor peripheral interfaces. 

The 22V10's real power, however, becomes appar- 
ent when it is needed to fill an application that 
appears to demand more than the chip's resources 
can deliver. Insufficient input or output pins or 
product terms are frequent shortcomings of many 
devices when the job is very formidable. With its 
highly versatile I/O architecture, however, the 
22V10 can accommodate conditions that would 
make chips even larger than itself inadequate. 


Table 1. Peripheral controller equations 


Combinatorial equations 
BE = CS: AD*Aon* Ay 
LE = CS: WR: Ag 

D3 = Ay*C + Ay? ACK 
D2 = Ay*M + Aq* SEL 
Dy = A1°8 + Ay* RST 
Do = Aqy*1+Ay°A 


Sequential equations 
INTREQ=Re(l°C*M+3*C*M+I°C*M + ICM) 


ACK=CS « WA+ Az* Ay *D3+R* ACK*( cS * wa’ Ag" At) + 
R*CS+RD*Ag* Ay 


SEL = CS: WR* Ag+ Ay + D2 + B+ SEL*( cog" wat Az" At) 


AST = RD WR + CS* WR* Ag+ Ay + Dy + 
RST-( CS" WR‘ Ap* Ay) 


3. The problem of designing an intelligent peripheral 
controller is simplified by the 22V10's flexible I/O structure. 
With the freedom to assign different architectures to 
different pins, the designer can make better use of the 
chip's input and output pins and product terms. 





For example, one of the most complicated func- 
tions to perform with a digital computer is floating- 
point arithmetic, or more specifically the nor- 
malizing of numbers in the process. As a result, no 
standard MSI or LSI logic for normalization exists. 
Yet the programmable array is well-suited to inte- 
grating at least part of this complicated logic func- 
tion. 

Normalization is necessary because a floating- 
point number is represented in two parts: a man- 
tissa and an exponent. The mantissa is a fractional 
number that is adjusted, or normalized, so that a 
binary 1 always occupies the most significant bit. To 
normalize a floating-point number without chang- 
ing its value, a compensating adjustment must be 
made to the exponent. In effect, the exponent, a 
power of two, is incremented for each bit position 
that the mantissa is shifted to the left. 

Because the process of normalization can be very 
costly if done wholly in parallel, it is more efficient 
to process a number one byte at a time. Further- 
more, because of the function’s complexity, it is rea- 
sonable to partition the normalization unit into two 
units: an execution unit, which acts on the incoming 
data, and a control unit, which directs the execution 
unit. Thus the two units together form a byte-wide 
floating-point normalization (BFN) unit, with the 
programmable logic array making up the execution 
unit (Fig. 4). 

The design of the execution unit breaks up log- 
ically into the four main instruction categories; De- 
tect, Normalize, Merge, and Shift. Detect tests the 
incoming data for a binary one in any bit position. 
When such a nonzero byte appears, a flag, F, 
informs the control unit and tells it to begin the 
normalization process. Next, the Normalize com- 
mand determines the number of bit positions to 
shift the incoming byte, in order to put its first bit 
inthe most significant position. Then the Shift com- 
mand adjusts the input data to the left by the spec- 
ified number of bit positions, filling the lower bits 
with Os. Finally, the Merge command adds in the 
full numbers’ remaining bits, adjusting them so 
that the entire number is intact and shifted. 

From the beginning, even before the equations of 
the execution unit have been fully developed, it ap- 
pears that the resources of the programmable array 
will be insufficient for the job (see Table 2). Four- 
teen input pins are needed, against the chip's 12, 
and 12 output pins are needed, whereas the chip has 
only 10. Nevertheless, a closer look at the execution 
unit equations, with an eye toward consolidating 
I/O pins and functions, turns up paths around the 
seeming shortages. 

One “problem” that proves not to be one is the 
number of outputs. Although they number 13, not 
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all are active at the same time. For example, the 
outputs Vy-Vz, which specify the number of shift 
positions as a three-bit binary number, are active 
only when no other outputs are. Thus'they can share 
their output circuitry with other signals, relieving 
the output shortage. 

However, combining the shift outputs with other 
terms creates a product-term deficiency. Yet, again, 
careful observaticn reveals a solution. One way to 
reduce the product terms of a function is to invert it, 
using the programmahle array’s output macrocell. 
The most likely candidate for such a reduction is the 
function with the most product terms. Although 
there are several, the best, because it is relatively 
removed functionally from the others, is the detec- 
tion flay, F. Its function is given by: 


F = D;+ Dyt D,+ D,+D,+ D.+ D,+Dy 


where D-; through Dy are input data bits, which, if 
any are high, indicate a nonzero byte. 


Table 2. Allocation of chip resources 
Output pins 
tnitlal 


require- 
ments 


F (detection flag), 
V2-Vo, 114 required 
R7-Ro 


When De Morgan's theorem is applied, the long 
OR string of active-high data reduces to a single 
AND term, saving a total of seven product terms. 
Still, two input pins must be fcund for the pro- 
grammable array to fully incorporate the logic of 
the execution unit. Since De Morgan’s theorem can- 
not minimize the logic any further, the only answer 
is to locate any product terms that underuse the 
chip’s resources, combine them, and lend their out- 
put pins to the cause by making them I/0 pins. 

Examination of the instruction input specifica- 
tion (see Table 3) reveals that during Detect only 
three of the five instruction inputs are necessary, 
freeing the two others to be used as outputs. There- 
fore instruction bits I) and I, ean share the outputs 
that were previously dedicated to shift bits Vp and 
V,, thus meeting the execution unit’s final require- 
ments in a single chip.O 


Table 3. Instruction definitions 


Instruction Detection 
code (I4-I9) flag 


Detect OOXXX 





Normalize 
Merge 
108 required Shit 


Data input (100 ns), 


OOXXX 
OIN2N2No 
10N2N4No 


Exponent Input 


D,-Do 


Instructions, 
la-lo 


Values, 


V2-Vo Exponent 


Status BEN X 
FN) (datection | execution adjustment 
flag and unit functional 


others) unit 


Byte-wide aay A 
normalization ( 
control unit 


Function unit 
clock (50 ns) 
(CLK) 


BFN functional unit 


Result output, 


7 


Exponent output 





4. Floating-point arithmetic is a hardware-intensive process that, for the most part, has 
defied integration. Yet by partitioning the tasks into smaller pieces, a designer can 
apply the power of the programmable array to the job. The chip's flexibility 
contributes to its successful implementation as a floating-point execution unit, even 
though its resources may seem insufficient. 
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XOR PLDs Simplify Design of 
Counters and Other Devices 





XOR PLDs unique architecture is better 
suited than that of conventional AND-OR 
PLDs to certain applications. Once you un- 
derstand the basic theory behind their oper- 
ation, you can easily configure them to per- 
form the tasks of such devices as binary 
counters and video shift registers. 


Chris Jay, Monolithic Memories Inc 
Many designs—notably high-speed counters for ac- 


cessing memories, for counting events, or for sequential 
control in high-speed processor systems—require state 


INOUTS, FEEDBACK AND 1/0 


INOUTS, FEEDBACK AND VO 


Pe 


a a oH: 








machines that step through their sequence of operation 
by a binary count. XOR PLDs are ideal devices for 
these types of jobs, but they are rarely considered for 
such applications—perhaps because design engineers 
don’t understand their useful, but unusual, architec- 
ture. 

Like the more sophisticated of today’s conventional 
devices (Fig la), exclusive-OR (XOR) versions have a 
registered architecture (Fig 1b). The distinguishing 
features of the latter are two pairs of product terms 
(multiple-input AND gates), which drive two summing 
terms (OR gates), which in turn feed two inputs of an 
XOR gate. The combinational XOR gate’s output con- 
nects directly to the D input of the register. (For a 
theoretical discussion of the advantages of XOR PLDs 
over AND-OR PLDs, see box, “Principles of XOR PLD 
operation.”) 


Fig I—You can easily distinguish between a conventional AND-OR PLD (a) and an XOR PLD (b). The latter has an XOR gate preceding the 
register. 





Reprinted from EDN, May 28, 1987 
© 1987 CAHNERS PUBLISHING COMPANY 
a Division of Reed Publishing USA. 
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The distinguishing features of an XOR 
PLD are two pairs of product terms, which 
arive two summing terms, which in turn 
feed two inputs of an XOR gate. 


The applications described here make use of MMI’s 
20X8A and 20X10A PAL devices. These 24-pin XOR 
PLDs have an improved 22.2-MHz clock and worst-case 
propagation delay of 30 nsec compared with older, 
12.5-MHz, 50-nsec parts. 


XOR PLDs bring benefits to counters 

The prime use of XOR PLDs is in counters. Fig 2a 
shows the pinouts and Fig 2b the Palasm logic equa- 
tions of a 10-bit, loadable, up/down counter. You can 
design such a counter into a single 20X10A PAL device. 
The counter has ten registers, QO through Q9, capable 
of binary, up/down counting. The up control is high for 
an up count and low for a down count. Registers Ql 
through Q8 load synchronously through data inputs D1 
through D8; the loading is enabled by an active-low /LD 
input (see box, “Palasm notational conventions”). Be- 
cause of a pin limitation of the package, Q9 loads via the 


10-BIT EVEN BOUNDARY LOADABLE 
UP/DOWN COUNTER 


PAL 
20X10A 


up/down-control pin. This pin (the D9 input during 
loading) controls the counter’s up- or down-counting 
mode. Q0 is always loaded with a logic low. 

A counter larger than 10 bits requires two or more 
XOR PLDs. Unlike MSI counters, which come with 
carry-in and carry-out pins, you must incorporate in 
your design provisions for cascading XOR PLDs. When 
cascading two XOR PLDs, you must provide a carry- 
out-enable signal from the least significant counter to 
the count-enable input of the most significant counter. 

When the least significant counter has reached its 
maximum count, it should enable the most significant 
counter to increment its count after the next rising 
edge of the clock pulse. However, time delays arising 
from the time needed to decode and propagate the 
Carry signal could slow the system down. To avoid this 
delay, your design should generate a look-ahead-carry 
output. 


ing Qo*/LD #COUNT 
+ LD 7 LOAD 
s+; GND + GND ; 


2= /Q1*/LD 
+ /D1*LD 
s+: QO*/LD*UP 
+ /QO0*/LD*/UP 


/Q2*/LD 
+ /D2*LD 
i+t QO*#Q1*/LDAUP 
+ /Q0#/Q1*/LD*/UP 


/Q3*/LD 
+ /D3*LD 


*+HOLD 

LOAD 
;COUNT UP 
s;COUNT DOWN 


*HOLD 

+ LOAD 
;COUNT UP 
*COUNT DOWN 


7HOLD 
+LOAD 
*#COUNT UP 
;COUNT DOWN 


s+: QO*Q1*Q2*/LD*UP 
+ /Q0*/Q1*/Q2*/LD*/UP 


/Q44/LD 

+ /D4&*LD 

r+ QO0#Q1#Q2#Q3*/LDAUP 

+ /Q0%/Q1*/Q2*/Q3*/LD*/UP 


/Q5*/LD 

+ /D5*LD 

t+t Q0#Q1#Q24Q3*04*/LDAUP 

+ /Q0*/Q1#/Q2*/Q3*/Q4*/LD*/UP 


/Q6*/LD 
+ /D6*LD 


+HOLD 

*LOAD 
?COUNT UP 
7COUNT DOWN 


;COUNT DOWN 


+HOLD 

7LOAD 
#COUNT UP 
#COUNT DOWN 


r+2 QO#Q1*Q24*Q3*Q44Q5#/LD*UP 
+ £Q0*/Q1#/Q2*/Q34*/Q4*/Q5*/LD*/UP 


/Q7*/LD 

+ /D7*LD 

t+2 Q0#Q1#Q2#Q3#Q4#Q5#Q6*/LDAUP 

+ /Q0*/Q1*/Q2*/Q34/Q4%/Q5*/Q6*/LD*/UP 


/Q8*/LD 

+ /D8*LD 

$2 QO#Q1#Q24Q3 #Q4*Q54Q6%Q74*/LDFUP 

+ /QO*/Q1#/Q24/Q3*/Q4*/Q5 */Q6%/Q7*/LD*/UP 


/Q9*/LD 

+ /UP*LD 

343 QO*Q1*Q24Q3 #Q4%Q5 *Q6 4Q7 *Q8 ¥/LD*UP 

+ /QO*/Q1*/Q2*/Q3#/Q49/Q5*/Q6*/Q7 */Q8*/LD*/UP 


+HOLD 

#LOAD 
;COUNT UP 
;COUNT DOWN 


+HOLD 

7LOAD 
;COUNT UP 
;COUNT DOWN 


7HOLD 

7LOAD 
sCOUNT UP 
7COUNT DOWN 





Fig 2a—This diagram shows the pin designations for a 20X10A XOR PLD configured as a 10-bit, loadable, up/down counter. This design is 


not cascadable. 
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Principles of XOR PLD operation 


The configuration of an exclu- 
sive-OR (XOR) PLD simplifies 
counters and other state ma- 
chines that progress through a 
binary-encoded counting se- 
quence. You do not, of course, 
necessarily need a PLD with 
XOR gates to design a binary 
counter; you can use the simpler, 
sum-of-products (SOP) architec- 
ture of a conventional PLD (see 
Fig la in the main text) to 
implement the logic equations 
that define a binary-count 
sequence. 

However, the conventional 
PLD requires a larger number 
and AND gates, or product 
terms, for a counter’s higher- 
order registers than does an XOR 
PLD. The Boolean equations that 
follow prove this point. They are 
required for a generalized 
binary-count sequence and are 
rendered in Palasm notation (sce 
box, “‘Palasm, notation conven- 
tions”). 

If QO is the Icast significant 
register, then 


/Q0 := QO 

This is a straightforward binary 
function—divide by 2. If Q1 is 
the nest significant register, it 


requires, the Boolean sum of two 
product terms: 


/Q1 := QO*Q1 +/Q0*/Q1 


For register Q2 the equation 
becomes 


/Q2 ;= Q2*Q1*QO + /Q2*/Q1 + 
1Q2*/Q0 


which requires three product 
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terms. From a gencralized counter 
with N registers, the Nth regis- 
ter, QN, requires N+1 product 
terms: 


/QN := QN*Q(N-1)* ... Q1*QO 
+ /QN*/Q(N-1) 


+ /QN*/QN1 
+/QN*/Q0 


For example, the most signifi- 
cant register is a 10-bit counter, 
Q9, would require the summing 
of 10 product terms, using 10 
AND gates, in a convential 
AND-OR PLD. Consequently, 
although you can write abstract 
counter equations for AND-OR 
PLDs, you may not be able to 
find a real PLD with enough 
product terms to embody those 
equations. 

An XOR PLD, on the other 
hand, uses its product terms 
more economically; no matter 
how significant the weighting of 
the register in a binary counter, 
each register requircs only two 
product terms. 

Before delving into the de- 
tailed Boolean equations that 
support the preceding contention, 
you should get a feel for the 
logic underlying the XOR PLD. 
Consider that, in a binary-count 
sequence, the polarity of a 
register’s output toggles one 
clock period after the state in 
which all of the less significant 
registers were a logic one. 

Next, imagine that you have a 
2-input XOR gate driving a reg- 
ister. The output of the register 
is fed back to one XOR input; 
the other input sees a product 
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term comprising all the less-sig- 
nificant digits. The XOR gate 
will hold the register in its 
existing state until the product 
terms of the less-significant 
digits go True, whereupon the 
XOR gate will toggle the output 
of the register. 

The following is a generalized 
equation for a register (QN) ina 
binary-count sequence using the 
XOR PLD: 


QN := QN :+: Q(N-1)*Q(N-2)* 
... Q2*Q1*Q0 


You can consider the term 
Q(N-1)*Q(N-2)*... 

Q2*Q1*Q0 as a Carry input to 
register QN. 

When Q(N-1)*Q(N-2)*... 
Q2*Q1*Q0 = 0, the register QN 
is in a hold condition, and QN’s 
output feeds back through the 
fuse array and the XOR gate and 
loads into itself. Only when 


QIN-1)*Q(IN-2)*... 
Q2*Q1*Q0 = 1 


will the XOR function invert the 
polarity of QN and strobe the 
inverted state into the register in 
synchronicity with the rising 
edge of the next clock pulse. 
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7 a a a 


Unlike MSI counters, which come with 
carry-in and carry-out pins, you must in- 
corporate in your design provisions for cas- 
cading XOR PLDs. 


The logic in the least significant counter can decode 
the penultimate count and feed the D input of a 
carry-output-enable register. When the counter incre- 
ments to the final count, the carry-output-enable signal 


D6 D7 O08 
0 0 0 


Oo 0 O 
Q3 Q4 Q5 Q6 Q7 8 


is then clocked into the carry-output-enable register. 
The propagation of the carry-output-enable signal to 
the next counter stage is therefore coincident with the 
generation of the output of the ultimate count from the 


D9 010 011 012 O13 O14 


UP DO Of D2 03 04 DS 06. 
bie Ae oR  CEBEOD 


 PALZ0X19A 


~ @0..a1 02°03 “a4 -05 


O 0 0 0 O O O 
Q9 Q10 Q11 Q12 Q13 Q14 Q15 Q16 Q17 


ov 


Fig 3a—Unlike the counter illustrated in Fig 2a, this counter uses a look-ahead-carry-out technique. Provision of this signal renders 


the counter cascadable. 
Fig 3b 


 UP#/Q0*Q1*Q2*93*94*Q5 *064Q7*Q8*/LD 
+ — /3P#Q0#/Q1*/Q2*/Q3*/Q4*/Q5*/Q6*/Q7*/Q8*/LD 


Q0*/LD 
Lo 


Z/QLS/LD 
/DL*LD 


s QO*/LO*UP 
/Q0*/LD*/UP 


7Q24/LD 
/D2*LD 

: Q0*Q1*/LD*UP 
700*/Q1*/LD*/UP 


7Q3*/LD 
/D3*LD 

rt: Q04Q14Q2*/LD4UP 
7/Q0*/Q1*/Q2*/LD*/UP 


/Q4*/LD 
/D4*LD 
¢ Q0*Q1*Q2*Q3*/LO*UP 
+ — /Q0*/Q1*/Q2*/Q3*/LD*/UP 


1Q5*/L9 
/O05*LD 
ss Q0*Q1*Q2*Q3*Q4*/LD*UP 
7Q0* /Q1*/Q24/Q3"/Q4*/LD*/UP 


/Q6*/LD 
/D6*LD 
+ Q0#Q1*92*Q3*04*05*/LD*UP 
+ £Q0#/Q1*/92*/93*/Q4*/Q5*/LD*/UP 


/07#/LD 

/D7*LD 

Q0#Q1*0924934*04405*Q6*/LD*UP 
7Q0#/Q1*/Q2*/Q3*/04"/05*/Q6*/LD* /UP 


/Q8*/LD 

/D8*LD 

0492 #92493 "04495406 "Q7*/LD4UP 
700*/Q1*/Q2*/Q3*/Q4*/Q5*/Q6*/Q7*/LD*/UP 


Fig 3c 


CEBEO = += UP*Q0*Q1*02#Q34Q4405*Q6"Q7*08*/LDACEI 
+ /UP*/Q0*/Q1*/Q2*/Q3*/Q4*/05*/06*/Q7 
*/Q8*/LD*CEI 


/Q0*/LD*/CEI 
QO*/LO*CEI 
s+: L0*/D0 


/QL*/LD 
/D1L4LD 

+: QO*/LD*UP*CEL 
/QO0*/LD*/UP*CEL 


/Q2*/LD 
/D2*LD 

s+: QO*Q1*/LD*UP*CEI 
/Q0*/QL*/LO*/UP*CET 


/Q3*/LD 
/D3*LD 

z+: Q0*Q1*Q2*/LD*UP*CEL 
1Q0*/Q1*/Q2*/LD*/UP*CEL 


/Q4*/LD 
/D4*LD 

rt: Q0*Q1*Q2*Q3*/LO*UPACEL 
/Q0*/QL*/Q2*/Q3*/LD*/UP*CEI 


/Q5*/LD 
/D5*LO 

: Q0*Q1*Q2#Q3*Q4*/LD*UP*CEL 
/00*/Q1*/Q2*/Q3*/Q4*/LD*/UP*CEL 
/Q6*/LD 
/D6*LD 

: Q0*Q1*Q2*Q3*Q4*Q5*/LD*UP*CEL 
/Q0*/Q1*/Q2*/Q3*/Q4*/Q5*/LD*/UP*CEL 
/Q7*/LD 
LD 


z Q0*Q1*Q2*Q3*Q4*Q5*Q6*/LO*UP*CEL 
/Q0*/Q1*/Q2*/Q3*/Q4*/Q5%/Q6*/LD*/UP*CEI 


/Q8*/LD 
LD 


: Q0*Q1*Q24Q34Q49Q5*Q6*Q74*/LD*UPACEL 
/Q0*/Q1*/Q2*/Q3*/Q4*/05*/06%/Q7"/LD*/UP*CEI 
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registers in the least significant counter. By anticipat- 
ing the ultimate count in this fashion, this scheme 
avoids the propagation delay involved in decoding the 
final count. 

The counter designs specified by the sets of logic 
equations presented in Figs 3b and 8c use this 
look-ahead-carry-out technique. The designs are cas- 
cadable, 9-bit up/down counters; Fig 3c’s specification 
generates both Carry and Borrow signals on a look- 
ahead basis. You can use these specifications to create 
an 18-bit counter by routing the Carry/Borrow output 
(CEBEO) of the first up/down counter to the count- 
enable input (CEI) of the second up/down counter (Fig 
3a). These counters are capable of clocking at rates as 
high as 22.2 MHz. 

To cascade more than two counters for high-speed 
counting, your design would require a composite carry- 
enable output to the more significant counters in the 
chain. You would need to generate a look-ahead-carry 
output from only the least significant counter; higher- 
order counters in the chain would require a count- 
enable output generated from the ultimate, or final, 
count rather than the penultimate count. Fig 4a shows 
a technique for cascading more than two counters in a 
synchronous-count chain. Depending upon the availa- 





Palasm notation conventions 


For consistency with the design spercifications, 
the text of this article uses the Palasm notational 
conventions for Boolean operators instead of the 
classical symbols. Palasm is MMI’s PAL assem- 
bler, which converts Boolean-logic equations into 
fuse maps for PLDs. (Ed Note: Readers familiar 
with other PLD compilers may be more used to 
the C-language conventions used in Cupl and 
Abel. One compiler, logIC from Kontron, allows 
you, thankfully, to choose your own favorite sym- 
bol set.) 
« / represents a logical inversion. 

* represents a logical AND. 

+ represents a logical OR. 

= represents a logical equality. 

+: represents an exclusive- OR (XOR. 

= represents the state-machine operation 

“Becomes equal after the rising edge of a 

clock pulse.” 





bility of input pins, you could incorporate the required 
gates into the PLD itself, as Fig 4b shows. 

XOR PLDs have many applications outside of binary 
counting. The example of a video shift register shows 
how video applications can benefit from the 
programmable-polarity feature of the XOR gate. 

Fig 5a shows a typical video system using the 
PAL20X8A. Fig 5b gives the PLD design specification. 
An XOR PLD, when functioning as a video-shift regis- 
ter, loads row data, in parallel, from a character ROM 
or RAM. A control input to the XOR PLD (reverse 





Fig 4—You can cascade two or more XOR-PLD counters in a synchronous-count chain with either external (a) or internal (6) logic. 





ot Monolithic hari Memorles cl 


XOR PLDs Simplify Design of Counters and Other Devices 








~ PAL VIDEO 


* CONTROL 


PAL20XBA - 
VIDEO SHIFT 
REGISTER - 


ROW 
ADDRESS 
AO, A1, A2 


CHARACTER 
ADDRESS 
. 8 BITS 


CHARACTER: ° 
ROM . 


MPU 
DATA DO0-D7 





Fig 5a—XOR PLDs suit other applications besides counters. The 
device’s programmable output polarity is a vital part of this design, 
which uses an XOR PLD as a video shift register that incorporates a 


cursor control. 


/DO*LD*/REV*/CURS*/RST 
+ DO*LO*REV*/CURS*/RST 
s+: /QL*/LD*/CURS*/RST 
+ RST 


t= /D1*LD*/REV*/CURS*/RST 
+ DI*LO*REV*/CURS*/RST 
t+: /Q2*/LD*/CURS*/RST 
+ RST 


/D2*LD*/REV*/CURS*/RST 
+ D2*LD*REV#/CURS*/RST 
t+: /Q3*/LD*/CURS*/RST 
+ RST 


t= /D3*LD#/REV*/CURS*/RST 
+ D3*LD*REV*/CURS*/RST 
t+: /Q4*/LD*/CURS*#/RST 
+ RST 


/D4*LD*/REV*/CURS*/RST 
+ D4*LD*REV*/CURS*/RST 
t+: /Q5*/LD*/CURS*/RST 
+ RST 


/DS*LD*/REV*/CURS*/RST 
+ D5*LD*REV*/CURS*/RST 
s+: /Q6*/LD*/CURS*/RST 
+ RST 


/D6*LD*/REV*/CURS*/RST 
+ D6*LD*REV*/CURS*/RST 
t+: /Q7*/LD*/CURS*/RST 
+ RST 


/D7*LD*/REV*/CURS*/RST 
+ D7*LD*REV*/CURS*/RST 
t+3) /LD*/CURS*/RST 
+ RST 


7LOAD DO 

;LOAD INVERSE OF DO 
;SHIFT FROM Q1 
+RESET QO LOW 


*CURSOR LOW CLEARS /QO0 
#LOAD Dl 

+LOAD INVERSE OF Dl 
tSHIFT FROM Q2 

7CURSOR LOW CLEARS /Q1 
+RESET CLEARS Ql 

7LOAD D2 

7LOAD INVERSE OF D2 
+SHIFT FROM Q3 

*CURSOR LOW CLEARS /Q2 
#*RESET CLEARS Q2 

7LOAD D3 

#LOAD INVERSE OF D3 
*SHIFT FROM Q4 

+RESET CLEARS Q3 
#CURSOR LOW CLEARS /Q3 
7LOAD DE 

7LOAD INVERSE OF D4 
#*SHIFT FROM Q5 

+CURSOR LOW CLEARS /Q4 
#RESET CLEARS Q4 

#LOAD DS 

7LOAD INVERSE OF D5 
+SHIFT FROM Q6 

#CURSOR LOW CLEARS /Q5 
#RESET CLEARS Q5 

#LOAD D6 

+LOAD INVERSE OF D6 
*SHIFT FROM Q7 

#CURSOR LOW CLEARS /Q6 
RESET CLEARS Q6 

#LOAD D7 

#LOAD INVERSE OF D7 
#SHIFT HIGH INTO /Q7 
+RESET CLEARS Q7 





The XOR PLD series has many applica- 
tions outside of binary counting. 


PAL VIDEO 
REFRESH 


; COUNTER 
11-BIT 
REFRESH 
ADDRESS 


> 
AO 


S BOe* * B10 
AO AO AO 


$ MPU 
« ADDRESS 
A10 


VIDEO RAM « 
e 


WwW 


video, or REV) determines whether or not the data 
loaded into the register is to be inverted. Therefore, 
the XOR PLD can supply normal-video or reverse-video 
information. 

This design also incorporates a cursor control, which, 
when active, sets all of the register’s outputs high. The 
design also has a reset line to clear all the registers. 
When you release the load input, the clock input will 
serially clock the data out of register QO as a video-data 
stream. The clock input to the PLD determines the dot 
rate of the video information and must come from an 
external dot-rate generator. 

The Palasm logic-equation files presented in this 
article omit comments, pin definitions, and device- 
simulator specifications because of space limitations. If 
you want the entire Palasm file, you may call the author 
at (408) 970-9700. EDN 
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The PAL2ORA10 Story— 
The Customization of a 


Standard Product 


ing with the new generation of 32-bit, high- 

performance microprocessors is maintaining both 
byte and word addressability within the microprocessor’s 
32-bit-wide word format and accommodating both high- 
and moderate-speed memory components and peripherals in 
the same system. Consider this example: Motorola’s 32-bit 
microprocessor, the MC68020, has a feature known as dy- 
namic bus sizing. This feature helps maintain both byte and 
word addressability within the microprocessor’s 32-bit word 
format. The microprocessor has internal logic that manipu- 
lates and directs words smaller 
than 32 bits. However, the 
MC68020 requires additional ex- 
ternal logic to handle 8- and 
16-bit memory arrays and pe- 
ripherals and to maintain byte 
addressability of 32-bit memory 
arrays. Consider this as 
well: The MC68020 is available in both 12-MHz and 
16.7-MHz speed grades. The 16.7-MHz speed translates into 
60-ns machine cycles and, consequently, into 90-ns memory 
cycles. Thus, the MC68020 can keep up with many of 
today’s fastest memories. Moreover, in systems that use 
both fast memories and slower, low-cost bulk ones, the 
16.7-MHz version can interface with the slower memories 
without holding back the faster ones. But to do this, the 
MC68020 must insert wait states to extend the memory cycle 
for the slower memories. 

Both design problems call for additional circuitry—that 
is, some ‘‘glue’’ logic. And because the MC68020 employs 
asynchronous control signals and the memories need syn- 
chronous control, the additional circuitry must be capable 


A mong the greatest challenges one faces when design- 


A new, registered, asynchronous 
PAL device originated with suggestions 
from the manufacturer’s 
customers. 


of both asynchronous and synchronous operation. A prod- 
uct from Monolithic Memories, Inc., the PAL20RA10, pro- 
vides what we believe is an elegant, single-chip solution to 
these problems. 

The PAL20RA1O0 is one of the most recently developed 
members of Monolithic’s programmable array logic 
product family. As with all other PAL devices, it contains 
an array of programmable links that the user can configure 
to implement a variety of logic functions. The PAL20RA10 
is unusual in that the customer not only can configure the 
internal logic transfer function but also can implement logic 
functions to control the clock, 
the output enables, and the 


register. In addition, the 
user can configure the outputs 


and active high or active 
low. We will discuss 

these features, as well as a universal port interface adapter 
application, in greater detail later in this article. 


PLDs and the semicustom market 


Programmable logic devices, or PLDs, fit between stan- 
dard logic and customer-designed circuits and thus are con- 
sidered semicustom circuits, along with gate arrays and 
standard cells. PAL products are a type of PLD. Discus- 
sions of semicustom integrated circuits often center on 
technological issues: process capabilities, device speed and 
power dissipation, number of available gates, and so on. 
But the customization of a device involves more than just 
meeting the customer’s functional requirements. The device 
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set and reset operations for each 


to be registered or nonregistered, 





Copyright © 1986 The Institute of Electrical and Electronics Engineers, Inc. 
Reprinted with permission from 1EEE MICRO, 
10662 Los Vaqueros Circle, Los Alamitos, CA 90720. 
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Table 1. 
Logic devices rated by various selection 
criteria. 
Standard Gate Standard Full 

Criterion logic PLDs arrays cells custom 
Sourcing and 

availability 1 1 3 4 4 
Performance 5 3 3 
Space 

efficiency 5 4 2 2 I 
Ease of 

design* 2 1 2 4 4 
Ease of design 

changes 3 1 2 4 4 
Design 

cost** 1 | 3 4 5 
Time to 

market 2 ] 3 4 5 
System-level 

mfg. cost 5 3 3 2 1 


* With the exception of standard logic, this means ease of use of the design 
tools, i.e., ease of understanding the design tool concepts, ease of doing the re- 
quired input, etc. 

** Design cost is the sum of the NRE (non-recurring engineering) costs and such 
non-recurring manufacturing costs as masking, test vector generation, etc., 
that are part of the product development cycle. 
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should also meet the customer’s requirements for price, 
availability, risk, ease of use, and support. Only when all of 
these requirements have been met has a device been truly 
customized. 

Within this definition, even standard 7400 series logic 
might be considered customized. It meets most customers’ 
requirements for low cost, availability, low risk, and high 
ease of use, and it provides just about any desired function. 


PLDs and gate arrays—semicustom 
alternatives and trade-offs 


The PLD was first conceived as a replacement for dis- 
crete TTL logic designs. Customers wanted higher integra- 
tion than that possible with standard SSI/MSI logic, but 
they could not find the functions they wanted in standard 
LSI circuits and did not want to pay for fully customized 
ones. The PLD provided a new option—semicustom 
LSI—that had the advantages of SSI/MSI availability and 
many of the advantages of fully customized LSI without 
such drawbacks as high cost and long development time. 


Today, gate arrays are rapidly becoming competitive in 
the same applications as PLDs. However, PLDs provide 
several advantages over gate arrays that make them much 
more amenable to customer needs for many applications. 

The primary advantages of PLDs are their immediate 
availability, the lower risk they entail, and their lower over- 
all cost per application. A design engineer, upon completing 
his design, can immediately customize a PLD to his specifi- 
cations. If he discovers that he has made a mistake in his 
design, he can correct it and program another device. Since 
he customizes a part just before it is used, he undertakes no 
risk in committing to a particular design. If he needs to 
make a change, he can program a part from inventory to 
the new specification—no parts are wasted. He can program 
the new part in minutes. 

According to a survey that appeared in the January 12, 
1984, issue of EDN, the two greatest disadvantages of gate 
arrays are their high development cost and the lack of sec- 
ond sources for them. PLDs, in contrast, have an extremely 
low development cost and offer multiple sources. The short, 
simple development cycle for PLDs translates into easier 
use, lower costs, shorter time to market, and greater flex- 
ibility. Table 1 rates standard, semicustom, and full-custom 
integrated circuits according to various criteria used in 
choosing an IC for a design. A ‘‘1’’ indicates the best 
choice; a ‘‘4’’ indicates the worst. 

Dataquest, a market research firm, lists in the December 
28, 1984, issue of its Nielsen Dataquest Research Newsletter 
(‘‘Gate Array Impact on the ASIC Marketplace,’’ by Katy 
Guill and Andy Prophet) five needs of the gate array user, 
all of which are met efficiently by PLDs. The first is for 
quick delivery of prototype units. With PLDs, prototypes 
are available in minutes once the design is complete. The 
second need is for easy-to-use design automation tools that 
require a minimum amount of time to learn. PLD software 
tools include assemblers and higher-level languages that en- 
able a design engineer to automatically convert whatever de- 
scription he has created into a fuse plot. The PLD can then 
be programmed as easily as a PROM. 

The third need is for prototype units that work the first 
time. PLDs, being standard parts, are fully tested during 
production and can be quickly verified after customization. - 
If a design error has been made, the design can be quickly 
modified and another prototype can be programmed in 
minutes. Using personal-computer-based software and a 
standard programmer, the design engineer can stay at his 
own desk and proceed to alter his design on the PC, and 
then instantly download the new design to the programmer 
and the new device. The device can then be dropped into 
the target application and the design verified. The ease with 
which PLDs can be verified encourages experimentation 
with designs. 

A fourth need is for low development and production 
costs. PLDs have extremely low development costs—only 
the up-front costs of the software and programmer. Their 
design cost consists simply of the design engineer’s time. 
The production costs for PLDs are driven down by the mul- 
tiple sourcing and high-volume production characteristic of 
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standard products. Gate arrays provide neither of these 
advantages. 


The fifth need is for convenient design centers and train- 
ing. The PLD design center is very convenient indeed—the 
engineer’s personal computer. Our company, Monolithic 
Memories, offers tutorial aids in several formats. Moreover, 
the standard nature of the PLD makes it easier to under- 
stand and use than the more customized and thus less 
“friendly’’ gate array. 


There are many other factors that influence the choice of 
semicustom device technology, of course, such as architec- 
tural efficiency. The ease of use of the PLD unfortunately 
leaves it much less efficient architecturally than other forms 
of semicustom logic. In a gate array, the user constructs log- 
ic functions using two-input NAND gates. In a PLD, the 
user constructs logic functions using higher-level macrocells. 
Naturally, there will be more wasted silicon in the PLD, al- 
though its optimized macrocell and efficient fuse array will 
still provide an effective structure. It is the gate array’s very 
architectural efficiency that is the source of its high develop- 
ment cost, since that efficiency derives from computerized 
trial-and-error routing and a custom mask, both of which 
are expensive. 


Figure | gives the 1985 sales volume for various types of 
ICs and estimates the 1990 volume for them. All types show 
healthy growth. Because standard cells are starting at a low 
level and because they are expected to become very popular 
as the IC market matures, they are expected to show the 
highest growth rate. Linear arrays, though relatively new 
and technologically difficult, should also show impressive 
growth. PLDs and gate arrays should also show very 
healthy growth, with the growth in PLDs (367 percent) ex- 
ceeding that in gate arrays (234 percent). Part of the greater 
growth of PLDs is attributable to their ease of use, but part 
is also attributable to the ease with which they can be repro- 
grammed—for circuit designs that are likely to change, pro- 
grammable logic is the only practical solution. 


It is very difficult to directly compare PLD cost and gate 
array cost—one is comparing apples and oranges. PLDs 
have a minimal development cost and a standard unit cost. 
Gate arrays, on the other hand, have a large development 
cost and a small unit cost. Moreover, it is very difficult to 
determine the PLD architecture that is equivalent to a par- 
ticular gate array and vice versa. Currently available PLDs 
such as Monolithic’s MegaPAL devices have as many as 
5000 four-input NAND-gate equivalents. However, a 
5000-gate gate array probably cannot replace a 5000-gate- 
equivalent PLD exactly, and such a PLD cannot replace a 
5000-gate gate array exactly. Such replacement is very 
application-dependent. A MegaPAL array, for example, is 
limited to 64 inputs and 32 outputs. If a gate array has more 
inputs and outputs, the MegaPAL device cannot replace it. 
And if the gate array cannot duplicate a feature of the 
MegaPAL array—such as its large fan-in on its AND gates, 
for example—then the gate array cannot replace MegaPAL 
array. Even at lower gate densities, comparisons remain 
application-dependent. 
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Figure 1. Growth of the worldwide application-specific 
IC market from 1985 to 1990. Dollars are in millions. 
(Source: Integrated Circuits Engineering Corp.) 


For purposes of providing a general idea of the cost effi- 
ciencies over volume of gate arrays and PLDs, we can com- 
pare them as if they did have one-to-one equivalency, how- 
ever. The nonrecurring engineering, or NRE, charge for a 
gate array can vary from $15,000 to $75,000, depending on 
complexity and design methodology. If a gate array has a 
$35,000 NRE charge and a $1.00 unit cost, its cost per unit 
drops almost inversely with volume, as shown in Figure 2. 
Now let us examine a specific type of PLD, the PAL device. 
If a PAL device costs $5.00 for a single unit and drops in 
price with volume according to standard pricing curves, it 
shows a more steady decline in price with volume (the PAL 
plot in Figure 2). The crossover point in this comparison is 
almost at 100,000 units. The graph shows quantatively the 
reasons why gate arrays become cost-effective over larger 
volumes, but it leaves out several other factors, including 
gate equivalencies, risk, and time to production. 

Let us compare the design and production costs for PLDs 
and gate arrays. For a PLD, design cost means the design 
engineer’s time, as we mentioned above. This is already a 
committed cost. However, for a gate array, design cost can 
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vs. gate array cost per unit. The break-even point 
for PAL devices and gate arrays occurs at about 
100,000 units. 


Figure 3. The basic PAL consists of two j 
arrays—a fuse-programmable AND array 
and a fixed OR array. This arrangement 
follows the sum-of-products format 
familiar to the design engineer who uses | 
Boolean logic. | 





Figure 4. Two-input AND array with four AND 
gates. An input (in or /in) is disconnected from an 
AND gate when the fuse associated with that input 
is blown. 
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Figure 2. PLD (PAL and HAL device) cost per unit 
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be a sum of several factors. The time a designer spends on a 
computer to create a design can be very expensive, and the 
software he needs to create and test the design can also be 
expensive. Production costs for both types of device include 
not just the cost of the silicon but also the cost of masks, 
fabrication, and labor. And if a device does not work as ex- 
pected, the entire process must be repeated. 

With a PLD, the only purchase cost is that for the silicon 
itself. The PLD is mass-produced, so all the supporting 
design efforts—fabrication, test generation, and reliability 
enhancements—are spread over time and many devices. The 
result is much lower cost to the user. 

For high-volume, fixed applications, a PAL device can be 
replaced with a mask-programmed hardwired array logic, or 
HAL, device. The HAL device replaces the fuses of the 
PAL circuit with a final mask step that permanently 
metallizes the logic connections. This alternative is an ex- 
cellent match for the volume applications that are attractive 
to gate arrays. The HAL circuit combines the lower unit 
cost of the mask-programmed logic device with the pro- 
totyping capabilities of the PAL circuit. It offers advantages 
unmatched by any other form of semicustom logic. Assum- 
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Figure 5. An “x” indicates an intact fuse. The two- 
input AND array with four AND gates (product 
lines) from Figure 4 is shown along with a two- 
output OR array. Note that the AND array is user- 
programmable, whereas the OR array is hardwired. 
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ing an NRE charge for a HAL device of $5000 and a unit 
cost of $2.00, we can plot the relationship of the HAL 
device to the PAL and gate array (see Figure 2 again). 


PAL devices—historical background 


PAL devices were introduced by Monolithic Memories in 
1978. They consist of two arrays of logic gates. The first 
array consists of AND gates that form Boolean products of 
the inputs, and the second array consists of OR gates that 
sum these products (see Figure 3). Hypothetically, there are 
n inputs to the AND array and m outputs from the OR 
array. The AND array consists of p AND gates (and p prod- 
uct terms), and all n device inputs, both inverted and non- 
inverted, are potential inputs to any of the p AND gates. In 
an unprogrammed PAL device, every device input is actu- 
ally connected to every AND gate. A device input ceases to 
be an input to a particular AND gate if the fuse that con- 
nects that input to the gate is blown. Fuses are blown ac- 
cording to a fuse map that is generated by the user via soft- 
ware. Figure 4 shows a simple two-input array with four 
AND gates. Figure 5 shows a simplified notation for this 
circuit—an ‘‘ x”? is used to indicate that a fuse is intact, and 
a single line is used to represent all lines of connection with 
an AND gate. (Note that the use of the single line does not 
imply the connection of those lines to each other.) Figure 5 
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also shows a two-output OR array that uses the same con- 
ventions for intact fuses and lines of connection. Most PAL 
devices have a considerably greater number of inputs, prod- 
uct terms, and outputs than shown here, but this example 
illustrates the general PAL structure. 

The arrangement of the AND and OR gates follows the 
sum-of-products format familiar to the design engineer who 
uses Boolean logic to express his logic requirements. As 
implied above, the engineer can program the AND array in 
a PAL device by using links similar to those used in a 
PROM circuit. He opens the particular links needed to con- 
figure the logic to the user’s specification. 

Before discussing the PAL20RA1O0 in particular, we 
should note the differences between PAL devices and other 
PLDs. Most PLDs consist of an AND array followed by an 
OR array. This is also the architecture of memory devices, 
including bipolar PROMs. In a PROM, the AND array is 
fixed and completely decodes the inputs. The OR array is 
programmable and holds the information at each decoded 
location, or the word at every address (Figure 6a). 

The first PLD was developed by Signetics and is called a 
field-programmable logic array, or FPLA. The FPLA is an 
implementation of the general PLD logic combination—an 
AND array followed by an OR array—in which fuses are 
used for interconnections in each array. The FPLA allows 
both arrays to be programmed (Figure 6b) and thus pro- 
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Figure 6. PROM AND-OR array (a)—-the AND array is fixed to decode the address, whereas the OR array 

is programmable for the stored data; field-programmable logic array, or FPLA (b)--both the AND array and 
the OR array are programmable; PAL device (c)—the AND array is programmable whereas the OR array is 
fixed. The PAL device offers much of the flexibility of the FPLA along with the low cost and easy program- 


mability of the PROM. 
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Figure 9. PAL16R4 logic diagram. This device has four registered outputs with a single (synchronous) 


clock. 
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Figure 10. History of the speed im- 
provements and power consumption 
reductions for devices in the Medium 


PAL family. 


New product development 

New product ideas in semiconductor companies come 
from a variety of sources—from customers and those in 
direct contact with them to design engineers who have 
ingenious ideas about how to add features without 
adding cost, power, or timing delays. Since customers 
are usually willing to describe the product that would be 
ideal for them, they are often a good source of ideas. 
Product ideas usually pass through the following steps. 

First, the idea is generated. Typically this is in response 
to a limitation in a current product and is an evolutionary 
step. Revolutionary ideas such as the original PAL 
concept are rare and usually come from approaching 
old problems in a new way. If the idea is a customer’s 
suggestion, it is usually reported to the factory in a 
trip report. 

The idea, whether generated internally or by a cus- 
tomer, is next submitted to a group typically consisting of 
representatives from product planning (including some- 
one from the product area into which the idea falls, 
whose job is to examine such ideas), marketing, design, 
and product engineering. The group discusses the new 
idea and responds to it; it either drops the idea, giving 
reasons for doing so, approves it for further considera- 
tion, or asks for more details. 

If a product idea is approved for further consideration, 
a member of the product planning group canies out 
market surveys and analyses. He contacts customers, 
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often in person, and locates third-party market studies 
and research. He analyzes internal capabilities and seeks 
out information about products that could compete with 
the proposed product. At most companies, the result of 
his efforts is a new product approval document consisting 
of a description of the product and its applications, alter- 
natives to the product, a discussion of the market for the 
product, and a marketing strategy for it. 

If the product is approved by all the representatives in 
the group, it is assigned to design engineering. The 
design engineer is free to use whatever tools he chooses 
within the definition set out for him. This definition in- 
cludes such items as performance requirements but rarely 
specifies the process technology. It is often modified as 
new requirements arise. 

Design engineering is finished with the product only 
after it has been successfully produced on the production 
line. It is then released to product engineering, which 
takes on full responsibility for the product for its lifetime. 
Product engineering is typically involved before this point, 
however, making sure the product is progressing well and 
problems are being eliminated quickly. 

The final step is release to the marketplace. For most 
companies, release to market means sending out data 
sheets and samples to the field once production quan- 
tities have become available. For programmable products 
such as PAL devices, software and programmer support 
must also be in place. 
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Figure 7. Some of the 
first PAL devices 
from Monolithic 

Memories. 


Figure 8. The first 
registered PAL 
devices. 


vides a great deal of flexibility. However, its programming 
circuitry and fuses lead to a large, slow, power-consuming 
device and make programming expensive and difficult. 

In 1977, John Birkner of Monolithic Memories got the 
idea of taking the FPLA architecture and hardwiring the 
OR array, leaving only the AND array programmable (Fig- 
ure 6c). The result was called a PAL circuit, PAL being the 
acronym for programmable array logic. (Monolithic Memo- 
ries has since registered PAL as a trademark describing its 
products.) A PAL device offers higher speed and consumes 
less power than an FPLA. The first PAL devices were de- 
signed so that engineers could program them just like 512 x 
4 PROMs, using phantom fuses to fill in beyond the 512 
fuses in the actual devices. Thus, engineers could program 
them on widely available, inexpensive programmers. 

The first family of PAL devices included both combina- 
torial and registered devices. Combinatorial devices consist 
of just a programmable AND array followed by an OR 
array, with no registers or other macrocells. They are fairly 
simple and provide little more than a varying number of 
inputs and outputs (see Figure 7). Registered devices, on the 
other hand, provide clocked registers for the outputs and 
allow the previous output state to be stored and used for 
determining the next state. These types of devices are much 
more challenging. A register allows the device to be used as 
a state machine, one in which the outputs depend not only 
on the present inputs but also on the previous state. 
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Since a state machine can be implemented in a variety of 
forms, the registered PAL architecture is difficult to define. 
To do so, one must specify the total number of registers, 
whether a register can feed its output back into the logic 
array, the nature of the control of the output enable follow- 
ing the register, and common register features such as preset 
and clear functions. These specifications have a major influ- 
ence on the potential applications of the device. 

Monolithic’s original registered PAL product family, 
whose members were designed to be used as state machines, 
is the Medium 20 family, where Medium describes the logic 
density and 20 is the number of pins. The family consists of 
four devices, all with 16 complemented inputs to the AND 
array and eight outputs from the OR array. They have vary- 
ing numbers of registers—zero, four, six, or eight—which is 
reflected in the part numbers, PALI6L8, PALI6R4, 
PAL16R6, and PALI6R8 (see Figure 8). The 16R4 directly 
implements Mealy state machines, whereas the 16R8 directly 
implements Moore state machines. In the 16L8, 16R4, and 
16R6, combinatorial outputs are enabled by individual 
product terms from the AND array. The product term 
enables allow the combinatorial outputs to function as in- 
puts. These inputs, along with feedback from the registers 
and eight dedicated inputs, provide the 16 inputs to the 
AND array. The logic diagram of the PALI6R4 is shown in 
Figure 9. 

The architecture of this family has proven capable of 
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(Top view) 


Figure 11. 7474 dual D-type flip-flop. Note the two 
separate clocks that allow asynchronous operation of 
the registers. 


handling a remarkable percentage of what customers re- 
quire in programmable logic. The improvements customers 
typically request are higher speed and lower power con- 
sumption. We have made progressive improvements in 
speed, to a 10-ns propagation delay in the Medium 20D 
Series and a 6-ns one in the ECL PAL IC. We have also re- 
duced power consumption without sacrificing speed. The 
20B-4 Series requires one fourth the power of the original 20 
Series yet provides the same speed. With a propagation de- 
lay of 35 ns, the 20B-4 devices have a maximum supply cur- 
rent of 55 mA, compared to 180 mA for the original parts 
at the same speed. CMOS mask-programmed versions re- 
quiring zero standby power are also available. Figure 10 
shows the history of speed and power consumption im- 
provements in the Medium 20 family. 

While we have been making these improvements, we have 
also introduced new families with higher logic densities and 
the 40- and 84-pin MegaPAL families. We have added new 
features to these families, including additional logic capabil- 
ities such as exclusive-OR gates, arithmetic feedback, pro- 
grammable output polarity, product term sharing, register 
bypassing, and register preloading. We made these 
enhancements in response to specific customer needs, and 
they have opened new applications for programmable logic. 

Most of these new features were simply extensions of the 
original PAL concept. The original PAL family was a 
breakthrough, a new alternative offering completely new 
advantages in semicustom logic. In building on this family, 
we concentrated on speed and power improvements and not 
on finding revolutionary new architectures. However, the 
PAL20RA10, which we will discuss shortly, is another 
breakthrough concept. It did not derive from extensions to 
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the original idea but arose as a new response to the needs of 
customers using PAL devices. 


The PAL2Z0RA10 


The PAL20RA 10 was first proposed at an internal 
Monolithic Memories conference in 1981. The original con- 
cept was described as an “‘unstructured PAL”’ having inde- 
pendent programmable clock, reset, and set controls on 
each flip-flop. Until then, all PAL devices had been struc- 
tured—or synchronous—in architecture; all transitions on 
register outputs o¢curred simultaneously in response to the 
rising edge of the clock signal. The proposed ability to con- 
trol the clock independently for each flip-flop would enable 
the engineer to create asynchronous logic within a PAL 
chip. 

The idea was proposed by Harry Hughes, who was then a 
field applications engineer, or FAE, in Great Britain. FAEs 
are the company representatives who have primary contact 
with customers; they have responsibilities ranging from 
helping existing customers with complex design problems to 
introducing Monolithic Memories to potential customers. 
As the primary customer contact, they are quite familiar 
with real user needs. 

The concept of the unstructured PAL circuit had origi- 
nated with several European customers. They had been 
using the Medium 20 PAL Series in large quantities. 
However, the application logic they had been integrating 
had not always fitted well in PAL devices. They had often 
had sections of individual 7474 flip-flops mixed among ran- 
dom logic. A 7474 device (see Figure 11) consists of two in- 
dependent D-type flip-flops, each with its own data input 
and complementary outputs, and clock, preset, and clear. A 
standard PAL16R8 has eight of these D-type flip-flops but 
only one clock and no preset and clear functions. Thus, in- 
tegrating the collection of 7474s and the glue logic had re- 
quired the customer to convert a multiclock design into a 
single-clock one. This had been difficult or impossible to do 
with standard products. 

The customer requested an independent clock, set 
(preset), and reset (clear) for each flip-flop. Since the FAE 
had found other customers requesting similar architectures, 
he proposed the device. He took the 16R8 logic diagram 
and added the four control terms, as shown in Figure 12. 
He described the concept at Monolithic’s FAE conference in 
July of 1981. 

FAE conferences are held to foster technical exchange be- 
tween the factory and the field. Problems are identified and 
ideas are discussed. Now held every six months, these con- 
ferences are critical for planning future product directions. 

After it had been presented at the FAE conference, the 
unstructured PAL circuit was discussed within the factory. 
For an idea to go somewhere, factory discussion must pro- 
duce a business plan for the product that describes it coma 
pletely and estimates the profits that can be expected from 
it. New product ideas must pass several stages of analysis 
and approval. (See ‘‘New product development,”’ at left.) 
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The major problem with the unstructured PAL circuit 
was its inherent lack of testability. The power its architec- 
ture gives the customer also demands that the customer use 
it carefully. For example, the minimum setup time required 
before the register can be clocked must be carefully con- 
sidered when the clock signal is a complex product term. 
Data must be made available to a flip-flop early enough so 
that even the minimum time to activation of the clock satis- 
fies the setup time required. This required delay can be very 
difficult to calculate, especially if the clock term is under the 
control of another asynchronous register. All of the internal 
asynchronous signals must be handled with care to prevent 
invalid combinations. 

These and other testability issues caused great concern 
and led to several unique solutions. Internal test features 
were added, including the ability to test both DC and AC 
characteristics on unprogrammed devices and even the abili- 
ty to test programming characteristics without affecting the 
fuses in the actual array. Two external pins were given dedi- 
cated functions: one is an output enable that allows the 
device to be disabled during board-level testing; the other is 
for register preloading, which allows the user to gain access 
to the flip-flops without going through the fuse array. 

The final, approved product was called the 20FF10 at 
first. It had 20 array inputs and 10 outputs, and enough 
new features to earn it the name ‘‘Feature Freak’’—the 
source of the ‘‘FF’’ in the designation. However, a name 
must uniquely identify a product and must do so in terms 
the customer can recognize. The ‘‘FF”’ for ‘‘feature freak”’ 
was unique but not very meaningful to the customer, so this 
part of the name was changed to RA, for ‘‘registered asyn- 

“chronous,’’ and the new device became the 20RA10. 

The approval document for the device described its archi- 
tecture, which is shown in Figure 13. The clock, set, and 
reset are controlled by product terms, as shown in Hughes’ 
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Figure 12. Harry Hughes’ modification of the PAL16R8 logic diagram. To illustrate his proposal for a new device, 
he added four product terms (heavy lines) to control the enable, set, reset, and clock. 


original sketch of the unstructured PAL circuit (see Figure 
12 again). 

A compromise was made for the output enable. An exter- 
nal output enable is required for most applications, especial- 
ly for disabling the device during board-level testing. But an 
internal (asynchronous) output enable for independent 
enabling of the flip-flops is needed, too. One solution 
would have been an external enable pin for each output, but 
that would have been a highly inefficient use of external 
pins. The solution that was used is an enable signal that is 
the product of the external common enable pin and an in- 
ternal independent product term. Thus, the enable can be 
activated either externally (in common) or internally (asyn- 





Figure 13. The PAL20RA10 output cell. Note the 
separate product terms that control the set, reset, 
enable, and clock. Note also the output enable that 
can be controlled either externally (synchronously) 
or internally (asynchronously). 
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Figure 14. Logic diagram 
7 for the complete 
PAL20RA1O cir- 
cuit—AND array, OR ar- 
ray, and asynchronous 
registered outputs. 
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chronously), satisfying both the design engineer, who wants problem for the PAL20RA10 came somewhat by accident. 
the internal enable, and the test engineer, who wants the ex- The situation of set and reset going active simultaneously 
ternal control. had to be dealt with. We decided that allowing this situation 
Another relatively new feature was implemented in this to result in bypassing of the register provided a beneficial 
circuit—programmable polarity. This allows the output new feature. If both the set and reset product terms are 
polarity to be either active low or active high. The design activated, the register becomes transparent and the output 
engineer can then write logic equations either way, allowing becomes combinatorial. Because of this feature, the logic 
his software to match the sense of an equation to the actual designer can decide at programming time how many 
device. This enables the designer to fit more logic into a registers he wants (from zero to ten) and on which outputs 
single PAL circuit. he would like them. This architectural flexibility is unique 
The problem of how many registered and combinatorial within programmable logic. 
outputs to provide had been solved in previous devices by These features are shown in the logic diagram in Figure 
offering a family of combinations. The solution to this 14, However, though they describe exactly what the user en- 
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counters, they do not actually exist within the device itself. 
The logic diagram is only a logical equivalent to the actual 
layout of the circuit. 

The first thing to note about the actual design is that the 
programmable AND array and following OR array com- 
mon to all PAL circuits are actually implemented as an OR 
array and a following NAND array! This type of logic is 

- required because of the architecture of the circuitry that 
senses the ones and zeroes in the programmable array. Opti- 
mized for bipolar PROMs, this circuitry senses current 
flowing from one or more of the transistors in a column of 
the array. Since it senses current if at least one of the tran- 
sistors is on, the logic is equivalent to an OR gate. Thus, the 
programmable array must be an OR array, as it is ina 
PROM. To create the logic of a PAL device, we placed a 
logical NAND array after the programmable OR array, 
creating an OR-NAND structure, that is, an inverted prod- 
uct of sums, which by Boolean arithmetic is equivalent to a 
sum of products. 

To maintain the flexibility of the 20RA10, we had to 
make various trade-offs. One example is the output drive. 
The original proposal specified an output drive of 32 mA. 
We lowered this to 8 mA to reduce the power requirements. 

Although standard 24-pin products were running as fast 
as 25 ns, the highest speed we could guarantee on the 
PAL20RA10 was 30 ns. This slight decrease in speed was 
due to the additional features of the device—a combina- 
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die photo. 
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torial output signal travels through the polarity control, 
register bypass multiplexer, and output buffer before it 
reaches an output pin. Monolithic Memories has been pro- 
gressively improving its process capabilities, and the part is 
being transferred to a faster process to increase its speed. 

Beyond speed, the fabrication process used for the 
PAL20RA10 created no specific limitations. The process 
was the workhorse of the bipolar efforts at Monolithic 
Memories at the time. It is a standard junction-isolated 
bipolar process, optimized for high speed (see ‘‘Process 
technology,’’ at right). Extensive experience with the process 
results in high manufacturing yields and high reliability and, 
as a result of these, lower costs. In fact, the greatest effect 
of the process on the product is probably in the cost. The 
PAL20RA 10 process is well understood and provides excel- 
lent results for what it costs. 

A die photograph of the 20RA10 is shown in Figure 15. 
The various blocks are outlined. The die had to meet a cus- 
tomer requirement—that it fit into a 24-pin, 300-mil-wide 
SKINNYDIP package. The additional features had in- 
creased the die’s width, yet to fit into this package it had to 
be less than 140 mils wide. Some squeezing of the layout 
during the final stages of design allowed this requirement to 
be met. 

The PAL20RA10 is much more than a die, however. It 
also requires programming and software support. Program- 
ming support is the one critical service Monolithic Memories 
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be downloaded to a programmer in a standard JEDEC 
format for customization of the chip. 

Software support is a critical part of the product because 
it is through software that the customer implements his de- 
sign. A designer wants the conversion from what he knows 
to be the required logic to what a PAL programmer can 
understand to be as simple and easy as possible. PALASM 2 
allows the designer to express his function with Boolean 
equations. It then uses that input to configure output 
polarity, register bypassing, and all the other configuration 
features. 


does not provide, and thus it depends on the major PLD 
programer manufacturers for it. But to be able to support a 
product at market release, the programmer manufacturers 
must be provided with the programming specification for 
the product months before it is released. Monolithic there- 
fore keeps in close contact with these manufacturers. 
Software support is provided by Monolithic Memories 
through its PAL assembler software package, PALASM 2. 
PALASM 2 takes Boolean logic equations, tests them 
against expected outputs, and converts them to a fuse map 
for a PAL device. A description of this fuse map can then 
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Process technology 
The process used for the PAL20RA10 is a junction- 

isolated one. A cross section of the 20RA10 is shown in 

. Figure A. NPN transistors are separated from one an- 

i other by junctions. 


allows the emitter to become the minimum feature size of 
the design, providing a smaller overall die size than the 
die size that could be provided by a standard process. 

A programming fuse is located between an emitter and 


r 





The process is a washed-emitter one. In this process, a 
layer of nitride (Si,N,) is used as a mask. This mask layer 
defines the size of the emitter and the emitter contact and 
allows the emitter and the emitter contact to be the same 
size. In other processes, the emitter is usually made larger 
than the emitter contact to guarantee good contact. 
When formed at the same time, however, they can be 
the same size and still maintain proper contact. This 
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a product line that in turn leads to a collector (Figure B); 
driving a large current through this fuse opens its narrow 
neck and causes it to change from a very low resistance 
to a very high one. The process does not cause the fuse 
to explode, as some claim; it merely severs the greater 
part of the connection. This change opens the link be- 
tween the emitter and the product line. 
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Figure A. Cross section of the 

PAL20RA10—NPN transistors 
are separated from one another 
by junctions. 









Figure B. Location of the pro- 
gramming fuse in the 
PAL20RA10. 
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The new features of the PAL20RA10 required that some 
new syntactical conventions be added to the PALASM 2 
software package. In addition to the standard logic transfer 
functions, the designer has to be able to specify several nov- 
el architectural situations. For example, he has to be able to 
specify register bypassing in either of two ways: implicitly or 
explicitly. He specifies bypassing implicitly by writing a 
combinatorial logic equation using an ‘‘ =’’ sign instead of 
the ‘‘:=”’ sign, which is reserved for registered outputs. He 
specifies bypassing explicitly by activating the set and the 
reset on the flip-flop simultaneously, causing the register to 
be bypassed and the output to be combinatorial. 

The control terms are described by dot operators such as 
“*term.set.’’ We selected dot operators for the same reason 
they are used in popular word processing programs—they 
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rarely occur in natural language and can therefore be given 
special significance. 

Third-party software is available that allows other forms 
of input such as schematic entry. ABEL from Data I/O 
Corporation and CUPL from Assisted Technology both 
provide high-level support for PLDs. 

As we asserted earlier, true customization of the 
PAL20RA10 involves much more than matching the func- 
tion to the customer’s requirements. Speed and power con- 
sumption must be acceptable, design support for the device 
must be available, pricing must be reasonable, product 
availability must be high, and quality must be high. The 
PAL20RA10 has been in full production for well over a 
year, and to the good fortune of Monolithic Memories all of 
these requirements have been met. Moreover, FAEs are ex- 
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cited about teaching customers about a product that 
originated with one of their own members. In fact, the 
20RA10 was the first product proposed by an FAE to go 
into production at Monolithic Memories. Market accep- 
tance has been beyond expectations, and as a result FAEs are 
being brought even further into the product planning process. 
The PAL20RA10 is the first member of a new family of 
devices; new products are in various stages of proposal, de- 
sign, and production. A 20-pin spin-off, the PALI6RA8, is 
now in full production. A higher-speed version is in design. 
A version in a 40-pin DIP was considered, but since 
20RA10 outputs operate independently, the higher density 
and resulting larger package were difficult to justify. Two 
24-pin, 300-mil-wide parts can do more in less board space 
than one 40-pin, 600-mil-wide device. 







ADDRESS 


Figure 16. A 
PAL20RA10 used 
as a universal port 
interface adapter 
for a Motorola 
MC68020 32-bit 
microprocessor. It 
generates logic that 
maintains both 
byte and word ad- 
dressability within 
the 68020’s 32-bit- 
wide word format 
and that makes 
possible the use of 
both high- and 
moderate-speed 
components and 
peripherals in a 
single system. 
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The PAL20RA10—an 
application example 


To demonstrate the capabilities of the PAL20RAIO and 
the PALASM-based design process, we present an applica- 
tion that uses the PAL20RA10 as a universal port interface 
adapter for a Motorola MC68020 microprocessor (Figure 
16). This example should help demonstrate the features we 
described above and point out how easily the 20RA10 can 
be customized to a particular user’s application. 

The design engineer began this design by considering the 
large blocks: the microprocessor, the memory, and the I/O. 
Now, near the end of the design, he needs to place some 
glue logic between the microprocessor and the I/O ports. 
He is open to anything from standard to full-custom logic, 
and he considers each alternative. 

Because he is very unsure of the finality of the design, he 
does not want to commit himself to a major development 
effort. Therefore, he rules out full-custom logic and stan- 
dard cells. He may be able to make a gate array work, but 
he does not have time to work with a foundry in specifying 
the design nor time to wait for the foundry to deliver the 
part. In fact, he would like to have the part immediately. 
Moreover, since his design is not yet proven, he cannot jus- 
tify the high cost of a gate array development effort. 

He could use off-the-shelf ICs easily enough, but he does 
not want to take up that much space on his board. He has a 
limited amount of room with which to work. He could use 
standard LSI devices, but none provide the functions he re- 
quires. Thus, he arrives at the programmable logic device as 
the logical alternative. 

As we said above, in this design the PAL20RA10 func- 
tions as an asynchronous universal port interface adapter 
for a 32-bit microprocessor, the MC68020. Among the 
greatest challenges the engineer faces when designing with 
the new generation of 32-bit, high-performance micropro- 
cessors is maintaining both byte and word addressability 
within the microprocessor’s 32-bit-wide word format and 
accommodating both high-speed and moderate-speed mem- 
ory components and peripherals in a single system. 

The MC68020 has a dynamic bus sizing feature that helps 
solve the first of these two problems—it has internal logic 
that manipulates and directs words smaller than 32 bits. 
However, it needs external logic to handle 8- and 16-bit 
memory arrays and peripherals. It even needs such logic to 
maintain byte addressability of 32-bit memory arrays. This 
external logic must be able to decode the transfer size con- 
trol signals from the MC68020 so it can generate memory 
chip selects and handshake back to the microprocessor with 
the port size acknowledge signal. The PAL20RA10 is well 
suited to this task. 

External logic also helps the MC68020 solve the second of 
the two problems. The 16.7-MHz speed of the MC68020 
version used here translates into 60-ns machine cycles and 
90-ns memory cycles, and therefore the MC68020 can keep 
up with many of today’s fastest memories. But it may have 
to work with slower bulk memories as well. To do this with- 
out slowing down the high-performance memories, the 
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Table 2. Table 3. 
DIP switch settings used in the universal port DIP switch settings used in the universal port 
interface adapter to choose the data bus port interface adapter to choose the number of wait 
size. States. 


Position Data bus Position Number of 
PS1 PSO port size WS1 WSO wait states 


No port present 
8-bit 
16-bit 
32-bit 


be Da Or alld eal 
meoere 
mmeere 
mone 
WN © 


68020 must be able to insert wait states into the memory cy- 
cles of the slower memories. The 20RA10 can help the 
68020 do this by intercepting microprocessor memory access 
requests and, when needed, generating control signals to the 
microprocessor requesting wait states. 

In this design the PAL20RA10 functions as a universal 
port interface adapter. Both blocks of external logic we al- 
luded to in the previous two paragraphs are implemented in 
a single PAL20RAIO. The asynchronous capabilities of this 
PAL circuit are required by this design because the 
MC68020 provides asynchronous control signals that must 
be processed by the PAL device between clock signals. On 
the other hand, the synchronous capabilities of the 20RA10 
are needed for generating memory control signals. An inter- 
nal PAL state machine must be synchronous with the sys- 
tem clock. Therefore, neither a combinatorial (i.e., asyn- 
chronous only) nor registered (i.e., synchronous only) PAL 
device will fill the bill. The PAL20RA10’s unique combina- 
tion of features allows it to provide a single-chip solution. 

The PAL20RA10 used here generates the byte data select 
signals (JUD, UMD, LMD, and LLD) and the current port 
size select signals (DSACKO and DSACK]) and performs 
wait state generation. The designer selects the data bus port 
size and the number of wait states by setting DIP switches 
(Tables 2 and 3). 


not merely extend existing capabilities; it adds novel 

and highly useful features. It is also the first product 
produced by our company that originated with a customer. 
Its technical and market performance has exceeded our ex- Marc Baker is a strategic marketing engineer at Monolithic 
pectations and has proven to us the wisdom of listening to Memories, Inc., where he is responsible for the merchandis- 
customers. As a result, we have devised ways of listening to ing and document literature for programmable products. 
them even better. 32 He holds a BSEE from Stanford. 


T he PAL20RA10 is a breakthrough product. It does 


Vince Coli is Monolithic Memories’ strategic marketing 
manager in the Programmable Products Division. He was 
one of the original authors of MMI’s handbook on pro- 
grammable devices. He holds a MSEE from Santa Clara 
University. 


Questions about this article can be directed to Marc A. 
Baker, Strategic Marketing, Programmable Products Divi- 
sion, Monolithic Memories, Inc., 2175 Mission College 
Blvd., Santa Clara, CA 95054. 
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CMOS Logic Cell Arrays, only recently available from more than one source, are almost 
as useful as small gate arrays, but don’t carry the risk of commiting a design to silicon. 








By Marc A. Baker 
and Chris Jay 


Product Applications 
Monolithic Memories Inc. 
Santa Clara, Calif. 


he rise of advanced CMOS tech- 
nology has dramatically affected 
the PLD market. Although not 
as fast as bipolar devices, CMOS prod- 


the circuit by deselecting its chip se- 
lect input. Large EPROM devices 
may store three or four configura- 
tions, and tying the higher-order ad- 
dresses to DIP switches allows the 
user to select one of a given number 
of logic functions. Thus one card may 
carry a large EPROM and an LCA, 
allowing for some degree of de- 
ferred design. 





The LCA is actually a PLD, but its dense CMOS RAM 
enables designers to configure logic cells within 
the device and to specify I/O blocks. 





ucts are gradually approaching the per- 
formance that was expected of bipolar 
about two years ago. 

The logic designer gains two signifi- 
cant benefits from the latest in 
CMOS: greater densities for more 
complex logic, and reduced power con- 
sumption at low clocking rates. Since 
cutting power consumption means 
smaller power supplies and heat 
sinks, both factors reduce the size of 
electronic equipment. 

Most CMOS parts can be backed up 

by batteries, making them useful for 
RAM-based devices such as the Logic 
Cell Array (LCA). 
Although the LCA is based on a dense 
RAM architecture, the array itself is not a 
conventional RAM-based device. Actual- 
ly, it’s a PLD, but the dense CMOS RAM 
enables the designer to configure logic 
cells within the device and to specify /O 
blocks. The interconnected logic cells, 
called CLBs (configurable logic blocks) 
and IOBs (input/output blocks) allow com- 
plex circuits to be constructed on-chip. 
And the LCA’s battery backup powers 
down to save data. 

Its dense architecture enables 
complex VLSI functions to be pro- 
grammed in the cell array. Also, the 
random glue logic that would other- 
wise ride on system cards is incorpo- 
rated into the array. Because the 
LCA is RAM based, it must be config- 
ured on power-up from a non- -volatile 
source. Configuration data can .be 
read from a CMOS EPROM on a 
byte-wide basis. When the reading is 
done, the EPROM is switched out of 


Dual-layer metal, advanced twin- 
well CMOS has produced a program- 
mable logic device that can extend the 
PLD into the area of low-end gate 
arrays. 

Any custom or semicustom device 
must be tested, and PLDs are no ex- 
ception. But since they are only func- 
tionally configured, functional testing 
offers a high degree of reliability. 
Such testing can be done with the pro- 
grammer, immediately after pro- 
gramming. Production volumes can 
be taken care of simply by using an 
automatic handler. 


An alternative to user program- 
ming is factory programming: Manu- 
facturers can furnish factory pro- 


gramming or can create custom masks 
for hard-wired versions of the pro- 
grammed pattern. 

PLDs also are less expensive than 
other alternatives. Examining chip 
price alone, both mass-produced SSI 
devices and custom parts look very 
good. But other costs, such as inven- 
tory handling and non-recurring engi- 
neering (NRE) charges must be taken 
into account. 

One way to estimate the cost of 
such items as inventory, testing and 
board space is to add about $1 per 
chip. Thus, if five SSI chips cost only 
25 cents each, they actually cost the 
user $1.25 each, or $6.25. Those five 
devices can therefore be economical- 
ly replaced by a $5 PLD. Today, 
many PLDs cost less than the devices 
they'll replace, and prices will con- 
tinue to drop. 

A gate array can replace multiple 
PLDs, but creating a gate array de- 
mands a large NRE charge for, say, 
producing masks and generating test 
programs. The NRE charges for a 
large gate array, including design 
costs, masks, a possible design fix 
and test programs, can be tens of 
thousands of dollars. A PLD has no 
NRE charges. 

With the LCA, testability becomes 
an interesting issue—since the de- 
vice is RAM based, test programs can 
be loaded into it. Once configured for 
testing, test vectors may be applied 
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Gate arrays cost less than PLDs, but their high NRE can make the difference In the long run. 
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to determine if the logic cells and I/O 
blocks are functioning properly. For 
example, when configuring an LCA 
with buried diagnostic shift regis- 
ters, serial diagnostic data may be 
shifted through the system to a seri- 
al output port, where the data would 
be available for analysis. When a 
complete system is configured for di- 
agnostic testing, internal nodes that 
were previously inaccessible become 
available via a serial diagnostic 
scan loop. 


When the LCA serves as part of a 
larger system, it can be initialized 
with buried shift registers perform- 
ing functional tests on the system 
itself. Once testing is complete, the 
LCA can be set up for the required 
function. 


Suppliers 


In 1978, only two suppliers were 
producing PLDs. Now there are more 
than a dozen, with several sources 
for the most popular parts. Higher- 
volume production and second sourc- 
ing alleviate concerns with factory- 
programmed devices, like gate 
arrays. Control of the design is re- 





er PALs are in the works for line 
encoding/decoding, terminal adapter 
interfaces, CRC error correction, 
frame synchronization and other ap- 
plications. 

Since the LCA is reconfigurable, it’s 
possible to modify the logic after de- 
sign. In telecommunications, that 
means that functions can be changed to 
accommodate different communica- 
tions standards and protocols, or both, 
without resorting to separate logic cir- 
cuits for each. 


Tomorrow's Market 


Tomorrow’s market will be driven 
more by users rather than suppliers. 
Applications beyond computers will 
be found, and engineers will push for 
the required technology. Moreover, 
users will drive product definitions. 
Software will be an integral part of 
the design solution and will require 
more adaptation to a user’s design 
environment. An important factor in 
the increasing role of the user in the 
PLD market is the fact that PLDs are 
becoming part of up-front system de- 
sign, not just last-minute replace- 
ments of discrete logic. 

New settings for PLDs in high-speed 


Since the device is RAM based, test programs can be 
loaded into it, and vectors can be applied to see if the 
logic cells and YO blocks are functioning properly. 





turned to the customer. 

In addition, the customer has 
choices not only in silicon but in ser- 
vice and support. Manufacturers have 
varying types of quality control, test 
procedures, reliability verification, 
software and technical support, price 
ranges and delivery capabilities. Asa 
recent survey in EE Times showed, 
these issues are very important. Qual- 
ity control was the primary thing want- 
ed in a supplier, and software support 
was the second most important issue. 


Telecom Applications 


Because CMOS devices are not power 
hungry, they are welcomed in telecom- 
munications. 

Low power consumption in modems, 
fax machines and the like means more 
features and more functions without an 
increase in size. 


Further, in central switching sta- 
tions, units must be kept compact 
because many line interface cards 
are likely to placed in one rack, and 
many racks might share one cabinet. 

The LCA can put its talents to good 
use in telecommunications, includ- 
ing time-division multiplexing, line 
encoding/decoding and corner bend- 
ing circuits. Also, zero-standby-pow- 


processors, superminis and military 
equipment are fueling advances in bi- 
polar speeds. In the past, the fastest 
digital systems could not employ pro- 
grammable logic, but today’s PLDs are 
matching and surpassing their speed- 
iest rivals. 


CMOS technology is being forced 
to meet the demands of telecom- 
munications, as well as adapt to in- 
dustry and instrumentation. Such 
equipment calls for low power and 
immunity to temperature, power and 
noise fluctuations. CMOS promises 
to meet those needs, although most 
CMOS PLDs today simply 


Specific Parts 


Users are also demanding more ap- 
plication-specific PLDs. At first ap- 
pearing to be redundant, applica- 
tion-specific PLDs are devices whose 
architecture is aimed at a particular 
application. Such architectures may 
boast bus interface controllers, sig- 
nal generators and decoders. True, 
these devices are limited to specific 
tasks, but will serve better than 
their generic kin. Suppliers are tak- 
ing risks in pursuing limited mar- 
kets, but the clear winner is the user 
who will soon have devices tailored 
to his needs. 

Recently, there’s been an explosion 
in the number of PLD architectures 
and the amount of support software. 
The first PLDs were programmed by 
manually encoding the individual 
fuses, which then had to be blown. 
Later, software was supplied that 
automatically converted Boolean 
equations into programming pat- 
terns. Now, software can use a num- 
ber of design descriptions (equations, 
schematic capture, truth tables, etc.) 
and can simulate the design and cre- 
ate test vectors in addition to the pro- 
gramming pattern. 

A number of support packages are 
on hand to give the engineer the 
CAD facilities that ensure a success- 
ful design. Most of these packages 
play on an IBM-PC and start with 
assemblers that convert Boolean 
equations into fuse plots suitable for 
programming into the PLD. 

More complex devices require 
greater levels of software support, so 
state machine entry is made possible 
by many PLD languages. PLD assem- 
blers usually need not only Boolean 
literacy from system designers but also a 
detailed understanding of device architec- 
ture. State machine entry does not require 
such a high degree of understanding, so 
for these applications the logic designer 
can often use a descriptive rather than a 
mathematical language. 

User-friendly software tends to make 
an architecture transparent, and state 
machine entry ensures rapid turnaround. 
Small PLD designs can be turned around 





Users are also demanding more application-specific 
PLDs. At first appearing to be redundant, application- 
specific PLDs are devices whose architecture is 
aimed at a particular application. 





match low-power bipolar specs. To- 
morrow’s CMOS PLDs will offer the 
zero standby power, wider operating 
conditions and dc characteristics of 
standard CMOS products. 


in minutes, and more complex devices in 
several hours. 

The software of tomorrow will be 
more responsive to users’ needs, allow- 
ing for a multitude of design descrip- 
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tions while not limiting the implemen- 
tation to a particular device. Along 
with device independence wil] come the 
ability to handle multiple devices, with 
workstations offering the highest level 
of support. 

Programming equipment must also do 
more than just generate waveforms. Ad- 





en by manufacturers, tomorrow’s will 
be guided more by the needs of the 
logic designer. As new applications 
arise, pressure will be put on both 
technology and architecture. Commu- 
nication with the customer will as- 
sume new importance, since it defines 
the right combinations. In addition, 





Just as today’s market is driven by manufacturers, 
tomorrow's will be guided by designers. As needs arise, 
pressure will be on technology and architecture. 





vanced programmers will provide many 
test features, including complete func- 
tional testing. Or, programmers will be 
linked to more sophisticated testers. 

PLD suppliers are increasing the 
quality of their products, while adding 
test features to verify that quality. 
Power-up, reset and register preload- 
ing simplify testing, but the program- 
mer must be able to take advantage of 
them. 

Just as today’s PLD market is driv- 


software from third-party vendors 
will make programmable logic more 
at home across the board. 

PLDs will continue to be used along- 
side discrete logic, fixed LSI functions, 
and gate arrays. But as the market 
matures, the benefits of the PLD over 
other options will become greater. In 
time, programmable logic will continue 
to enjoy a higher growth rate than the 
overall IC industry. a 
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A look at the architectural differences 
between PALs and other programmable logic devices 


PROGRAMMABLE LOGIC devices are 
integrated circuits that hardware designers 
can program to perform specific logic 
functions. Most PLD functions are avail- 
able from many vendors and in several 
technologies with different speed, power, 
and cost options. As with standard 7400 
chips, PLDs are available off your local 
distributor's shelf. PLDs offer one distinct 
advantage over standard 7400 discrete 
logic: They are user-programmable. 

Most PLDs consist of two arrays of 
logic gates—an AND array followed by an 
OR array. The input signals to a PLD must 
first pass through an array of AND gates 
where combinations of the input signals 
are formed. Each group of AND com- 
binations is called a minterm in Boolean 
algebra or a product line in PLD nomen- 
clature. Then the product lines are 
summed in an array of OR gates. The in- 
put buffers generate both the true and 
complement of the input signals. 

Three basic types of AND/OR array- 
based PLDs exist: programmable read- 
only memories (PROMs), programmable 
logic arrays (PLAs), and programmable 
array logic (PAL) devices. The types are 
distinguished by the programmability of 
their arrays, 

In a PROM, the AND array is fixed and 
the OR array is programmable. In a PLA, 
both arrays are programmable. PAL 
devices have a programmable AND array 
and a fixed OR array. I will compare the 
PAL device to the PLA and PROM and 
then examine the architecture of some 
commonly used PAL devices. For a brief 
history of the PAL device, see the text box 





Vincent J. Coli 


“Evolution of PALs” by John Martin 
Birkner on page 208. 


PROMs 

While most people think of PROMs as 
devices for storing fixed programs and 
memory, the PROM is also ideal for logic 
applications requiring less than 10 
inputs—especially when many product 
lines are required. PROMs designed as 
logic devices are usually referred to as 
PLEs (programmable logic elements). 

Figure | shows the PROM'’s fixed- 
AND/programmable-OR arrays. For a 
discussion of notation used to describe 
PLD devices, see the text box “PLD 
Notation Panel” on page 210. Every input 
combination is available in the AND ar- 
ray, whether that combination of inputs is 
required or not. Since the AND array is 
hard-wired, it is not possible to perform 
logic minimization between input com- 
binations. 

The OR array is programmed to select 
the AND gate combinations (or product 
lines). Since every OR gate is connected 
to each product line, outputs may share 
product lines. For those familiar with 
memory design, the fixed-AND array is 
often called the address decoder, while the 
programmable-OR array stores the mem- 
ory bits. Another way of looking at this 
is that PROMs store the logic transfer 
function as a lookup table in memory. 

The advantage of PROMs is that every 
input combination can be decoded. The 
disadvantage is that the number of input 
pins available is restricted because the ar- 
ray size must be doubled for each addi- 


tional input. The arithmetic works like 
this: A PROM with n inputs and #7 out- 
puts requires an OR array of 2” lines deep 
by m lines wide. For example, 2 PROM 
with 10 inputs and 8 outputs requires an 
OR array of 2'° by 8 or 8192 fuse loca- 
tions. An Ilth input would require that the 
array size be doubled to 16,384. Cost and 
performance constraints limit PROMs to 
13 inputs and 8 outputs. PROMs designed 
specifically for logic applications feature 
either 5 or 6 inputs and [6 outputs. 


PLAs 
The PLA structure offers the highest level 
of flexibility because both arrays are pro- 
grammable. Figure 2 shows the PLA’s 
programmable-AND/programmable-OR 
structure. Because their OR arrays are 
programmable, PLAs, like PROMs, can 
share product terms among outputs. For 
example, one product line would be saved 
if two outputs required the same input 
combination (i.e., product line), 
Programmability in the AND array 
removes the restriction found in PROMs 
that the AND array must be large enough 
to provide all possible input combinations. 
This works because, statistically, only a 
continued 


Vincent J. Coli is a strategic marketing 
manager for Monolithic Memories Inc. 
(2175 Mission College Blvd., Santa Clara, 
CA 95054). He has worked with the PAL 
products for the past six years with MMI. 
Vincent holds a B.S. in chemical engi- 
neering and an M.S. in electrical engi- 
neering. 


Reprinted with permission from the January 1987 issue of BYTE magazine. 
Copyright © by McGraw-Hill, Inc., New York 10020. All rights reserved. 
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The Evolution of PALs 


omputers used to be constructed 

from SSI, MSI, PROM, and RAM 
chips connected in jigsaw-puzzle fashion 
on many printed circuit boards plugged 
into a connector backplane. The com- 
puter designer’s task was to build a func- 
tional unit such as a processor, disk con- 
troller, I/O controller, or memory board. 
If the design overflowed onto another 
board, connectors and ribbon cables had 
to be added. This made the design more 
expensive and sometimes risky due to 
noise coupling. The name of the game 
was to get it all on one board. 


Mixing and Matching TTL Chips 
Designers who had studied switching 
theory, information theory, Boolean 
algebra, and Quine-McCluskey mini- 
mization at college soon found that their 
textbooks would not be of much use. 
They learned that the practical art of 
computer design did not consist of op- 
timizing an architecture with an or- 
thogonal instruction set. It consisted of 
mixing and matching the collage of exist- 
ing TTL chips onto a single board until 
an approximation of the design goal was 
reached. They did not design state- 
control sequence logic from top-down 
state-graph theory, but rather, slapped 
down a 74174 hexadecimal register and 
some 7400 NAND gates. Control-logic 
design theories usually consisted of 
following signal lines around the logic 
schematic until, through superhuman 
powers of concentration, designers 
achieved clarity. 

Designers found the information they 
needed in the catalogs of young semi- 
conductor companies in California, 
Arizona, and Texas. A favorite was The 
TTL Databook by Texas Instruments. 
Most logic designers believed that 
74-series TTL parts found in this book 
would be second-sourced and could be 
“designed in.” 

A processor design would begin with 
the block diagram consisting of an ALU. 
data path and register file, microprogram 
memory and sequencer, and then a small 
and obscure block called “control logic.” 
It might have been a small block on the 
diagram, but the control logic usually 
represented the majority of the chip 
count. 

The control logic consisted of SSI/MSI 
gates and flip-flops connected together in 
random fashion, and there seemed to be 
no way to reduce it. The control logic 
also represented the area of highest 
design errors and was easily recognizable 
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on the printed circuit board as the area 
with all the ‘‘cuts and jumpers.” The 
engineering change notice (ECN) was the 
standard remedy for such errors and was 
a constant source of agony between 
manufacturing and engineering. Manu- 
facturing would use yellow wires to stand 
out on the green PC board. Engineering 
would use green jumper wires to camou- 
flage embarrassing mistakes. 

The engineering manager would “‘pilot 
release” the current revision PC board 
as soon as the green wire count was low 
enough to pacify manufacturing. The 
design engineers would then flee to the 
next design, where they were expected 
to cram even more functions onto the 
single PC board to beat the competition's 
new threat. 

I was convinced that there must be a 
better way to build computers. So, in 
1975, I packed my bags and headed for 
Silicon Valley. I remember seeing the first 
single-chip microprocessor systems on 
the market. They had one microcomputer 
chip surrounded by a sea of over 100 
SSI/MSI chips. The new LSI chips 
needed either some good planning so that 
they could talk to each other or some 
good “glue chips” to hook them up. 

The PROM, pioneered by Harris and 
Monolithic Memories, showed some 
promise as a universal and general- 
purpose glue logic element. Applications 
like memory-address decoding began 
showing up for the 32-word by 8-bit 
PROM. National Semiconductor pio- 
neered the programmable logic array 
(PLA) ina 14-in, 8-out. 96-product-term, 
24-pin fat (0.6 inch wide) DIP, bench- 
marked for 96-character EBCDIC-to- 
ASCII conversion. 

Intersil made a field programmable 
logic array, or FPLA, in the National pin- 
out, but with about half the product terms 
at 48. Signetics increased the package 
pins to 28, making the 16-in, 8-out. 
48-product-term 825100 FPLA. These 
first attempts at providing the computer 
designer with LSI glue were met with 
mild enthusiasm. The new glue chips 
were too big (fat DIPs) and were slow. 
expensive, and hard to use. 

Monolithic Memories was the first 
company to take advantage of the bipolar 
fuse-link PROM technology to make 
some fast little FPLAs, as we first called 
them. We put them in industry-standard 
20-pin skinny (0.3 inch) DIPs, for 
minimum PC board area. We also re- 
duced the two programmable arrays down 
to one for 35-ns high-speed operation and 


lower cost. We mimicked the TTL data- 
sheet specs down to the same ter- 
minology, graphics, and printing style to 
make the computer design engineer 
secure in replacing old 74-series TTL 
chips. We added programmable three- 
state output enable for I/O pin allocation. 
We added output registers with feedback 
for direct implementation of  state- 
machine control logic from state graphs. 

We designed the programming algo- 
rithm to be compatible with existing 
PROM programmers, making low-cost 
programming possible. The first PAL 
programming module had a PAL in it. 
This presented a chicken-and-egg prob- 
lem that we solved by emulating with 
some PROM and SSI chips. The first 
PAL to be programmed was, of course, 
the pattern for the PAL programmer 
module. I headed the project. specified 
the design, and sold the customers. H. T. 
Chua provided a clever and ingenious cir- 
cuit design. 


New Design Methodology 
The new chips required a new design 
methodology. Actually, it was the same 


_method that we learned in school, so we 


had to drag out our old textbooks and 
relearn Boolean logic and top-down state- 
machine design. We showed the designer 
how to “design your own chip” using 
Boolean logic equations. We wrote the 
first silicon compiler, PALASM (PAL 
Assembler), and published the FOR- 
TRAN source in the PAL Handbook 
(available from McGraw-Hill), along 
with numerous design examples. 

The PAL chips replaced SSI/MSI chips 
at a chip-count reduction of 5 to t. Data 
General gambled on the new single- 
sourced chips by designing them into the 
MV8000 computer (see The Soul of a 
New Machine by Tracy Kidder). 

Apple put six PAL chips in the Macin- 
tosh. Soon the PAL chips were no longer 
single-sourced, as National Semiconduc- 
tor, Texas Instruments, Advanced Micro 
Devices, and others joined in licensing 
the now-patented PAL chips from MMI. 
Now you can find these chips every- 
where. Look at the PC expansion boards 
in this magazine; you can recognize PAL 
chips by their easy-to-read part-number 
system (e.g., PALI6L8 and PALI6R8). 


John Martin Birkner is president of Struc- 
tured Design Inc. and coinventor of the 
programmable array logic (PAL) device. 
He can be contacted at 988 Bryant Wavy, 
Sunnyvale, CA 94087. 
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PLD Notation Panel 


PROGRAMMED (OPEN) 
FUSE LINK 


UNPROGRAMMED (INTACT) 
FUSE LINK 


TTL AND GATE SYMBOL 


ABC 


PLO REPRESENTATION 





i 
Figure A: Differences in the logic 
notation for a TTL AND gate and a 
PLD AND gate. 


Figure B: The partially programmed 
product line to implement A*B. 
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Figure C: Portion of the PLD array programmed to implement A*B. 
Having all fuses intact in the second product line causes a logic zero to be 
input to the second AND gate, which does not contribute to the sum at the 
OR gate. 
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Figure D: The PAL device implementation of the function /Output = A*B 
+ /A*B. The standard combinatorial logic diagram of the function /Output 
= A/B + /A*B is shown in the inset to figure D above. 
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ecause PLD structures are much 

different from ordinary TTL gates, 
new logic notations were developed for 
them. Figure A shows the logic 
convention adopted for a three-input 
AND gate. The PLD representation for 
an AND gate is called a ‘product line.” 
Note that the three vertical lines are the 
inputs (A, B, and C), which are con- 
nected to the AND gate inputs through 
fuse links. An unprogrammed (or closed) 
fuse link is represented by an X at the 
intersection of an input line with a 
product line. If you wanted to disconnect 
one of those inputs from AND gate C, 
for example, you would remove the 
appropriate X from the point of inter- 
section for the C input line with the 
product line to signify a programmed (or 
open) fuse link. This product line, which 
now implements the A*B function. is 
shown in figure B. 

Since every input is available to every 
product line in a PLD, it is convenient 
to show the input lines as long lines 
running vertically through the array. 
Also, two input lines are associated with 
each input pin because both input 
polarities are available ina PLD. There- 
fore, the input buffer is shown with both 
a noninverted (true) and inverted (com- 
plement) output path; each path is hard- 
wire connected (shown as a dot) to an 
input line. 

Figure C shows a portion of a PLD 
array illustrating the input lines and 
buffers. Notice that an OR gate is added 
to the structure. All the fuse links in the 
lower product line are left intact, leaving 
the product line in a logic low (since true 
inputs are ANDed with complements), 
while appropriate fuse links in the upper 
product line are programmed to imple- 
ment the A*B function from figure B. 

It is common to implement two or 
more levels of logic gates such as an 
AND/OR/invert circuit in a PLD. For 
example, consider the following function 
implemented in a PAL device: 


/Output = A‘/B + /A*B. 


Shown in figure D are the standard com- 
binatorial logic diagram (see inset) and 
the PAL logic equivalent for this 
function. 

Notice the details added to figure D 
that magnify the programmed fuse link 
for B in the upper product line. This 
magnification details each fuse link and 
its associated diode for a bipolar PAL 
device. A CMOS PAL device is similar, 
except an ultraviolet cell would substitute 
for the fuse link. 
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limited number of product terms is re- 
quired in any equation. Eliminating 
redundant combinations with logic 
minimization techniques, such as Kar- 
naugh maps, can reduce the required 
number of product terms even more. 
Theretore, almost any combination of in- 
puts can be decoded in a PLA. 

PLAs were the first products offered 
specifically for logic applications. Due to 
programming limitations, early PLAs 
were available only in mask-programmed 
versions. Just like on a ROM, a logic 
designer would indicate on the vendor's 
PLA AND/OR logic map where the de- 
sired connections were to be made. The 
vendor would then tool up a custom metal 
mask for the PLA to implement the 
customer's logic. Today, most PLAs are 
user-programmable. However, mask-pro- 
grammed PLA structures are used often 
in the control section of LSI/VLSI stan- 
dard logic chips. such as microprocessors, 
and offered in standard-cell libraries. 

In the world of engineering, there are 
always compromises. The facts reveal that 
a performance and silicon-die size penalty 
must be realized to provide the flexibility 
of programming both arrays. PAL devices 
are generally 5 to 10 nanoseconds faster 
than PLAs at the same power level and 
save the silicon area required to program 
and verify the second array. It turns out 
that the flexibility of a programmable-OR 
array is not required for most PLD ap- 
plications, but it can be useful for com- 
plex state-machine and sequencer applica- 
tions. 

Because of the long history of PLAs, 
their nomenclature can be a little confus- 
ing. Early vendors of user-programmable 
PLAs called their products FPLAs to 
highlight their “field programmability” 
and to distinguish FPLAs from factory 
mask-programmed PLAs. Just as ROMs 
and PROMs could be easily distinguished, 
so could PLAs and FPLAs. However, 
since most of the PLAs offered today are 
programmed by the customer, many ven- 
dors have dropped the F prefix and sim- 
ply call them PLAs. Furthermore, PLAs 
designed for sequencer applications are 
called PLSs (programmable logic se- 
quencers). 


PAL Devices 
Figure 3 shows the programmable- 
AND/fixed-OR array structure of a PAL. 
As with the PLA, having the AND array 
programmable lets the user program only 
the desired input combinations. But fix- 
ing the OR array requires that certain 
product lines be tied to specific outputs— 
typically, eight product lines per output. 
Many people use PAL and PLD synony- 
mously. Several PLD vendors add an E 
prefix to PLD, to come up with EPLD, 
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which signifies ultraviolet erasable PLDs. 
Just as there are PROMs and EPROMs, 
now there are PLDs and EPLDs. 

The name HAL (hard array logic), 
refers to mask-programmed, or ROM, 
versions of PAL devices. If the volume of 
devices needed were large, converting a 
design to a HAL might be appropriate 
once the design is thoroughly debugged 
with the PAL. 

While all PAL devices are characterized 
by a programmable-AND/fixed-OR array 
structure, there is a whole line of PAL 
devices with different options. They come 
with varying numbers of inputs and out- 
puts. They might have feedback paths 
from the output back to the array. Some 
of these pins can be programmable I/O 


n INPUTS 


an 
PRODUCT 
LINES 


ANDO ARRAY (FIXED) 








pins. They can have active-high or active- 
low outputs, or the output polarity might 
be programmable via an XOR gate anda 
fuse. Some come with registers at their 
outputs and are good for making sequen- 
tial circuits. Let's look at two commonly 
used PAL devices: the 16L8 and the I6R8. 


The PALI6L8 
One popular combinatorial PAL is the 
PALI6L8 (figure 4). Notice how the pins 
on the left side and bottom of the logic 
diagram (pins | to 9 and pin 11) are used 
for inputs and the pins on the right (pins 
12 to 19) are available as outputs. Pins 12 
and 19 can be used only as outputs, but 
six of the outputs (pins 13 to 18) are also 
continued 
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Figure 1: A simplified diagram of the fixed-AND/programmable-OR array 
structure of the PROM. Figures in this article are reprinted from Monolithic 
Memories’ PAL Handbook (3rd ed.) with permission from Monolithic Memories. 
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Figure 2: A simplified diagram of a PLA showing that both arrays are 


programmable. 


available as inputs via the feedback line 
connection after the inverting output buf- 
fer. This feature, called programmable 
I/O, lets the user program each of these 
six pins to be either an input or output. 
I'll discuss programmable I/O in more 
detail later on. Now the PALI6L8 part- 
numbering scheme should be a little more 
obvious; 16 signifies the maximum 
number of potential inputs (10 dedicated 
inputs and 6 programmable I/O), while 8 
signifies the number of outputs and L 
signifies the output type, which is active 
low for this PAL part type. 

Refer to the logic diagram in figure 4 
and you'll see that the vertical lines run- 
ning through the array, numbered 0 
through 31, are the input lines. Notice that 
each input or I/O pin is associated with 


two input lines; one input line is connected 
to the true (or noninverted) sense of the 
input buffer, while the other input line is 
connected to the complement (or inverted) 
sense. This allows availability of both 
input-signal polarities to the array. 

The horizontal lines running through 
the array, numbered 0 through 63, are the 
product lines. You can think of each of 
these product lines as an AND gate with 
32 inputs, which corresponds to the total 
number of input lines. Actually, both the 
true and complement of every input signal 
are connected via fuses to each product 
line before the device is programmed. 
This is the programmable-AND array in 
the PAL structure. To program the array, 
the user selects different combinations of 

continued 
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Figure 3: A simplified diagram showing the programmable-AND/fixed-OR array 


structure of a PAL device. 


input signals by disconnecting, via the 
blown fuse, the unwanted input signals in 
a product line. In total, 2048 fuses are 
available in this PAL device (64 product 
lines by 32 input lines). 

Notice that each output pin has eight 
product lines associated with it. The lower 
seven product lines of each group are 
summed at the OR gate, while the upper 
product line is connected to the inverting 
output buffer. The lower seven product 
lines and the OR gate provide the sum- 
of-products logic power for the PAL 
device. The OR gate determines whether 
any of the product lines are active, or true, 
and then the output buffer inverts the 
signal from the OR gate for output. Note 
that a product line with all fuses left in- 
tact will not affect the sum at the OR gate, 


since the logical result of each input 
ANDed with its complement is false. 


Programmable I/O 
This upper preduct line associated with 
each output controls the three-state logic 
in the output buffer. When this product 
line is active, or true, the output is enabled 
and the sum-of-products logic determines 
the output state. However, when this prod- 
uct line is inactive, or false, the output is 
disabled with the three-state buffer in the 
high-impedance state. This lets the out- 
put pin drive a three-state bus just like a 
74S240 octal buffer. Furthermore, since 
most PAL devices feature an output drive 
capability of 24 milliamperes, they are 
quite handy for bus interfacing. 
continued 


at Monolithic hart Memorles at 














2-707 





Introduction to Programmable Array Logic 








1 INPUT LINES (0-31) 
0123 456 7 8 91013 12131415 16171819 20212223 24252627 28293031 


See eee PIN 
eee oe See eee 


NUMBERS 
———e 
At Segawa 
+] +4 Hitt Att ty 

= aes ss 
B SSH RSCTTHREE-STATE PRODUCT LINE 


tt +f tt ft ttt 7- WIDE OR GATE 
oe es 







NOU BwresO 











8 PRODUCT 
LINES. NOTE: 
LOWER 7 

GO TO 

OR GATE 


18 





=se aes —— Nuinvertine 
: tet cmitimiiiminimalimiiimiil ee 
Pama EE LEB Lec eles ee eee < \ BUFFER 
ee [St etimiE Ris LE ee FEEDBACK PATH 


ere) 
PEWN- OOM 


BUFFER 


Seas sa a — Pee 
| sie clic io 


1? 





RAR ewe 
WHHOVM UD 





i) 
t 
ik 
{ 
t 
| 
| 
1 
| INPUT 
{ 
| 
| 
t 
| 
| 


~>o 16 





WEMLRANN 
HKHOWOVANea 


. imiimitin sia 
pawn Coes ee 





15 





WWwWwUwwuw i 
ORNINUAW A 


oy Po 


PRODUCT LINES (0-63) 


crc 


14 









pePerand 
NOU EwNHO 


jiimisimisi minima] 
Pee 


Soa 





13 





AMO rca: 
Amr: 


imiimiiniiin imi ae _ fe 
ee > eae its ae 
| CCT eT nn 


12 


PA DOU nF 
Gn Lowe Go 





; Emi MIiIMILAMTLIMLTiMiE a — r 
Pe Se ee) cee 


or.23 4567 $8 9101] 12131415 16171839 20212223 24252627 2829303) 
NOTE: FUSE LINKS 
NOT SHOWN 





Figure 4: The actual logic diagram of the combinatorial PALI6L8. The fuse links are not shown so that a logic designer 
may place an X at those points where a fuse should be left intact. 
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The three-state product line, along with 
the feedback path on six of the outputs, 
makes the programmable I/O feature 
work. The pin is an input to the AND ar- 
ray when all the fuses in the three-state 


enable product line are left intact, while 
the pin is an output when all the fuses are 
programmed. Note that a product line will 
always be true, regardless of input com- 
binations, when all fuses are pro- 





grammed. The programmable I/O feature 
lets the user allocate pins for input or out- 
put as required by the application. 

An even higher level of flexibility is 
possible if you let the logic in the product 
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Figure 5: The actual logic diagram of the PALI6R8. Notice the D-type flip-flops at the outputs. 
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line determine the pin’s direction. This is 
done by programming a condition in the 
product line for which the pin will be an 
output. You can use this feature to allocate 
available pins for I/O functions or to pro- 
vide bidirectional transfer for operations 
such as shifting and rotating data. 

The PALI6L8 is used in applications 
such as complex decoders, encoders, 
multiplexers, comparators, and replace- 
ment of SSI/MSI random logic. Another 
way of viewing this is that the PALI6L8 
programmable AND array contains 2048 
fuses. You can program these fuses to 
create almost any configuration of up to 
250 AND, OR, and inverter gates, which 
is roughly 250 equivalent gates. 


PALs with Registered Outputs 

and Feedback 

The structure of registered PALs is similar 
to that of the PALI6L8 except for the ad- 
dition of the registered outputs. In the 
PALI6R8, each of the eight registers is ac- 
tually a D (data) flip-flop that is clocked 
on the rising edge (see figure 5). The 
clock signal (pin 1) is shared by all eight 
flip-flops. Each OR gate sums eight prod- 
uct lines and is the D input to the flip-flop. 
The Q output from the flip-flop is avail- 
able both for feedback into the PAL array 
and for output from the device. Either 
polarity of the feedback signal is available. 


This feedback lets the PAL device 
“remember” the previous state, and it can 
alter its function based upon that state. 
Thus, registered PAL devices are ideal for 
implementing single-chip state sequencers 
and state machines. 


Conclusion 
PROMs are limited in the number of in- 
puts they can handle, since every input 
combination is made available. They are 
useful for applications that require a large 
number of product terms but few inputs. 
PLAs are the most flexible of the 
AND/OR array PLDs with both arrays 
programmable. This flexibility makes 
them slower, since the signal has to prop- 
agate through two programmable arrays. 
PAL devices with their programmable- 
AND/fixed-OR array structure can ac- 
commodate more inputs than PROMs 
because, statistically, not every input com- 
bination is required. With only one array 
programmable, they are faster than PLAs. 
PAL devices with registered outputs are 
particularly useful for building sequential 
circuits. PAL devices also can provide 
feedback, altering the function of a given 
state based on the condition of the im- 
mediately prior state. The overriding ad- 
vantage of all PLDs, however, is the in- 
tegration of multiple functions onto a 
single programmed circuit to save board 
space and reduce chip count and cost. @ 
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Supermini Design 





Becoming the technology of choice in superminicomputer 
Jesign, programmable array logic mixes with alternative 
devices to meet design constraints. 








by Bradford S. Kitson and 
B. Joshua Rosen 


Programmable array logic devices have been a 
driving force behind the latest generation of 32-bit 
superminicomputer designs. These supermini- 
computers range from redesigns of existing archi- 
tectures that reduce cost or overcome packaging 
limitations, to designs requiring ultrafast turn- 
around, to high performance architectures speci- 
fied to take full advantage of such devices. 
Superminicomputers designed with program- 
mable array logic (PAL®) include the VAX@11/730 
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from Digital Equipment Corp, the Mv/8000 and 
Mv/10000 from Data General, and Computervision’s 
APU “(analytic processing unit). A reimplementa- 
tion of the original VAX-11/780, the VAX-11/730 sup- 
piies 25% of the performance in 10% of the board 
space. Programmable array logic was chosen by 
the MV/8000 designers to allow the shortest possible 
design cycle and to catch up with their competitors. 
The Mv/10000 upgrades performance of the 
MV/8000. The Computervision APU was designed 
for high performance with PAL devices in mind, 
and provides excellent examples of how to use such 
devices to their fullest. 


Logic Design Alternatives 

Although by no means the only option, pro- 
grammable array logic has become the technology 
of choice in superminicomputer design. Logic 
design alternatives (Fig 1) include standard prod- 
ucts (fixed-function devices), semi-custom 
(programmable logic, gate arrays, and standard 
cells), and fully-custom logic devices. In supermini- 
computer design, the primary alternatives are stan- 
dard products, gate arrays, and PAL devices. The 
best choice for any given function depends upon 
the design alternative capabilities, the design con- 
straints, and the function actually being 
implemented. 


Reprinted with permission of COMPUTER DESIGN 
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DIGITAL 
LOGIC FAMILIES 
SEMICUSTOM 
PROGRAMMABLE GATE 
LOGIC ARRAYS 


Fig 1 Basic categories of digital logic present designers with 
trade-off opportunities. Standard products fit where cost is a 
major concern; custom and semi-custom products can be used 
where high diversity is desirable. 


STANDARD 
CELLS 








Via fuse programming, a PAL device allows the 
designer to construct a custom device or group of 
devices that precisely implement a desired func- 
tion. In contrast, fixed-function transistor- 
transistor logic (TTL) small scale integration/ 
medium scale integration (SSI/MSI) alternatives 
seldom seem to fit any application in the desired 
way. Thus, the SSI/MSI designer usually pays 
penalties via extra logic levels in the critical path 
and an increased package count. Gate arrays allow 
custom devices to be created via mask program- 
ming. However, designers using a gate array must 
finalize architecture early in the design cycle. Any 
errors will require a mask change, which can take 
months. 

In most cases, the best choice is a combination of 
the alternatives and probably includes some 
memory as well. Typically, the six basic design con- 
straints are performance (speed), cost, density 
(packaging), power dissipation, reliability, and 
design turnaround. Assigning a priority to these 
constraints will usually define the logic alternative 
that a machine is based on. Actual implementation 
trade-offs are made at the function level. The three 
basic functional portions of a design are data path, 
control path, and interface. 


Standard Products Have Their Place 


Standard products are defined as devices created 
for a wide market. Examples of standard products 
are TTL SSI/MSI, fixed instruction set metal oxide 
semiconductor (MOS) microprocessors, and micro- 
programmable large scale integration (LSI) building 
blocks. These devices are usually multiple sourced 
and produced in high volume, resulting in lower 
individual device costs. In a design where cost is the 
main concern, and performance, power dissipation, 
density, and design turnaround are of little or no 
importance, standard products are probably the 
best choice. In a design such as a superminicom- 








puter, where these other considerations have a high 
priority, inherent disadvantages limit standard 
product use. 

While SSI/MSI devices offer fast individual gates, 
on a system level their density, power dissipation, 
and reliability characteristics are not as good as 
those of the alternatives. In addition, design turn- 
around characteristics are inadequate because 
changes usually require printed circuit (PC) boards 
to be laid out again. In most superminicomputers, 
therefore, SSI/MSI gates are used only in selected 
critical path functions that require one or two gate 
levels (at the expense of density and power dissipa- 
tion), and in interface applications, such as bus 
buffers, latches, registers, and transceivers. 

Standard LSI products (mostly bipolar) offer 
exceptional performance, power dissipation, density, 
and reliability characteristics, but their inflexible 
architectures limit their application range. Super- 
minicomputer designers rely on proprietary, highly 
complex architectures to differentiate their designs 
from those of their competitors, and LSI imposes 
an architecture. Therefore, applications are limited 
to general purpose, well-defined functions in the 
data path, such as parallel multipliers and arith- 
metic logic units (ALUs) that can benefit from their 
high performance characteristics. 


Gate Arrays Are “Cast in Concrete”’ 


Semi-custom gate arrays are defined by inte- 
grated circuit (IC) manufacturers as large arrays of 
unconnected gates. End users specify how gates are 
interconnected with actual interconnection occurring 
at the metal-mask layer of the IC process. The main 
advantages of gate arrays are high density and the 
ability to customize a design, while primary dis- 
advantages are cost and design turnaround. Each 
custom device is single sourced and low volume; 
therefore they are not cost-effective unless the 
application is density limited or the volume is very 
high. Gate arrays can adversely affect design turn- 
around because the system designer must design 
both the Ic and the system in which the IC is used. 
In addition, any change in the gate array requires 
new masks and a delay for each mask iteration. 

By using gate arrays only where the design can be 
defined early, designers minimize these turnaround 
disadvantages. They then hedge their bets by sur- 
rounding the gate arrays with logic that can correct 
any design bug(s) discovered later on. As in LSI, 
what is ‘‘cast in concrete’’ is usually the data path. 
However, gate arrays allow more of the data path 
to be integrated because the device can be optimized 
for specific design requirements. A proprietary 
16-bit ALU slice might be a typical gate array. 

Gate arrays are also used to interface multiple 
onboard buses together as data path ‘‘glue.’’ 
Control-path and interface applications, however, 
tend to be too likely to change. Therefore, control 
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path functions are based on implementation tech- 
niques such as writable-control-store (WCS). Inter- 
face applications frequently require changes 
because of the need to interface one designer’s 
board to another’s. Consider the critical timing 
between a cache, an instruction fetching unit, and 
multiple register-files. The exception handling 
capability required for typical operations, such as a 
cache miss, can be an indeterminant problem 
affecting all of the above-mentioned functional 
units in the system. Should a bug occur in the cache 
unit, the interface section of all other units will have 
to be changed to accommodate the correction. 


PAL Structure Paves the Data Path 

Combining simplicity and flexibility, the basic 
PAL Structure is a fuse-programmable AND gate 
array that drives fixed connection OR gates, allow- 
ing logic to be implemented in sum-of-products 
(AND-OR) Boolean form. 

By selectively blowing the appropriate fuses, a 
PAL device can implement any logic function as 
long as the number of inputs or AND gates required 
does not exceed the number provided in the device 


chosen. In the AMPAL16R4, for example, the true 
and complement version of each of the 16 inputs is 
connected via fuses to each of the 64 AND gates in 
the device (Fig 2). PAL devices provide additional 
features, such as programmable input/output (1/0) 
pins and registered outputs with internal feedback 
that further enhance their ability to implement 
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logic functions efficiently. Programmable 1/Os are. 


especially useful for trading off the number of 
device outputs for inputs to fit the exact number 
required by the logic functions being implemented. 
Internal registered feedback is desirable when 
implementing complex state machine designs. 
Programmable logic devices, like gate array, are 
semi-custom devices. Created by the IC manufac- 
turer, they are alterable by fuse programming for a 
specific application. Designed specifically for logic- 
oriented applications, PAL devices enable designers 
to create custom devices with fast turnaround time. 
PAL devices compare favorably to alternative 
devices in terms of performance, cost, density, 
power dissipation, and reliability. They fall behind 
gate arrays and LSI in density and power dissipa- 
tion, and behind SSI/MSI in individual device cost. 


>I INPUT/OUTPUT 


INPUT/OUTPUT 


Fig 2. Asa fuse programmable AND 
gate array that drives fixed OR gates, 
programmable array logic im- 

plements logic 
Boolean forms. 


in sum-of-product 


SSI INPUT/OUTPUT 
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(Note however, that PAL devices are cheaper on a 
system basis.) 

Performance and density characteristics of PAL 
devices have led to significant applications in the 
data paths of superminis where they are used along 
with LSI and/or gate arrays. These devices serve to 
‘“‘glue’’ the LSI and the ‘‘cast in concrete’ gate 
arrays into the system. If necessary, the design turn 
around capability that they display can be used to 
optimize the data path architecture and, in some 
cases, fix a bug in a gate array by reprogramming the 
PAL devices around it. Typical data path functions 
for the devices include barrel shifters, masking, 
code conversion, and multiple bus interface. 

Design turnaround becomes especially beneficial 
in the control path and interface portions of a 
design. Most control path functions are highly ran- 
dom and are prone to change and/or error While 
WCS allows design changes to be made by rewriting 
microcode, PAL devices permit changes that cannot 
be made in microcode, or would adversely affect 
system performance. For instance, one supermini- 
computer manufacturer has established the rule 
that a gate array can be used in the control path 
only if eight or more PAL devices are necessary for 
the same function. In interface design, from a den- 
sity standpoint, PAL devices allow the interface to 
merge with the data path ‘‘glue’’ function. Since 
the interface is just as likely to change as the 


PARTICAL 
PRODUCT 


REGISTER GENERATORS 


MULTIPLIER 
ACCUMULATOR 


RESULT 





Fig 3 Multiplier calculates 56-bit x 56-bit product. Partial 
product generation logic uses seven 8 x 8 slices to form 8 x 
56-bit multiplication array. Carry-save adders implement 
3-to-2 counting techniques to form two operands that are 
summed in a lookahead ALu. 








control path, but without the benefit of WCS, design 
turnaround becomes a factor. If one board's 
designer needs to change the interface, the designs 
of many other boards are impacted. Updates on all 
boards can be easily made by reprogramming one 
or more PAL devices. These devices can also pro- 
vide the drive capability required by this appli- 
cation area. 


Designing with PAL Devices in Mind 


Trade-offs among PAL devices and other digital 
logic alternatives were key factors in Computervi- 
sion’s APU design. In conjunction with standard 
products, such as ALUs, LSI multipliers, and 
memory devices, PAL devices were used to create a 
patented architecture not feasible in standard TTL 
SSI/MSI. 

Designed as a very high speed 32-bit supermini- 
computer for enginecring applications, the Com- 
putervision APU is twice as fast as competitive 
designs, yet occupies the same board space. Its 
designers, instead of merely replacing TTL SSI/MSI 
with PAL devices, used PAL as customizable logic 
building blocks. This allowed them to implement 
powerful logic functions in a minimum of space. 

The APU processor board set is divided into four 
modules. The parser/sequencer contains an instruc- 
tion processor that fetches and decodes instruc- 
tions in parallel with the execution unit. The 
control processor performs address and integer 
computations and the floating point pipe (FPP) per- 
forms both scalar and vector floating point opera- 
tions. The cache/address translation unit contains 
a 256-slot area page table entry cache and a 
16K-byte memory cache. 

Approximately 25% of the chips in the APU board 
set are in PAL. Since the APU is the first implemen- 
tation of the new CPU architecture, many design 
aspects are subject to change as the architecture 
evolves. PAL devices permit designers to rapidly 
modify hardware to fit the architecture needs. and 
to implement feature and performance enhance- 
ments with minimal impact on the development 
schedule. 

Ability to generate a very large number of custom 
ICs (over 200 different PAL codes are used in the 
APU), significantly reduces the processor’s size 
while increasing overall performance. This means 
that, although the APU and Digital Equipment 
Corp’s VAX-11/750, a gate array-based machine, 
consume exactly the same amount of board space, 
the APU is more than twice as fast. In fact, the 
APU’s Fortran performance is substantially faster 
than that of the VAX-11/780, a machine that con- 
sumes 5.2 times as much board space as the APU. 

Designed as a high speed arithmetic extension to 
the APU execution engine, the APU FPP, unlike 
comparable machines, is an integral part of the 
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internal architecture and not an optional add-on. 
As a result, the FPP not only accelerates scalar and 
vector floating point arithmetic, but also performs 
byte, word, double-word, and quad-word string 
operations. In addition, it serves to enhance the 
performance of important nonfloating point 
instructions such as Procedure Call and Return. A 
total of 79 PAL devices are used for both control- 
and data-path applications on the FPP board. 


Counter Solves Multiplication 
Problem 


The heart of the FPP is the multiplier section 
(Fig 3). Double-precision floating point multiplica- 
tion requires the calculation of a 56-bit x 56-bit 
product. Unfortunately, 56 x 56 parallel multipliers 
do not exist on silicon. From a cost/performance 
standpoint, the best solution is to use a number of 
small multipliers to build an intermediate sized par- 
allel multiplier and then produce a large product 
(56 x 56) in multiple cycles. 

Partial product generator FPP logic uses seven 
Am25S8558 8 x 8 multiplier slices to implement an 8 x 
56-bit multiplication array. Each multiplier chip 
produces a 16-bit product. In general, the 8 most 
significant bits (MSBs) of each partial product gen- 
erator must be added to the 8 least significant bits 
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Fig 4 Carry-save adders use 3-to-2 
counting (echniques to recode any 
three equally weighted bits into a 


2-bit field (a). Sixteen PAL units 
form triple 3-to-2 counters (b) to 
reduce the three multiplication 
operands to two intermediate 
results. 
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(LSBs) of the next higher slice to generate the full 
64-bit partial product. Exceptions are the 8 MSBs 
and LSBs. 

This technique also requires the ability to accu- 
mulate partial products with the partial products 
from previous cycles. Thus, each cycle must be 
accompanied by two additions: the partial product 
summation and the intermediate product accumu- 
lation. While this can be done by following the 
multipliers with two levels of lookahead adders, 
usually 74S181s, the resulting nanocycle time is 
approximately three times longer than the partial 
product generation time of the 8 x 8 multipliers. 
Modifying this scheme, however, by adding reg- 
isters between each level of logic, the pipeline 
multiplier reduces the nanocycle time to near the 
propagation delay time of the multiplier chips plus 
the clock to output time of the multiplier register 
plus the setup time of the intermediate result reg- 
ister. This scheme has two disadvantages: increased 
pipe latency, caused by the two extra levels of 
pipelining, and a high parts count. 

Still another technique involves replacing one 
level of the pipe and one level of lookahead adders 
with carry-save adders between the partial product 
generators and the pipeline registers. Carry-save 
adders are used to implement a technique that is 
called 3-to-2 counting. As seen in Fig 4(a), any 
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Without adding pipe latency, the APU 
is able to accumulate partial products 
ata rate of 8x 56 bits every 112 ns. 





combination of 3 equally weighted bits can be 
recoded into a 2-bit field. 

This makes it possible to reduce the three oper- 
ands generated by the multiplication process (high 
and low partial products and 64-bit intermediate 
product) into only two operands which may then 
be summed together in a single lookahead ALU. 
The 3-to-2 recoding requires no-carry propagate 
logic and is therefore very fast. Only one level of 
pipelining is required because of the 3-to-2 
counter’s speed, resulting in both a reduced parts 
count and a reduced pipe latency. 

In the APU floating point engine, 16 AMPALI6Ré6s, 
programmed as triple 3-to-2 counters, are used to 
reduce the three multiplication operands to two 
intermediate results [Fig 4(b)]. The registered PAL 
Outputs are connected to the input buses of the 
Mantissa ALU that is also used for floating point 
addition and subtraction. The Mantissa ALU then 
calculates the next intermediate product in parallel 
with the partial products calculations occurring in 
the 8 x 8 multipliers. This intermediate product and 
the new partial products are recoded by the 3-to-2 
counter PAL devices to form the next pair of inter- 
mediate results. This process continues until the 
complete 56 x 56 product is generated. Thus, with- 
out adding pipe latency, the APU is able to accumu- 
late partial products at a rate of 8 x 56 bits every 
112 ns, which coincides with the basic nanocycle 
machine time. 


Barrel Shift, a 3-Level 


implementation 

The APU’s barrel shifter performs left shift, right 
shift, and rotate operations of 0 to 63 bits in a 
single microcycle. Used mainly for floating point 
prescale and normalize operations, the barrel 
shifter (Fig 5) is implemented in three stages: the 
word rotater, nibble shifter, and bit shift and mask 
logic. It is controlled by associated prescale, 
leading zero detect, and mask control logic. 

Prescale logic converts the signed difference pro- 
duced by the exponent arithmetic units based on 
the comparisons of the two operand exponents, 
into an absolute shift distance. This shift distance 
is then used to right shift (prescale) the smaller 
operand Mantissa of a floating point add or sub- 
tract operation. The leading zero detect logic deter- 
mines the left shift distance required to produce a 
left-justified (normalized) result. Mask. control 
logic converts rotated data to shifted data by mask- 
ing off the appropriate leading or trailing bits to 





implement right or left shifts. These three sections 
are implemented in PAL. 

Implementing the 3-level, 64-bit barrel shifter in 
MSI requires the use of Am25S10 4-bit shifters. The 
first level is the word rotater which performs a cir- 
cular rotate of 0, 16, 32 or 48 bits. Although imple- 
mentation is simple, the MSI solution requires 16 
packages. The second level is the nibble shifter, 
essentially identical to the word rotater but wired 
to rotate 0, 4, 8, or 12 bits. The final barrel shifter 
stage requires not only bit rotate but also leading 
and trailing bit masking and sticky bit computation 
(ie, the logical OR of the masked-out bits). An MSI 
solution requires not only the 16 packages of 
AM25S10s, used in each preceding level, but also 16 
AND gate packages for masking, with another 16 
AND packages for the sticky bit computation. Con- 
trol logic for the mask operation requires as much 
logic as the entire shift path. A more practical solu- 
tion consists of building separate left and right 
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Fig 5 Implementation of 64-bit **Nearest Neighbor Shifter’ 
is in three stages: word rotater, nibble shifter, and bit shift and 
mask logic. Control derives from associated prescale, leading 
zero detect, and mask contro! logic. 
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Fig 6 Word rotation, first level of 3-level shifter, consists 
of eight PAL units, each programmed as two 4-bit rotators. 
It performs a circular rotate of 0, 16, 32, or 48 bits. 











Fiz 7 Nibble shifter makes up second level of barrel 
shifter. It is wired to rotate 0, 4. 8, or 12 bits. 


shifters, 48 packages apiece, and not implementing 
a sticky bit at all. 

Implemented in PAL, a 64-bit rotater and shifter 
with sticky bit computation requires considerably 
fewer packages than a unidirectional MSU shifter. 
The word rotater consists of eight identical PAL. 
devices programmed as two 4-bit rotaters/package 
(Fig 6). The nibble shifter requires four Am2SS10s 
and eight PAL. devices programmed as 6-bit wide, 
4-place shifters (Fig 7). The bit shift and mask logic 
requires 16 PAI. devices in the data path and two 
PAL. devices in the control path. (Fig 8). 

To implement the masking function required for 
shifting, a technique called ‘‘Nearest Neighbor 
Shifting’? (U.S. patent pending, Computervision 
Corp) is used. Each shift and mask PAI. device has 
an enable input from one mask control PAL unit. 
In addition, each shift and mask PAL device is also 
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Fig 8 Bit shift and mask logic for barrel shifter requires 16 
PAL. devices in the data path and two PAL devices in the 
contro! path. Each shift and mask PAI. device has an enable 
input from one mask control PAI. device and is connected to 
enable inputs of its left and right neighbors. The 64-bit 
masking operation requires only 16 control lines. 

















SHIFT LESS 
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Fig 9 ‘*Nearest Neighbor’ interconnection is used to 
implement masking function for shifting. A mask control PAL 
device determines if all 4 bits are to be masked; enables from 
the adjacent slice determine if the pat device is at shift 
boundary. 


connected to the enable inputs from its left and 
right-hand neighbors (Fig 9). The mask control 
PAL input determines if all 4 bits from the slice 
should be masked off. Enables from the adjacent 
slice determine if a PAL. unit is at a shift boundary. 
If only one of the neighboring slices is disabled, the 
shift and mask PAL unit masks off from 0 to 3 of 
the bits adjacent to the disabled slice, depending on 
the bit-rotation distance. In this way, the 64-bit 
masking operation can be implemented with only 
16 control lines as opposed to at least 64 for an 
SSI/MSI solution. 

In addition to performing the final shift and 
mask operation, the bit-shifter PAL unit also com- 
putes the logical OR of the masked-out bits at each 
slice position. These outputs are then logically 
ORed to generate a sticky bit. The extra hardware 
required is less than two SSI packages. The entire 
PAL barrel shifter requires 38 devices to implement 
64-bit rotation, left shifting, right shifting, and 
sticky bit accumulation. An MsStI-based left/right 
shifter, without sticky bit computation, requires a 
minimum of 96 parts. In addition, the logic re- 
quired for implementing the prescale and normal- 
ize operations is significantly reduced through the 
use of PAL devices. . 
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Introduction 


The introduction of Programmable Array Logic (PAL®) devices 
by Monolithic Memories in the late 1970s marked the beginning 
of rapid growth for the programmable logic device market. By 
offering fast turn development of user-defined circuit functions 
and significant package count reduction, programmable logic 
devices (PLDs) solved such system design problems as time to 
market, system size and cost, and the cost of making design 
changes. 


First generation PLDs, such as the PAL16L8, PAL16R4, PAL16R6, 
and PAL16R8, offer relatively simple, single-function architec- 
tures. Designers choose either devices with combinatorial out- 
puts, registered outputs, or a combination of both, to solve their 
specific design tasks. 


While simple architectures still offer the fastest operating 
speeds, more universal architectures have emerged among 
second generation PAL devices, which allow more user defini- 
tion of the device configuration. The universal “macrocell” 
approach allows users to reduce the number of different parts in 
a system and enjoy considerable flexibility in defining the final 
PLD architecture. 


Allof these devices enable logic designers to combine the tasks 
of several SSI devices into asingle PLD. System functions in the 
form of small state machines or simple data flow machines are 
possible in a single PLD. But the ability of designers to build 
significant "systems-on-a-chip” has been somewhat hampered 
either by a lack of device flexibility or the inability to make full 
use of the logic power of the PLD. This has kept the benefits of 
PLDs inaccessible to some designers. The purpose of this paper 
is to describe some of the key features of a new PAL device 
which opens up a whole range of possibilities by removing many 
of these limitations. 


The PAL32VX10 


The new PAL32VX10 takes the synchronous logic approach 
used by the PAL22V10. Judicious improvements to the macro- 
cell provide logic power far greater than anything currently 
available in 24 pins. This is accomplished while maintaining a 25- 
ns propagation delay (tpp) with only 180-mA current drain. 


The PAL32VX10 shares some of the features of the PAL22V10, 
such as varied product term distribution, and can emulate any 
PAL22V10 function. It is therefore appropriate to describe the 
individual features of the PAL32VX10 by contrasting them with 
the corresponding more limited options in the PAL22V10. The 
key features to be discussed are: 


1) Dual Feedback 
2) Programmable Flip-Flops 
3) Register Bypass Scheme 


After discussing these features we can look at the overall power 
provided by the integration of these features into a single PLD. 


Adapted from a paper presented at Northcor/86 


Dual Feedback 


The feedback scheme used in the PAL22V10 (Figure 1a) pro- 
vides two sources for the feedback signal, only one of which is 
used at any given time. When the flip-flop is used, the feedback 
signal is taken from the output of the flip-flop itself; when the 
flip-flop is bypassed, the feedback signal comes from the output 
pin, allowing for bidirectional signal flow. 


There are two basic limitations in this approach. The first is the 
fact that registered outputs cannot be made bidirectional. One 
could add a separate feature allowing such an option (Figure 
1b), but such an improvement by itself provides an incremental 
benefit. The second limitation, which is substantia! from the 
standpoint of device utilization, is the fact that if an output is 
needed as an input, the output macrocell for that pin is com- 
pletely wasted. 





(a) As It Currently Exists 





(b) A Possible, But Marginal improvement 
Figure 1. PAL22V10 Feedback Selection Scheme 


The approach taken in the PAL32VX10 is to provide two com- 
pletely independent feedback lines: one from the flip-flop, and 
one from the output pin (Figure 2). That this solves the first 
limitation is obvious; the output can always be disabled and used 
as an input without interfering with the signal being fed back 
from the flip-flop. 
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Figure 2. PAL32VX10 Dual Feedback Scheme 


But the benefits go far beyond that. Using the output pin per- 
manently as an input does not require that the macrocell be 
wasted: the flip-flop can be "buried”, allowing internal state 
machines to be built. This removes the second limitation of the 
PAL22V10 scheme. Alternatively, the macrocell can be used as 
an input register by feeding the input into the flip-flop, and then 
using the flip-flop feedback to drive other logic in the device. 
This same approach can also be used when the pin is used as an 
output, allowing the flip-flop to be loaded directly from its output 
pin. 

Thus the thirty-two inputs to the array consist of twelve dedi- 
cated inputs, ten output pin feedback inputs, and ten flip-flop 
feedback inputs. 


Programmable Flip-Flops 


PAL devices have traditionally been equipped with D-type flip- 
flops for data synchronization and state machines. D-type flip- 
flops work quite well for many state machines, but in many other 
cases, J-K or T-type flip-flops would allow for implementation of 
more complex state machines. Yet offering alternative products 
with other dedicated types of flip-flops would unnecessarily 
overpopulate the product family, complicating the designer's 
device selection and inventory tasks. 


The ideal solution is to have a single device which can have each 
flip-flop individually programmed to define its type. And, as will 
be seen, it is sometimes beneficial to be able to change types “on 
the fly”. These capabilities have been provided in the PAL32VX10. 


T-type flip-flops 

Asan example, the classic application involving T-type flip-flops 
is in the design of binary counters. When a D-type flip-flop is 
used, the Boolean equations have to account for each time the 
device output is to be a logic 1. This means that the nth bit of the 
counter requires n product terms: 


Qnt := /Qn*Qn-1*...*Q1 
+ Qn*/Q1 
+ Qn*/Q2 


+ Qn*/Qn-1 


By using a T-type flip-flop, one only has to determine when the 
flip-flop output should toggle. The nth bit toggles only when all 
lower order bits are “1”. This can be expressed as 


“Qn should HOLD UNLESS all lower order bits are HI.” 


The UNLESS operator can be implemented by an Exclusive-OR 
(XOR) gate: 


Qnt+ := Qn :t: (Qn-1*Qn-2*...*Q1) 


This requires two product terms (one on each input of the the 
XOR gate), regardless of which bit of the counter is being 
defined. 


So by providing an XOR gate with Q fed back to one input, the 
combination XOR gate/D-type flip-flop acts as a T-type flip-flop. 
Asum of products on the other XOR input provides the Toggle 
signal. 


J-K flip-flops 
The classic formula for a J-K flip-flop is 


Q+ = S*/Q + /K*Q. 


This expression can be directly implemented in a D-type flip- 
flop, providing /K fits within the available product terms. If this is 
not the case, or if finding /K is inconvenient, another formulation 
is needed. We know that by the definition of a J-K flip-flop, 


Q should HOLD UNLESS: 
the output is O and we SET to 1; 
the output is 1 and we RESET to 0; 
we are toggling (JK = 11). 


The above conditions for “not holding” can be plugged into a 
Karnaugh map and minimized to give the following expression: 


Q+ = Q:+: (U*/Q + K*Q). 


Here, with the help of the XOR gate, a J-K flip-flop is imple- 
mented directly in terms of J and K, instead of /K. Again, this is 
done by feeding Q to one input of the XOR gate, and by ORing 
several product terms into the other XOR input. These terms are 
divided between J terms and K terms; J terms are ANDed with 
/Q, while K terms are ANDed with Q. 


So we can implement J-K, T, and D-type flip-flops by using the 
XOR gate in combination with a D-type flip-flop. By setting one 
XOR input to Q,/Q, 1, or0, we can implement J-K, /J-/K, J-/K, or 
/J-K flip-flops. The other XOR input provides a sum of products 
for J and K. Of course, since S-R flip-flops are a subset of J-K 
flip-flops, they can be implemented in the same fashion. 


Reconfigurable flip-flops 

This is a step in the right direction, but one can do more. 
Consider the design of a loadable counter. This is a situation 
where a T-type flip-flop is preferred when counting, but a D-type 
is better when loading. A T-type configuration can be converted 
to D-type, however, if the LOAD control input is used to disable 
the XOR input containing Q. 


Thus by making the XOR input a product term, instead of a 
dedicated single input, the flip-flop type can be changed on the 
fly to allow each particular function to be implemented as effi- 
ciently as possible. Note that PLDs which provide programma- 
ble flip-flops without the XOR control product term cannot have 
their flip-flops altered once configured. So, for instance, adding 
a LOAD feature to a counter then requires twice as many extra 
product terms as required with a reconfigurable flip-flop. 


All of these considerations have been accommodated in the 
PAL32VX10 (see Figure 3). The single XOR product term pro- 
vides control of the flip-flop type, for either permanent or recon- 
figurable flip-flops. From eight to sixteen product terms are 
available as D or T inputs, or to be divided between J and K in- 
puts. Some of the possible configurations are shown in Figure 4. 





2-720 


a' Monolithic rai Memories “4 


New PAL® Device Architecture Extends Design Flexibility 





Figure 3. The PAL32VX10 Programmable Flip-Flop 





T Register with I/O Buried Register with Dedicated Input 


Figure 4. A Few Selected Flip-Flop Configurations 
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Register Bypass 


The two primary second-generation PLDs, the PAL20RA10 and 
the PAL22V10, have different ways of bypassing their flip-flops. 
The PAL22V10 has a single fuse which configures the output 
permanently (see Figure 5a). Once the configuration is chosen, it 
cannot be changed. On the other hand, the PAL20RA10 bypasses 
its flip-flop by activating both SET and RESET simultaneously 
(see Figure 5b). This can be done dynamically, but since the SET 
and RESET affect the contents of the flip-flop, one cannot, for 
instance, clock data into the flip-flop, change to combinatorial, 
and then change back to look at the flip-flop; the data in the 
flip-flop will have been lost. 


The PAL32VX10 eliminates this restriction by using a product 
term to control the bypass multiplexer (see Figure 5c). While the 
output is combinatorial, the flip-flop is unaffected. This makes it 
possible to use the flip-flop in macrocells that are sometimes 
combinatorial. 


(a) The PAL22V10 Scheme 





(b) The PAL20RA10 Scheme 


(c) The PAL32VX10 Scheme 


Figure 5. Register Bypass Schemes 


The Complete Macrocell 


This has not been an exhaustive description of all of the features 
of the PAL32VX10 macrocell, a summary of which is shown in 
Figure 6. This paper has.merely highlighted the more revolu- 
tionary capabilities. The complete macrocell is shown in Figure 
7. All of the elements work together to offer unparalleled flexibil- 
ity, without compromising the performance of the device. 


An application will serve to illustrate the power afforded by all of 
these features operating in concert. Itis beyond the scope of this 
paper to describe the details; a brief description will suffice here. 


Programmable registered or combinatorial outputs 


Dual independent feedback paths allow I/O with combinatorial 
or feedback register outputs 


Programmable flip-flops allow J-K, S-R, T or D types 
Programmable output polarity 


Register set and register reset can be asynchronous or 
synchronous 


Automatic register preset on power up 
Varied product term distribution 
Up to 16 product terms per output 


Figure 6. Features of the PAL32VX10 





Figure 7. The Complete PAL32VX10 Macrocell 


A PDP-11 Floating-Point 


Coprocessor Controller 

This device monitors a PDP-11 bus for instructions that should 
be sent to a coprocessor. As shown in Figure 9, it provides the 
PDP instruction decode, an instruction tracking state machine 
which provides various wait states for operand fetching, and 
coprocessor instruction encoding. The 3-bit state machine is 
not needed externally; thus it is buried, and the output pins 
provide additional inputs. Reset capability and a test feature for 
examining the buried flip-flops are included. 


This application takes advantage of the varied product term 
distribution, the buried flip-flops, and J-K, T, and D-type flip- 
flops. To implement the same application without a PAL32VX10 
would require: 


1 PAL22V10 
1 PAL16R4 
1 PAL14H4 


Of course, since we are talking about replacing several PAL 
devices with another PAL device, the number of discrete TTL 
chips that are being replaced is very large indeed. 
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Summary 


The PAL32VX10 provides significantly increased flexibility as a 
result of a few carefully placed improvements. The addition of 
INSTRUCTION bbe pct i ee another feedback line, an XOR gate, and a few other enhance- 
tibet STATE INSTRUCTION ments allows full utilization of the device. The utilization can be 
MACHINE ENCODE further optimized through the ability to represent the desired 

functions in the most compact form possible. These capabilities 


combine to form the most versatile 24-pin PLD available to date. 


COPROCESSOR CONTROLLER 


PAL32VX10 





Figure 8. Coprocessor Controller 
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Introduction 


With increasing density and functionality, programmable logic 
devices are being used for such specialized applications as 
state machines. The PROSE (PROgrammable SEquencer) 
PMS14R21 device is the first device developed for state ma- 
chine design which integrates the hardware and software as- 
pects into a new design methodology. On a single device, the 
PROSE architecture combines the advantages of both PROM 
and PAL elements for implementing internal state sequencing 
functions, and provides a powerful device for designing state 
machines. These architectural features also lend themselves 
easily to the requirements of software design tools (such as 
PALASM®2 software) for providing an easy-to-use, high-level 
state machine design syntax. 


The PROSE design methodology offers high functionality and 
ease of use along with the traditional advantages of program- 
mable logic devices, such as instant customizability, fast turn- 
around time, low development cost and reduced real estate 
requirements. 


State Machine 


A state machine is defined as a digital logic system which tra- 
verses through a sequence of states in an orderly fashion. The 
state is a set of values measured at various points in the digital 
system. This state value is typically stored in a set of storage 
registers. A simple model of a state machine is shown in Figure 1. 
It consists of three basic elements: storage registers, next 
state decoder, and output decoder. 


OUTPUT 
DECODER OUTPUTS 














INPUTS NEXT STORAGE 
STRIE REGISTERS 
DECODER STATE 
OF THE 
MACHINE 


Figure 1. Genera! Mode! of a State Machine 


The storage registers perform the functions of storing present 
state of the state machine as well as the present output infor- 
mation (in which case it is also referred to as the pipeline regis- 
ter). The next state decoder decodes the present state of the 
machine along with the present input conditions and generates 
the next state of the machine. Typically it is combinatorial logic 
in PAL/PLA-based state machines and is a PROM in microcoda- 
ble sequential state machines. 


The output decoder determines the outputs of the machine from 
the present state of the machine and the input conditions. It is 
not mandatory to have an output decoder as the outputs can 
also be generated directly by the registers. 





Adapted from a paper presented at Electro/87. 
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Sequential state machines are usually classified into two 
types depending upon their output generation. In Mealy type 
state machines, outputs are a function of the present state 
and input conditions. The output decoder takes the present 
state and input conditions, decodes them and generates the 
outputs. In Moore type state machines, outputs are strictly a 
function of the state of the machine. Thus, the Moore machine 
usually requires a simpler output decoder to decode the 
present state only, or none at all in a case where the storage 
registers can be used to provide these outputs directly. 


As transitions to the next state are dependent upon the 
present state and input conditions, the operation of a Mealy 
machine can be emulated by a Moore machine, provided that 
one cycle delay is allowed between input conditions and output 
generation. 


State Machine Representation 


The first step in the design methodology requires converting 
the design problem into its state diagram representation. The 
state diagram shown in Figure 2 consists of state identifiers, 
input conditions and the outputs generated. The states are 
represented by bubbles, with arrows indicating transitions to 
other states. The input conditions as well as outputs gene- 
rated are associated with these transitions. 


INPUTS 
OUTPUTS 





aU 


INPUTS 
OUTPUTS 





INPUTS 
OUTPUTS 








OD CD 
OUTPUTS INPUTS 


OUTPUTS 
Figure 2. A Typical State Diagram 


There are two major operational requirements for state ma- 
chines as shown in Figure 3. 


e Control sequencing 
e Output generation 


Control sequencing is changing from one state to another in a 
state machine. If the transition from the present state to the 
next state is dependent upon the present state, it is direct 
control sequencing. If the transition from the present state to 
the next state is dependent upon the present state and the 
input conditions, it is called conditional control sequencing. 
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As mentioned earlier, the output generation can be based upon 
the present state for a Moore type of state machine, or the 
present state and inputs for a Mealy type of state machine. 
The state diagram representation includes both the control 
sequencing and output generation representations. 
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INSTRUCTION OUTPUTS 
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SEQUENCING 
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GENERATION 
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Figure 3. Instruction Execution in State Machines 


The design process involves converting the hardware problem 
to this state machine representation. The system design is 
analyzed for its signal logic and timing requirements. The out- 
puts generated are based upon the system signal logic re- 
quirements, and the state transitions depend upon the system 
timing requirements of these signals. 


Traditional Microprogrammable 
Sequencers 


Traditional high-end microprogrammable sequencers fit this 
general model of state machines. They use simple clocked 
D-type registers as pipeline registers as shown in Figure 4. 
Typically they do not have an output decoder and allow only 
Moore machine implementations. 


TEST 
igi 


Es faa 
Es ournen omen fous ADDRESS 


INSTRUCTION] NEXT 
OP-cODE |stTaTe| CUTPUTS 
fo | 


INSTRUCTION 





DECODER 





PIPELINE 
REGISTERS 


Figure 4. Traditional Microprogrammable Sequencer 






For the next state decoder, microprogrammable sequencers 
use a PROM array. Each location of the PROM functions as a 
state. PROMs have been traditionally used for this purpose 
because of their familiarity and relative ease of use. This 
PROM is addressed by the pipeline register field storing the 
present state. The PROM data bits are inputs to the pipeline 
register which are loaded at every clock cycle. The PROM data 
consists of the next state (branch), output logic values, and 
an instruction op-code. 


Output generation in microprogrammable sequencers is the di- 
rect transfer of PROM data to the output field of the pipeline 
register on every clock cycle. 


For direct control sequencing, next state values from the PROM 
are loaded into the pipeline registers at every clock cycle, rep- 
resenting a state transition. In most microprogrammable se- 
quencers, a binary counting mechanism is also added to the 
state registers. This allows the PROM to be addressed with a 
state counter. This is effectively control sequencing with serial 
binary state numbers. The PROM next state value is used only 
when a non-serial branch is required. A branch select muitiplex- 
er is used to select between this serial count and the branch for 
the next state of the machine. 


For conditional contro! sequencing an input condition testing 
multiplexer is used. Usually it tests for one of the inputs at logic 
high or low. Based on this input condition, the branch select 
multiplexer is controlled, providing conditional branching. 


The instruction op-code stored in the PROM for every state 
controls the sequencer operation for that state. It decides be- 
tween direct and conditional control sequencing. It also decides 
the input condition to be tested. These instruction op-codes are 
stored in the data form and are decoded just before execution. 
Extra hardware is required to decode these instruction op- 
codes and generate internal multiplexer control signals. This 
slows down the device operation by the time required to decode 
the instructions. 


The combination of instruction op-codes, output, and next state 
values is called the instruction microcode. The design process 
requires writing instruction microcode based upon the state 
diagram. This approach to a microprogrammable sequencer, 
though popular, imposes several restrictions on the state ma- 
chine operation as the instruction op-codes are predefined and 
fixed. These fixed op-codes can only perform certain opera- 
tions and the state machine design is usually modified to fit 
those restrictions. This means that the sequencer operation is 
usually not optimal for the system design problem. 


For example, condition testing, which is restricted to one input 
testing only, may not fit a design requiring simultaneous testing 
of two signals. The branching is also restricted to a two-way 
branch. For multi-way branching, a microprogrammable 
sequencer requires multiple cycles which could disrupt the 
system signal timing requirements. In some devices further 
dedicated hardware is added to improve the functionality of the 
sequencer. Such hardware is usually under the control of in- 
struction op-code and provides features such as testing more 
than one input or branching to more than one next state. 
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Figure 5. A Block Diagram for the PROSE Device 


PROSE Device Architecture 


The PROSE device architecture is very similar to this model of a 
state machine. Its architecture (Figure 5) is optimized to per- 
form the functions of a sequential state machine as represent- 
ed in the model. For the storage registers, the PROSE device 
provides a set of twenty-one pipeline registers. These registers 
are used to store the present state and the outputs generated 
by the machine. This is similar to the microprogrammable 
sequencers. Beyond this however, the similarity ends. The 
architecture of the PROSE device has been defined specially to 
optimize its functionality. 


The next state decoder in the PROSE device consists of a 
PROM. This PROM stores the next state, outputs and branch 
control information. The output generation is similar to micro- 
programmable sequencers, and is a direct transfer of PROM 
data to the output field of the pipeline register. There are eight 
outputs in the PROSE PMS14R21 device. 


Control Sequencing 


Direct control sequencing in the PROSE device is implemented 
by the PROM. The depth of the PROM is 128, allowing up to 128 
states. Seven bits (AQ-A6) of the pipeline register’s next state 
field are used to address the PROM to provide the state transi- 
tion capability. For direct control sequencing the next state val- 
ues from the PROM are loaded into the pipeline register's next 
state field, for a state transition. 


Conditional Control Sequencing 


For conditional control sequencing, a combinatorial PAL array is 
provided. All of the eight conditional inputs 10-17 are direct in- 
puts to this PAL array. This PAL array provides two major ad- 
vantages over microprogrammable sequencers: 


e It allows selection of a combination of input signals for condi- 
tion testing. This is very useful for many system designs which 
require testing of a combination of two or more signals at partic- 
ular logic levels. This capability is not available in most 
microprogrammable sequencers. 


elt allows multiple condition testing for multiple branching. 
Microprogrammable sequencers require more than one cycle 
to implement multiple branching on multiple conditions. A four- 


way branch can be implemented in the PROSE device in a sin- 
gle cycle without disrupting the system timing requirements. 


Conditional control sequencing also requires present state 
information for allowing state-dependent conditional 
branching. The state bits A0-A6 are not used as inputs to the 
PAL array to provide the state information. An easier way to 
provide state information to this PAL array is to use extra bits 
from the pipeline registers and PROM as its inputs. These six 
bits, CSO-CS5, form the condition select field, which allows 
user-definable bit combinations to be used for any state. This 
optimizes the use of the PAL array product terms, since the 
same bit combinations can be used for testing the same input 
conditions for different states. 


Branching 


To implement branching, the two outputs of this PAL array, XO 
and X1, are XORed with the two most significant bits of the 
state address (XF1 and XFOQ) from the pipeline register. This 
gives a capability of inverting state bits AS and A6 depending 
upon the condition testing in the PAL array. This conditional 
change of next state address provides a four-way branch cap- 
ability. 

For direct control sequencing, no product term in the PAL array 
is selected. The programmable array outputs X1 and X0 are at 
logic low level, allowing next state bits A6-A0 to decide the next 
state directly. The conditional select bits, CS5-CS0, should not 
select any product terms from the programmable array for 
states with direct control sequencing. 


For a two-way branch, either X1 or X0 is asserted HIGH on con- 
ditional inputs. This causes a branch to a location where the bit 
value of A6 or A5 respectively is inverted. 


For three-way and four-way branches both X1 and X0 are 
asserted high depending on multiple condition inputs. The se- 
quencer branches to locations where the bit values of A6 and 
A5 are 00, 01, 10, and 11. The PROSE device allows up to a 
four-way branch, which is sufficient for most designs. 


In the PROSE device, the next state decoder has been effec- 
tively partitioned into a PROM and a PAL device, to utilize the 
efficiencies of the two for contro! sequencing and conditional 
testing respectively. 
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Logic Diagram 
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PROSE Device Programming 


Figure 7a shows the state diagram representation of direct con- 
tro! sequencing; Figure 7b shows conditional contro! sequenc- 
ing. Similarly, output generation can be represented as shown 
in Figures 7c and 7d for both Mealy and Moore machines. The 
next step in the PROSE device design process is to convert 
these state diagram representations into a textual entry form. 


A. Direct Control B. Conditional Control 


Sequencing Sequencing 
STATEA := VCC -> STATE B STATEC := 11 -> STATED 
+/1 -> STATEE 





C. Moore Machine 
STATEA.OUTF := O1 


D. Mealy Machine 
STATEA.OUTF := 11 ->01 


Csratea) Ceratea) 
a 
01 
Cerares)) 02 Crates) 


Figure 7. State Machine Operations and PALASM 2 Software 


The instruction/operation mechanism of the PROSE device, 
however, is slightly different from the microprogrammable se- 
quencers. The PROSE device uses PALASM 2 software for pro- 
gramming, which allows easy textual constructs that have a 
direct relationship to the hardware operation. The instruction is 
divided into two distinct segments: one for control sequencing 
and one for output generation. 


Control Sequencing Instruction 
The format for control sequencing instruction is as follows: 
(present_state) 
‘= (condition_1) -> next_state_name_1 
+ (condition_2) -> next_state_name_2 
+ (condition_3) -> next_state_name_3 
+->  default_state_name_4 


Based upon the state diagram, for each present state, transi- 
tions/branches to other states can be entered. These transi- 


tions take place in the PROM of the device. The conditional. 


inputs can also be described in the condition field of the instruc- 


tion syntax. This is the condition testing done in the PAL array 
of the PROSE device. For each present state a maximum of 
four branch states can be provided, which is the device archi- 
tecture limit. The logic values of next states differ from each 
other by the most significant two bits only. PALASM 2 software 
automatically assigns these logic values to the state names 
while minimizing state locations in the PROM. PALASM 2 soft- 
ware also assigns the values to the conditional select field while 
ensuring the optimization of PAL array product terms. 


Output Generation Instruction 
The output generation instruction syntax is as follows: 
(present_state).OUTF 
i= (condition_1) -> (output_value_1) 

+-> (default_output_value) 


The present state is the state where the outputs need to be 
generated. The default output value is the logic level of the 
outputs at the present state which is also the syntax for a 
Moore machine implementation. PALASM 2 software also al- 
lows a Mealy machine implementation using the constructs for 
generation of conditional outputs. As the architecture of the 
PROSE device allows only Moore machine implementation, 
PALASM 2 software automatically converts the Mealy machine 
description to the Moore machine equivalent. 


After the conversion of state diagram to textual input is com- 
pleted, PALASM 2 software combines information about the 
control sequencing and output generation for each present 
state. These constructs have a direct relationship with the 
state diagram representations as well as the PROSE device 
architecture. This allows an integrated design where system 
design requirements represented by the state diagrams have a 
direct relationship with the device hardware operation. 


In the PROSE device, this combination of control sequencing 
and output generation is equivalent to the instruction op-code 
of a microprogrammable sequencer. In microprogrammable se- 
quencers, the instruction op-codes are stored as data, and are 
decoded before execution. PROSE device instructions do not 
require decoding as they are stored in the device in logic form. 
This speeds up the operation of the PROSE device. The cycle 
time of the PROSE device is 25 MHz, which is better than most 
other single-pipelined sequencers. 


Figures 7a, b, c and d also show the direct PALASM 2 software 
text representations of the state diagrams. These text files are 
compiled to generate the JEDEC fuse maps for programming 
the PROSE device. The procedure is identical to that used to 
program any other PLDs. PALASM 2 software also supports 
software simulation of device operation, which helps in design 
debugging before actual programming of the parts. 


The PROSE device incorporates the Diagnostics-On-Chip™ 
(DOC™) testing capability with a serial shadow register. This 
serial shadow register can be used to conduct diagnostics 
such as walking-1 or walking-0 tests. It can also be used for 
testing the PROM array by shifting in a test vector and loading 
it to the pipeline register. The pipeline register data for the next 
clock cycle is then checked for verification. Part of the DOC 
circuitry is also used for programming, but this is transparent 
to the users. 
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Figure 8. DIP Pin Configuration 





Summary 


With continuing advances in technology, new devices are 
being introduced which are tailored to specific system design 
requirements. The PROSE device is one such device, with an 
architecture optimized for state machine design. It combines a 
PROM array and a PAL array on a single device, and utilizes 
the functional advantages of both for control sequencing and 
conditional testing respectively. This combination offers higher 
functionality over the traditional approach of microprogramma- 
ble sequencers for state machines. It also offers a newer 
design methodology where software instructions are split into 
control sequencing and output generation instructions. These 
instructions are directly related to hardware requirements, 
unlike the instruction op-codes of traditional microprogramma- 
ble sequencers, which sometimes require design compromis- 
es. The PROSE device also offers extensive simulation, thus 
offering a complete solution. 
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The PAL16R8-10 Series from Monolithic Memories is the fastest 
TTL programmable logic family, and the ECL PAL devices from 
Monolithic Memories are the fastest programmable logic devices 
of any technology. 


Do these devices simply drop into the same applications as 
previous versions? No! These high-speed PAL devices offer 
several advantages, including speed, that open up whole new 
applications for programmable logic devices. 


PAL16L8-10 


PAL16R8-10 





The World’s Fastest PAL Devices 
PAL16R8-10 Series 


Let’s look at the products under discussion. The PAL16R8-10 
Series is comprised of the four most popular PLDs in the 
industry today; the PAL16L8, PAL16R8, PAL16R6, and PAL16R4. 
The architectures (Figure 1) consist of sixteen inputs and eight 
outputs, with zero to eight flip-flops on the outputs. Although 
very simple compared to some of the latest high-density devices 
from Monolithic Memories, these architectures are very power- 
ful, and handle the majority of today’s PLD applications. 


PAL16R6-10 PAL16R4-10 






/ 


cla 





Figure 1. The Four Devices Comprising the PAL16R8-10 Series. Each Has Eight Product Terms per Output. 


The speed of the PAL16R8-10 Series is 10 nanoseconds (ns) 
maximum from input to output on any combinatorial path. The 
registered paths offer a 10-ns setup time and only 8 ns from 
clock input to data output. This translates into a maximum clock 
frequency (fMAX) of 1/(10 ns + 8 ns), or 55.5 MegaHertz (MHz), if 
external feedback is used from the flip-flop. 





fMAX no feedback; 1/(tsu + th) or 1/(twh + twl) 





It is important to differentiate this number from another common 
fMAX calculation, the maximum flip-flop toggle rate. The toggle 
rate is limited by the greater of the data setup time or the clock 
widths high and low. The toggle rate specification for the 
PAL16R8-10 Series is 62.5 MHz. However, this specification is 
unrealistic since it does provide for feedback. It is a more 
representative path that provides the 55.5-MHz specification 
(Figure 2). CLK 


[ fe 0 


(SECOND CHIP) 


[-—_*w >| tex —+|+—tsu >| 


{MAX external; 1/(tCLK + tsu) 


Figure 2. Alternative Methods of Measuring fMAX. A More Realistic Path Is the Second One, Including External Feedback. 


Adapted from a paper presented at Electro/87. 
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:CL PAL Devices 
he PAL10H20P8 from Monolithic Memories was the first ECL with the same number of inputs and outputs, but having the 
‘AL device. This device is combinatorial, with twenty inputs and added feature of output latches (Figure 3). 


ight outputs. A sequential device followed, the PAL10H20G8, 














PAL10H20P8 PAL10H20G8 
Figure 3. The ECL PAL Devices. On the Left Is the PAL10H20P8 Combinatorial Device, 
and on the Right Is the PAL10H20G8 Latched Device. 

The PAL10H20P8 and PAL10H20G8 have combinatorial propa- in this graph of speed vs. power, performance characteristics 
gation delays of only 6 ns, faster than any other PLDs. The closer to the lower left are desired. The first PAL devices, 
latched version has a setup time of 4.5 ns and a latch-to-output introduced by Monolithic Memories in the late 70s, were in the 
delay of only 2.5 ns. This provides a maximum frequency of over 35-ns range, and consumed 180 mA. Technology improvements 
142 MHz. allowed the introduction of the first 25-ns devices in 1983. In 

. early 1985 the first 15-ns devices were released, with half-power 
History of Speed Developments versions at 25 ns and quarter-power versions at 35 ns. Last year 
A short history of speed improvements in bipolar PAL devices Monolithic Memories introduced its industry-leading 10-ns 
shows how these devices compare to earlier versions (Figure 4). PAL16R8-10 Series, along with the ECL devices. 


PAL16R8A-4 SERIES 
55/50 


PAL16R8B-4 SERIES 
35/55 


PAL16R8A-2 SERIES 
50 35/90 


PAL16R8B-2 SERIES 
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35, 
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Figure 4. Speed vs. Power for Monolithic Memories Bipolar PAL Families. The Year of Introduction Is Noted. 
The PAL16R8-10 Series and the ECL PAL Devices Offer the Highest Performance in the Industry. 
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New High-Performance Applications 


The raw speed of these new devices allows their use in new 
high-performance applications. As microprocessors reach 20 
MHz and beyond, 50 MHz logic is required for peripheral and 
memory interface. Additionally, high-speed static RAMs 
require high speeds in the select logic. 


One application area for these high-speed PAL devices is high- 
resolution graphics, including counters for screen refresh, 
generation of line frame and video information, and micropro- 
cessor interface. High-speed digital processing requires 10-ns 
PAL devices for data pipeline applications and on-the-fly 
instruction decoding. High-speed minicomputers require very- 
high-speed state machines, and 55.5-MHz PAL devices are an 
efficient solution. 


Speeds Up Fastest Discrete Designs 


PAL devices have always been used to replace discrete logic, 
thereby reducing chip count, inventory, and board area; 






HHO 
poe Bass 





simplifying board design and layout; and adding extensi\ 
design flexibility. Now PAL device speed has broken a 
important barrier; the PAL16R8-10 Series allows PAL device 
to speed up a discrete logic design, even one built with th 
fastest available devices. 


Comparison to FAST Logic 


The following example compares the PAL16R8-10 Series tc 
FAST logic in a combinatorial discrete logic design. The 
design is an interface between a MC6800 microprocessor anc 
memory elements. Debouncing for the reset switch is alsc 
required. 


The discrete logic implementation is shown in Figure 5. Eight 
discrete chips are required. However, since FAST logic does 
not include 8-input AND gates, three additional chips will be 
required, making a total of eleven chips using FAST logic. The 
worst-case path delay is 27.6 ns. 





Figure 5. Memory Interface Built with Discrete Logic. Eleven 
FAST Logic Chips Are Required. The Worst-Case 
Path Delay Is 27.6 ns. 
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Figure 6 shows the same design implemented in a PAL16L8-10. 
One device replaces eleven chips, and the new worst-case path 
delay is now only 20 ns. 
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Figure 6. Same Memory Interface Implemented in a PAL 
Device. Only One Chip Is Required, and the Delay 
Is Only 20 ns Maximum. 


An example of a sequential circuit (Figure 7) is a simple dice design requires seven chips. The worst-case path for FAST logic 
game, providing two die readouts on LEDs. The discrete logic has a maximum frequency of 27.5 MHz. 





Figure 7. Simple Dice Game Constructed from Discrete 
Gates. Seven Chips Are Required, with a 
Maximum Frequency of 27.5 MHz. 
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The entire design can be implemented in one PAL16R8-10 
(Figure 8). The worst-case path has a maximum frequency of 
55.5 MHz, over twice as fast as the discrete logic version. In 
addition, power requirements remain almost the same: almost 
170 milliAmps (mA) for the FAST version, and 180 mA maximum 
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for the PAL device. A half-power version of the PAL device will 
require 47% less power than FAST logic while still providing 
higher speed. Of course, the greatest benefit is a 7:1 chip count 
reduction, reducing size and cost while increasing reliability. 


ROLL +5V 


Figure 8. The Same Dice Game Implemented in a 
PAL16R8-10. Only One Chip Is Required, and the 
Speed Is Doubled, with Little increase in Power 


Requirements. 


The PAL 16R8-10 Series devices can also be compared to FAST 
MSI functions on a one-to-one basis. For example, the clock-to- 
output time of a standard 74F374 octal register is 10 ns, vs. only 
8 ns for the PAL16R8-10. 


Astraightforward combinatorial example is a simple 3:8 decoder 
with three-state outputs, the 74F538. This device has a maximum 
propagation delay of 17 ns, while the PAL16L8-10 can implement 
the same function at only 10 ns, 41% faster. 


Of course, PAL devices add the additional capability of doing 
custom variations of these functions, if needed. For example, 
with the substitution of a registered PAL16R8-10 in the 74F538 
example, control storage functions such as load and clear can 
be added. 


Even faster discrete logic families have been announced that 
improve upon the speeds of FAST logic. Many of these families 
even use CMOS technology. But the fastest devices announced 
are at 100 MHz, equivaient to the combinatorial speed of the 
PAL16R8-10 Series. And the PAL16R8-10 Series is available 
now, providing thousands of functions through its programma- 
bility. We saw in the memory interface example that the high- 
performance logic families are not as complete as earlier 
families. PAL devices can replace this logic, or complement it 
with additional functions. 


We have shown that the PAL16R8-10 Series creates a new 
application area for programmable logic: the design of higher 
performance solutions than any discrete logic implementations. 
Speed-critical discrete logic designs are now open to replace- 
ment by programmable logic. 
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Elimination of PC Board Noise 


This capability may be arriving just in the nick of time. Discrete 
logic is today achieving speeds that challenge printed circuit 
board layout. The fast switching speeds can create noise and 
crosstalk that can bring the high-performance board crashing 
down. Note the new high-speed CMOS logic families that 
require new pinouts and extra grounds because of switching 
noise. 


Integrating this logic into programmable logic eliminates not 
just discrete chips but all those noisy and troublesome connec- 
tions between them. Putting the PC traces into silicon eliminates 
crosstalk and signal distortions caused by transmission line 
reflections, and improves the reliability of the board. In high- 
speed systems, the length of the interconnect can approach the 
signal wavelength, causing ringing in the connection tine. This 
ringing can cause a false signal value to appear on an input tied 
to the ringing line, or can even be fed back into the source device 
if feedback is used from the output. 


Improved Speed 


The reduction in interconnect afforded by PAL devices, with the 
accompanying reduction in inductance and time delays, even 
increases system speed. In high-speed logic systems, each 
12-inch length of interconnect wiring introduces a time delay of 
approximately 2 ns, about an entire gate delay. These delays 
slow down the system and can cause glitches and signal errors. 
Thus, integration of high-speed togic into PAL devices improves 
not just reliability, but also speed. 


Technology Advantages 


Two other advantages of the PAL16R8-10 Series, lower cost and 
reduced signal noise, are a result of the technology used to 
achieve its high speed. Speed improvements in semiconductor 
devices come about primarily through size reductions and 
reduced capacitance. Capacitance, since it stores charge, slows 
down devices as they have to charge and discharge capacitive 
elements. 


The 25-ns devices were achieved through the reduction of 
lateral! geometries in ajunction-isolated process. This produced 
lower collector-base capacitance and therefore provided higher speed. 
The 15-ns devices used the same junction-isolated process, but 
a reduced epitaxial layer produced the higher speeds. 


Oxide-lIsolated Process 


For the 10-ns devices, Monolithic Memories used a completely 
new oxide-isolated process to increase the speed to record 
levels. Oxide-isolation allows the transistors to be placed closer 
together (Figure 9). This greatly reduces lateral geometries 
while reducing the parasitic capacitance between circuit ele- 
ments, providing much higher speed. 


BURIED 
LAYER 


BURIED 
LAYER 









— 
um 


Figure 9. Basic Transistor fsolation Structure in Oxide- 
Isolated Process. A Fully Recessed Oxide Region 
Separates Transistors, Allowing Smaller 
Geometries and Higher Speed. 


The process utilizes a walled-base technology. It features 1.5- 
micron design rules and dual-layer metallization. The isolation 
technology uses a combination of high-pressure oxidation to 
form a fully-recessed oxide through the epitaxial layer, and a 
diffused guard band. This technology avoids any contact 
between the channel! stop and the buried layer. 


Unique Needs of Programmable Logic 


Programmable logic devices have unique process needs because 
of the high voltages incurred during programming. The oxide- 
isolation technique is especially useful because of its excellent 
insensitivity to substrate injection, which occurs when small 
transistors are driven into hard saturation during programming. 


Lower Manufacturing Costs 


Another requirement for programmable products is a good 
high-current beta so that emitter size can be minimized. The 
NPN transistors in this process have been measured to have a 
beta of higher than 20 at an emitter current of 30 mA. In addition, 
areduction in the required fusing current allowed smaller transis- 
tors in the programming array. Separation of the current sources 
for programming and normal operation improves the speed- 
power product. These features add reliability to the product 
while reducing die size. 


High Yield 


The walled diffusions improve yield by reducing manufacturing 
tolerance requirements. In addition, the process is insensitive to 
threshold problems. The result of these techniques is a high- 
yielding, reliable process that produces both the 10-ns TTL PAL 
devices and the 6-ns ECL PAL devices. 


Smaller die size and high yields result in a much lower cost of 
manufacturing. Thus, once the process is mature, it can produce 
faster products at the same or less cost than earlier technologies. 
This can be seen with the 15-ns devices, which now are cost- 
competitive with any other technology, including CMOS. 


Many users of ICs claim that it is impossible to find products that 
combine three elusive qualities: good, fast, and cheap. The 
advanced technology used in the high-speed PAL products 
allows them to fit’all three requirements well. 
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Reduced Noise 


The process also provides other important advantages, lower 
capacitance and reduced noise. An enhanced base doping 
allows lower resistance without higher capacitance. Oxide- 
isolated transistors provide much lower input and output 
capacitance levels than that provided by earliér technologies. 
The input capacitance is typically only 2 picoFarads (pF), while 
the output capacitance is only 4 pF. 


Elimination of False Logic Switching 


Lower capacitance allows your system to operate with cleaner 
switching signals between devices, providing a much more 
reliable system. This is critical in high-speed systems where fast 
rise and fall times can cause signal fluctuations. 


Guard rings limit the susceptibility of the inputs and outputs to 
voltage undershoots and overshoots. Bipolar Schottky circuitry 


is designed with nonlinear input and output impedances. | 
high-speed systems, undershoots in particular can play havo 
with false logic states, if the guard rings do not prevent them. 


New Synchronization Applications 


Another key advantage of the PAL16R8-10 Series, only indirect! 
related to speed, is its excellent resistance to metastability 
Metastability is a failure that occurs when required minimun 
setup times are violated. The PAL16R8-10 Series is less suscep 
tible to metastability, and recovers faster, than any other PLC 
and most other forms of logic. This subject is discussed in detai 
in Session 3 of Electro/87 on Metastability. 


Metastability characteristics are explicitly specified in the 
PAL16R8-10 Series datasheet (Figure 10). For example, at 21- 
MHz clock frequency and 7-MHz data input frequency, a 
metastability failure will occur only once in ten years. 


Metastability Characteristics Over Operating Conditions 
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Figure 10. Guaranteed Metastability Characteristics for the 
PAL16R8-10 Series. The High Resistance to 
Metastability Allows New Use in Synchronization 


Applications. 


Thus, a PAL16R8-10 Series device can be used as a synchronizer 
circuit, a new application that doesn’t even require its program- 
mability. Almost every digital system must at some point handle 
incoming signals that are not synchronized to the system clock 
rate, and any such system will encounter metastability. Using 
the PAL16R8-10 Series devices at the interface provides the 
lowest possible failure rate. 


Lower Metastability Means Higher 
Speed 


For applications using the PAL16R8-10 Series as a synchronizer, 
the lower metastability rate means higher clock speed. For a 
given desired MTBF, the maximum clock rate is limited by the 
susceptibility to metastability failure. The reduced metastability 
rate for the PAL16R8-10 Series allows a higher clock speed, with 
no reduction in MTBF. 


ECL Provides Highest Speed 


The two ECL PAL devices, the PAL1QH20P8 and PAL10H20G8, 
provide the fastest PLDs. 10KH ECL technology also eliminates 


many of the noise problems encountered with high-speed TTL 
boards since the internal switching voltage range is less than 
one volt, and it is often a good alternative to high-speed TTL. 
ECL design techniques achieve high speed by not allowing the 
bipolar transistors to saturate when they are on. Thus, there is no 
storage time delay caused by switching in and out of saturation. 


The elimination of saturated transistors means that current is 
always flowing in ECL devices. This is the cause of high power 
consumption in ECL circuits. However, the ECL PAL devices 
consume only 210 and 225 mA for the PAL1OH20P8 and 
PAL10H20G8, respectively. This is only minimally higher than 
the standard power consumption of TTL programmable logic 
circuits. 


New PLD Applications in ECL Systems 


However, these devices do not just speed up programmable 
logic. These devices are fully 10KH compatible and meant to be 
used in 10KH systems. They open up new applications for 
programmable logic in the highest performance ECL systems. 
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ECL MSI/LSI Functions 


since the ECL market is smaller than the TTL market, fewer 
unctions are available in ECL than TTL. This is true on all levels, 
rom SSI and MSI through VLSI. ECL PAL devices provide a 
lexible solution that can implement functions not found in fixed 
=CL logic. Future ECL PAL devices will provide higher density 
ind be capable of implementing the most complex LSI functions. 


systems are built with ECL technology primarily for high 
yerformance, and ECL PAL devices must also be high perfor- 
nance. Future ECL PAL devices will improve speeds down to 
‘he 3 to 4-ns range and be able to compete effectively with the 
speed of fixed ECL logic. 


The Speed Challenge of CMOS 
Technology 


CMOS technology is quickly encroaching on bipolar speeds. In 
fact, Monolithic Memories has CMOS mask-programmable 
devices (ZHAL™ devices) that are even faster than their bipolar 
equivalents. However, CMOS has not yet reached the best 
bipolar speeds, and is not likely to in the near future. Even if we 
exclude ECL from the comparisons, bipolar has a two-generation 
speed advantage over CMOS technology. 


The fastest CMOS PLDs available today are at 25 ns, while 25-ns 
bipolar devices have been available for years and are currently 
being shipped in huge volumes. 15-ns CMOS is in development, 
but bipolar is already at 10 ns and moving further down the scale. 


Although recent CMOS developments have been rapid, CMOS 
is not expected to pass bipolar speeds, only to get closer. As 
geometries are reduced, CMOS has to contend with short- 
channel effects and hot-electron trapping as scaling increases 
the electric field. Bipolar has fewer problems during reduced 
scaling. 


Speed vs. Density 


Many vendors of high-density CMOS PLDs claim that they have 
surpassed bipolar speeds if examined at a per-gate basis. That 
claim is relevant only if the high densities are required. Even gate 
array applications are relatively low in density, with over half 
below the 2000-gate level. Additionally, the largest application 
area for semicustom arrays is random logic, efficiently met by 
the PAL architecture. And even in those cases where greater 
density than a single PAL device is required, two PAL16R8-10 
Series devices back-to-back have a combined delay of only 
20 ns, still faster than any CMOS PLD of any density. 


Some industry observers have raised an issue of speed vs. 
density, asking which one will win out in the long run. But these 
two attributes do not compete; they are usually required at 
different times in different applications, and thus both areas will 
develop in their own market areas. 


Gate Array Performance 


Gate arrays and other forms of semicustom logic are demon- 
strating very high performance, even in CMOS. The sub- 
nanosecond gate delays of these devices, even when input and 
Output delays are taken into account, cannot be matched by 
programmable logic. For the ultimate in speed, a more custom- 
ized design is better, with full custom offering the highest speed. 


Programmable logic competes with other high-performance 
semicustom solutions in applications where the additional 
advantages of zero NRE, instant customization, ease of design 
changes, inexpensive and easy-to-use software, standardization, 
low cost, second sourcing, and other issues weigh in favor of 
programmable logic. 


Studies of gate arrays show that clock frequencies for systems 
using gate arrays are about equal to microprocessor frequen- 
cies; around 8-12 MHz, moving to 16 MHz. But further studies 
show that 76% of users require at least one very-high-speed 
(twice the clock frequency) combinatorial path through the chip. 


Since programmable logic offers only one speed through the 
chip, the device speed must match the speed of the fastest 
required path. This is one reason why high-speed PAL devices 
are required even if system clock rates are low. 


CMOS PLDs can be erasable, a feature bipolar cannot match. 
Although not erased very often in customer applications, 
erasability adds to the testability of the device during manufac- 
turing. Also, CMOS programmable cells are smaller than bipolar 
fuses, allowing a smaller die size. These advantages of CMOS 
will allow it to dominate, in the long term, the PLD market areas 
that do not require high speed. 


Usable Speed 


Just as semicustom arrays offer a greater percentage of “usable 
gates” than PLDs, or gates used vs. gates provided, PLDs can 
offer more “usable speed” than semicustom arrays. Although 
typical gate speeds in custom devices can be below 1 ns, that 
speed is not relevant if the design and simulation tools do not 
provide the accuracy required to guarantee proper timing. 


Since most of a PAL device is fixed logic, guaranteed maximum 
delays through the device can be specified for any custom 
design. In addition, minimum delays for the PAL16R8-10 Series 
are specified in the datasheet for the first time, allowing accurate 
system timing. Tools such as PLDMaster from Daisy Systems 
provide PAL device simulation in a board-level environment. 
PLDMaster can even help partition designs to take full advan- 
tage of device speeds. 


The PAL device has another advantage over other custom logic 
that provides the ultimate in timing simulation—a device can be 
programmed and immediately dropped into a circuit, to check 
timing. All of these advantages mean that designers can use 
more of the PAL device's speed than that of other custom logic, 
giving a PAL device an edge even at equivalent specs. 
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CMOS Advantages 


CMOS has several advantages over bipolar, depending on the 
technology used. In fact, CMOS technology has opened up 
more new applications for programmable logic than any other 
development, because of several unique characteristics. It can 
be lower in power. can be erasable, and can be snialler. Some of 
these advantages are important in some cases but not important 
in others. 


Low power is an important advantage, but Monolithic Memories 
has been producing half- and quarter-power versions of its 
products in bipolar, simply by increasing the resistor values. Our 
PAL16R8B-4 Series devices at 35 ns and only 55 mA are lower in 
power than many competing CMOS devices at similar speeds. 
The upcoming PAL16R8H-15 Series at 15 ns and 90 mA will 
achieve even greater efficiencies. As top bipolar speeds improve, 
the half-power and quarter-power versions will continue at the 
highest-performance CMOS speed levels. 


CMOS power specifications are often only stated at standby, 
with the device not operating. Operating power for CMOS rises 
dramatically with increasing input frequency and can surpass 
even medium-power bipolar devices (Figure 11). For high-speed 
applications. CMOS has not shown a power advantage over 
bipolar. 
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Figure 11. Power Consumption vs. Operating Frequency for 
CMOS and Low-Power Bipolar PAL Devices. 
CMOS Devices Can Consume as Much Power as 
Bipolar Circuits at High Speeds. Thus, for High- 
Speed Applications, CMOS Loses One of Its Key 
Advantages Over Bipolar. 





Bipolar’s Place in Market 


Bipolar TTL and ECL PAL devices will continue to dominate th 
large part of the PLD market that requires high speed. Bipola 
will even be dominant in the low-cost part of the market for year 
to come. 


Record Speeds and New Applications 


The 10-ns PAL16R8-10 Series and the 6-ns ECL PAL device: 
provide record speeds for programmable logic, using advancec 
process technology. Future plans call! for 7.5-ns TTL and 3 tc 
4-ns ECL PAL devices. The speed and technology open nev 
applications for PAL devices, including: 


Highest-performance computers and digital processing 


High-speed microprocessor and memory support 


Speeding up the fastest discrete logic designs 


Component integration, eliminating noise and delays from 
device interconnect lines 


Process technology advantages of lower cost and reduced 
noise 


Wide range of LSI functions to complement high-speed TTL 
and ECL discrete logic families 


e Synchronization circuits for low metastability rates 


All of these new applications show that these new PAL devices 
are not just faster devices for the same sockets, but rather open 
up whole new applications for programmable logic devices. 
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| PRODUCT DESCRIPTION PAGE 
‘AmPAL18P8 55, 90,180 | 20-pin superset 5-202 
PAL16P8 180 Programmable polarity | 5-17 
‘Am/PAL16L8 Standard 5-26, 5-184 
‘PAL10H8 90 Simple combinatorial 5-56 
‘PAL10L8 20 TTL 90 Simple combinatorial 5-56 
|PAL12H6 20 TTL 35 90 Simple combinatorial 5-56 
PAL12L6 20 Simple combinatorial 5-56 
'PAL14H4 20 Simple combinatorial 5-56 
‘PAL14L4 20 Simple combinatorial 5-56 
; PAL16H2 20 Simple combinatorial 5-56 
PAL16L2 20 Simple combinatorial 5-56 
PAL16C1 Simple combinatorial 
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PMS14Ra21 
PLS167 
PLS168 
PLS105 
Am29PL141 

















TECHNOLOGY 











fayax(MH2) 











PRODUCT PINS TECHNOLOGY DESCRIPTION PAGE 

| AmPAL23S8 20 TTL Buried registers 5-169 
PAL16RA8 20 TTL Asynchronous 5-11 
PAL16RP8 20 TTL Programmable polarity | 5-17 
PAL16RP6 20 TTL Programmable polarity | 5-17 
PAL16RP4 20 TTL Programmable polarity | 5-17 
Am/PAL16R8 20 TTL, E CMOS 58, 40, 28.5, 18, 11} 0.1, 45, 90,180) Standard 5-26, 5-184 
Am/PAL16R6 20 TTL, E CMOS 58, 40, 28.5, 18, 11] 0.1, 45, 90,180) Standard 5-26, 5-184 
Am/PAL16R4 20 TTL, E CMOS 58, 40, 28.5, 18, 11} 0.1, 45, 90,180) Standard 5-26, 5-184 
PAL16X4 20 TTL Arithmetic 5-51 

Sequencers 























30, 25 210 
33 200 
33 200 
37 200 
20 450 


Am2971 24 TTL 85 310 Prog. Event Generator 5-365 





DESCRIPTION 
+ 
PROSE Sequencer 5-315 
Programmable Logic Seq. | 5-331 
Programmable Logic Seq. | 5-331 
Programmable Logic Seq. | 5-331 
Fuse Prog. Controller 5-339 
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PRODUCT 





M2064 
M2018 


(Continued on front cover) 


RAM CMOS 











RAM CMOS _ 


: _ PRODUCT TECHNOLOGY tuax'tep Iec(mA) DESCRIPTION | PAGE 
PALIOH20EV/EG8;} 24 ECL 10KH 125 MHz 220 Registered/latched 5-381 

PAL10020EV/EG8 24 ECL 100K 125 MHz 220 Registered/latched 5-381 

PAL10H20G8 24 ECL 10KH 6ns 225 Latched 5-382 

PAL10H20P8 ECL 10KH 6ns Combinatorial 

















DESCRIPTION 





fryax(MHz) 





70, 50, 33 
50, 33 


Logic Cell Array 
Logic Cell Array 
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